Traditional Neural Networks are overrated.
Transformer is irrelevant when we have a small set of unique examples.
And no, brute-forcing with discrete program search is not the way too.
It's so hilarious how easy, simple and yet elegant the actual approach is.
With
@Harvard
, we built a โvirtual rodentโ powered by AI to help us better understand how the brain controls movement. ๐ง
With deep RL, it learned to operate a biomechanically accurate rat model - allowing us to compare real & virtual neural activity. โ
Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all lifeโs molecules. ๐งฌ
Hereโs how we built it with
@IsomorphicLabs
and what it means for biology. ๐งต
My speculation:
GPT2 is an advanced multi-transformer architecture that combines two transformers (Find and Replace)
The results speak for themselves
This is from paper that was published by an anonymous authors
We believed it was almost impossible but Deepseek did it
Model that performs better than gpt-4-turbo in coding workloads.
I will check out the model to see if it's better for me than gpt-4. 5
DeepSeek-Coder-V2: First Open Source Model Beats GPT4-Turbo in Coding and Math
> Excels in coding and math, beating GPT4-Turbo, Claude3-Opus, Gemini-1.5Pro, Codestral.
> Supports 338 programming languages and 128K context length.
> Fully open-sourced with two sizes: 230B (also
Most people missed this:
MCTS is good for LLMs but not for inference stage where it answers the Question or Query.
MCTS is good because we can generate synthetic data that is grounded grounded in truth.
Now I should refine it, 100x check everything and prepare for an actual submission.
I've tried to see if this architecture would be able to learn features from cat pictures and failed(possible scale issues)
It's superhuman at pattern recognition though.
My hands are shaking lmao.
@HKrassenstein
@realDonaldTrump
This account seems suspicious, howโs she so fast?) Like sheโs literally stalking trump to reply with her 5 cents. Can someone explain why is she stalking him?:)
gpt2-chatbot is very weird
Some speculate it could be Gemini 2
But consistent responses that it's trained by OpenAI for the google model would be a bad look
It's probably a new mid tier model from OpenAI
Not gpt-5 tier for sure
@browserdotsys
unironically testing physical prediction like falling objects with different material properties could be a good benchmark for those models
you tweet anything at all and people respond like oh is this tweet from openai employee will depue a sign agi is happening tomorrow. yes. yes it is. agi is tomorrow. mark your calendars. 7pm tลkyล time. please bring party poppers and your favorite paper clip. byob.
๐ฅ New theory says it's Cu0, not Cu2+, that could make
#LK99
super at room temp. Broad band Mott localization in Cu0 clusters makes singlet spin reservoirs supporting pairing at 100s of K! Competing orders beware!.
#LK99
#LK_99
LK-99 ๐งต
There are 8 possible Pb-Cu isotopic combinations in LK-99, each with slightly different masses and nuclear spins.
The phonon modes and electron-phonon coupling could potentially vary between these isotopic combinations. ๐งต
Introducing Maisa KPU: The next leap in AI reasoning capabilities.
The Knowledge Processing Unit is a Reasoning System for LLMs that leverages all their reasoning power and overcomes their intrinsic limitations.
With OpenAI, Figure 01 can now have full conversations with people
-OpenAI models provide high-level visual and language intelligence
-Figure neural networks deliver fast, low-level, dexterous robot actions
Everything in this video is a neural network:
@basedsarlcagan
@BorisMPower
Solution by GPT4-0314 below. LLama2-70B can also do this, but will refrain from speaking about LLama2-70B until my next thread(coming in June now).
Number of possible variations if White plays first and is to win: 688,712 and only 12 of those are possible checkmates.
Number of
@apples_jimmy
@ylecun
@iamgingertrash
Question: Regarding the upcoming LLaMa 3 400B+ model, will it be open-weight? There are several rumors about this...
Answer: No, it is still planned to be open and that will not change. I don't know where this rumor came from, but it is completely false.
Today we're excited to introduce Devin, the first AI software engineer.
Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork.
Devin is
Exciting news - the latest Arena result are out!
@cohere
's Command R+ has climbed to the 6th spot, matching GPT-4-0314 level by 13K+ human votes! It's undoubtedly the **best** open model on the leaderboard now๐ฅ
Big congrats to
@cohere
's incredible work & valuable contribution
Newly published work from FAIR, Chameleon: Mixed-Modal Early-Fusion Foundation Models.
This research presents a family of early-fusion token-based mixed-modal models capable of understanding & generating images & text in any arbitrary sequence.
Paper โก๏ธ
As part of our focus on developing Meta Llama 3 in a responsible way, weโve created a number of resources to help others use it responsibly as well โ like CyberSec Eval 2.
Research paper โก๏ธ
OpenAI has ASI, but it will take them 50 years to release it. Q* and GPT-7 are achieved internally.
Back to reality where you have to ship not yap.
Not a lot of people know about Anthropic and OpenAI is betting on this until some viral video on tiktok and it'll be over for them.
Introducing Gen-3 Alpha: Runwayโs new base model for video generation.
Gen-3 Alpha can create highly detailed videos with complex scene changes, a wide range of cinematic choices, and detailed art directions.
(1/10)
Introducing the next generation of the Meta Training and Inference Accelerator (MTIA), the next in our family of custom-made silicon, designed for Metaโs AI workloads.
Full details โก๏ธ
using technology to create abundance--intelligence, energy, longevity, whatever--will not solve all problems and will not magically make everyone happy.
but it is an unequivocally great thing to do, and expands our option space.
to me, it feels like a moral imperative.
Exclusive: OpenAI has fired two AI safety researchers for allegedly leaking information, including an ally of chief scientist Ilya Sutskever.
From
@erinkwoo
and
@steph_palazzolo
.
Introducing Udio, an app for music creation and sharing that allows you to generate amazing music in your favorite styles with intuitive and powerful text-prompting.
1/11
Itโs here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs.
Key highlights
โข 8B and 70B parameter openly available pre-trained and fine-tuned models.
โข Trained on more