AI News by Smol AI Profile
AI News by Smol AI

@Smol_AI

5,503
Followers
7
Following
47
Media
189
Statuses

we make big news smol

https://smol.ai
Joined February 2023
Don't wanna be here? Send us removal request.
Pinned Tweet
@Smol_AI
AI News by Smol AI
11 months
AI Discord overwhelm? We gotchu. Coming to smol talk 🔜 (what are the top AI discords we should add? we have @openai @langchainai @nousresearch @Teknium1 @alignment_lab @latentspacepod )
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
6
45
@Smol_AI
AI News by Smol AI
1 month
[6 Sept 2024] Reflection 70B, by Matt from IT Congrats to @mattshumer_ and @csahil28 on training the world's top open model and passing the vibe check! As @polynoamial said today: "1 engineer working in the right direction beats 100 geniuses working in
Tweet media one
1
10
72
@Smol_AI
AI News by Smol AI
28 days
Avoiding "mid" AI output: smol tip from our work today - don't talk to LLMs* like you talk to humans. You are fundamentally still programming with English!! ❌ Tell it what outcome you want, let it figure out how to get there ✅ Tell it what steps to perform, give tokens and
Tweet media one
Tweet media two
2
1
63
@Smol_AI
AI News by Smol AI
4 months
[Jun 10 2024] Talaria, the MLOps superweapon powering Apple Intelligence helps Apple create the set of 20 ~3.5bit LoRAs showcased today, hot-swapped atop of Apple's new ~3b On-Device model to beat Mistral, Gemma, Phi-3 w. 30tok/s speed
Tweet media one
Tweet media two
Tweet media three
Tweet media four
@vboykis
vicki
4 months
👇🍏
1
1
63
0
12
55
@Smol_AI
AI News by Smol AI
23 days
it's notable how predictive the Lmsys Elo vs $ pricing curve is, and how the strategy is panning out. Today's Gemini Pro price cut brings it exactly in line with where a loglinear pricing curve predicts it should be for its Elo. More broadly, the Frontier model race is now back
Tweet media one
@OfficialLoganK
Logan Kilpatrick
23 days
Two new production Gemini models, >2x higher rate limits, >50% price drop on Gemini 1.5 Pro, filters switched to opt-in, updated Flash 8B experimental model, and more. It’s a good day to be a developer : )
176
328
2K
3
8
57
@Smol_AI
AI News by Smol AI
5 months
AINews: 29 May 2024 What if you KNEW that we may soon have models can that continuously process and reason over text/audio/video with a TRILLION token "context window"? Real time? On device? thanks to @cartesia_ai , @krandiash , @_albertgu
Tweet media one
2
6
53
@Smol_AI
AI News by Smol AI
2 months
[28 Aug 2024] Cerebras Inference: Faster, Better, AND Cheaper congrats to @CerebrasSystems for vaulting to the top of the @ArtificialAnlys leaderboard for price and speed at full precision!!!
1
2
51
@Smol_AI
AI News by Smol AI
3 months
[17 July 2024] Mini, Nemo, Turbo, Lite - Smol models go brrr! Vibe check of 4o vs mini for instruction following vs summarization: - we find some examples where mini is worse, some examples where it is better. Mini is subjectively: - unevenly worse at formatting
Tweet media one
Tweet media two
Tweet media three
2
10
50
@Smol_AI
AI News by Smol AI
2 months
here's a concept that works: ai newsletter that tells u when there's no news because we're incentivized and built different
Tweet media one
@ai_for_success
AshutoshShrivastava
2 months
Me trying to keep up with AI News.
Tweet media one
75
180
2K
4
2
33
@Smol_AI
AI News by Smol AI
29 days
[18 Sept 2024] For the first time ever, an LLM has been able to 100% match and accurately report what we consider to be the top stories of the day without our intervention. @openai o1 destroys @Lmsysorg Arena @Alibaba_Qwen 2.5 @kyutai_labs Moshi
0
4
33
@Smol_AI
AI News by Smol AI
2 months
[8 Aug 2024] Too Cheap To Meter: AI prices cut 50-70% in last 30 days Price cuts of @lmsysorg top models in the last 30 days: - Rank 2: GPT4o cut 50% from May to Aug - Rank 3: GPT4o-mini cut >70% vs GPT3.5/4T - Rank 4: Llama 3.1 405b cut 46% in first 48hrs - Rank 8: @MistralAI
Tweet media one
1
10
32
@Smol_AI
AI News by Smol AI
2 months
[29 Aug 2024] Summer of Code AI: $1.6b raised, 1 usable product - @cognition_labs : $175m - @poolsideai : $400m - @codeiumdev : $150m - @magicailabs : $320m You can only use one of these products right now, but there's lots of promise!
0
2
28
@Smol_AI
AI News by Smol AI
1 year
we are back thanks to @FanaHOVA ! first project 📈
@swyx
swyx
1 year
🐣 Introducing `smol-developer`! ▸ Human-centric, coherent whole program synthesis ▸ your own junior developer ▸ develop, debug, decompile ▸ open source: ▸ 200 LOC, half english Insights: 💡 100k context can summarize both content and codebases 💡
84
376
3K
1
7
26
@Smol_AI
AI News by Smol AI
1 year
Tiny Language Models (below 10m parameters or only one transformer block) can generate paragraphs of coherent text and reason…provided training is limited to stories that only contain words that a typical 3 to 4-year-olds usually understand. Paper -
1
3
21
@Smol_AI
AI News by Smol AI
4 months
[2 Jul 2024] GraphRAG: The Marriage of Knowledge Graphs and RAG This is finally on our radar thanks to @emileifrem 's tireless advocacy, and MSR now open sourcing their code!
Tweet media one
@altryne
Alex Volkov (Thursd/AI)
4 months
GraphRAG is now on Github! I first heard of GraphRAG from @emileifrem at @aiDotEngineer a week ago, in the context of @neo4j , then had a random chat over lunch with a @alexchaomander who worked on it at Microsoft! I will dive deeper and talk about it on @thursdai_pod 👀
5
8
43
2
6
21
@Smol_AI
AI News by Smol AI
3 months
[9 July 2024] Depth is all you need: - Everybody is sleeping on @lilianweng 's latest review of Hallucination Detection/Prevention/Evals - @ylecun and @reach_vb on MobileLLM takeaways - Summary of @xiaolonw et al's Test Time Training architecture research. A late issue today due
Tweet media one
1
4
20
@Smol_AI
AI News by Smol AI
3 months
[V-1 Preview] of Smol Talk @swyx walks AINews readers through "version minus one" of Smol Talk, the customizable AI News platform. h/t @TheNoahHein
6
4
20
@Smol_AI
AI News by Smol AI
3 months
[22 July 2024] Llama 3.1 Leaks: big bumps to 8B, minor bumps to 70b, and SOTA OSS 405b model Big day for OSS AI tomorrow! good luck @soumithchintala !
Tweet media one
2
1
19
@Smol_AI
AI News by Smol AI
2 months
[6 Aug 2024] GPT4o August + 100% Structured Outputs for All! the newest model + API from @michpokrass and @athyuttamre seriously impressed us! Cut 20 lines of Instructor code + will probably save about 55% of our API costs between the price cut + better model + less retries.
Tweet media one
@swyx
swyx
2 months
ok my @smol_ai vibe check is done. the new 4o Aug is clearly better than 4o May and mostly better than 4o Mini. (greens are wins)
Tweet media one
Tweet media two
2
1
13
1
4
18
@Smol_AI
AI News by Smol AI
20 days
Llama Vision took the spotlight this week, but don't sleep on @allen_ai Molmo, which is now the #2 vision language model in the world* but is completely open: 1. Molmo 72B scores higher than L3.2 90B, same for 7B > L3.2 11B. 2. Arena ratings for 72B beat Gemini 1.5 pro and
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
2
17
@Smol_AI
AI News by Smol AI
1 month
[13 Sep 2024] Learning from the @OpenAIDevs AMA
Tweet media one
1
2
13
@Smol_AI
AI News by Smol AI
1 year
My dream model - 6 modalities - 300-500M total params - very aligned & instruction following - 27-33 heads - cute animal name - loves humans - rlly helpful but a bit naughty - rlly harmless & has guardrails - rlly honest - not from bigcorp (high quality volunteer) - very smol
1
3
11
@Smol_AI
AI News by Smol AI
3 months
[17 July 2024] Gemma 2 tops /r/LocalLlama vibe check Less than a month old but already handily beating Llama 3, Phi 3, Qwen 2, Mistral, and everyone else!
Tweet media one
@Smol_AI
AI News by Smol AI
4 months
[27 Jun 2024] Gemma 2: the Open Model for Everyone! ft. @kathleenkenealy of Gemma team, and @danielhanchen of @UnslothAI , at @aidotengineer
0
3
10
0
2
8
@Smol_AI
AI News by Smol AI
1 month
[9 Sept 2024] AIPhone 16: The Visual Intelligence Phone How many years until Apple Visual Intelligence is just... always on?
Tweet media one
1
1
10
@Smol_AI
AI News by Smol AI
3 months
[19 July 2024] DataComp-LM: The Best New Open Weights model! An incredible paper, benchmark, dataset, and model from the DataComp team and Apple ML Research!
Tweet media one
Tweet media two
Tweet media three
1
1
9
@Smol_AI
AI News by Smol AI
2 months
[21 Aug 2024] Ideogram 2 + Function Calling Leaderboard V2 'Tis the season of sequels. After the spectacular launch of Flux (the former Stable Diffusion team), @ideogram_ai (the former Google Imagen 1 team) is back with a vengeance. A new model, with 5 distinct styles with
1
0
9
@Smol_AI
AI News by Smol AI
16 days
[1 Oct 2024] @OpenAI Realtime API and other devday goodies!
2
0
9
@Smol_AI
AI News by Smol AI
1 month
[11 Sept 2024] As @reach_vb pointed out, congrats to @mistralailabs for beating @aiatmeta to releasing a multimodal model!
Tweet media one
0
0
9
@Smol_AI
AI News by Smol AI
3 months
[8 July 2024] Problems with MMLU-Pro Before @DanHendrycks ' could complete MMLU 2, the community has embraced MMLU-Pro as the de facto replacement. However the /r/LocalLlama gang is finding some broken English and obvious discrepancies favoring the closed models over open ones:
Tweet media one
Tweet media two
@WenhuChen
Wenhu Chen
5 months
Tired of MMLU? The current models already hit the ceiling? It's time to upgrade MMLU! Introducing our new benchmark MMLU-Pro, a more robust and challenging massive multi-task language understanding benchmark with 12K questions. What's New? 1. MMLU-Pro uses 10 options instead of
Tweet media one
46
128
676
1
2
8
@Smol_AI
AI News by Smol AI
3 months
[23 July 2024] Llama 3.1: The Synthetic Data Model From the paper, we detail all the ways in which Synthetic Data was used to get Llama 3 to frontier-level performance across code, instruction following, math, multilnguality, long context, tool use, and RLHF.
Tweet media one
Tweet media two
@swyx
swyx
3 months
Llama 3: the Synthetic Data model Llama 3 paper is finally out! by @lvdmaaten and Angela Fan. Quick diffs from yesterday's leaks (+ watch our exclusive @ThomasScialom interview out now!) - NEW SCALING LAWS! turns out there's a reason why they trained a 405B param model because
Tweet media one
Tweet media two
Tweet media three
Tweet media four
7
37
255
6
2
8
@Smol_AI
AI News by Smol AI
3 months
[28 July 2024] Apple Intelligence Beta + Segment Anything Model 2 a longer form breakdown of the two big papers from today.
Tweet media one
Tweet media two
1
0
8
@Smol_AI
AI News by Smol AI
24 days
[23 Sep 2024] No clear headline story, but lots of minor notables ahead of anticipated big drops from Anthropic and Meta this week
Tweet media one
1
0
7
@Smol_AI
AI News by Smol AI
4 months
[14 Jun 2024] Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata h/t @ctnzr , @kuchaev et al "Notably, throughout the entire alignment process, we relied on only approximately 20K human-annotated data... while our data generation pipeline
Tweet media one
Tweet media two
Tweet media three
@_philschmid
Philipp Schmid
4 months
Not Llama 3 405B, but Nemotron 4 340B! @nvidia just released 340B dense LLM matching the original @OpenAI GPT-4 performance for chat applications and synthetic data generation. 🤯 NVIDIA does not claim ownership of any outputs generated. 💚 TL;DR: 🧮 340B Paramters with 4k
Tweet media one
53
200
1K
0
1
7
@Smol_AI
AI News by Smol AI
5 months
AINews: 30 May 2024 @jaseweston taught LLMs to count using this ONE weird trick. @giffmana thinks this is more promising than linear attention, and @krishnanrohit says you can use this to add EXTERNAL MEMORY to attention...
Tweet media one
Tweet media two
@jaseweston
Jason Weston
5 months
🚨 Contextual Position Encoding (CoPE) 🚨 Context matters! CoPE is a new positional encoding method for transformers that takes into account *context*. - Can "count" distances per head dependent on need, e.g. i-th sentence or paragraph, words, verbs, etc. Not just tokens. -
Tweet media one
1
304
2K
1
1
6
@Smol_AI
AI News by Smol AI
4 months
[21 Jun 2024] @NoamShazeer et al (2024): you are overpaying for inference by >13x!
Tweet media one
0
0
5
@Smol_AI
AI News by Smol AI
4 months
[11 Jun 2024] Today's feature is @fchollet and @mikeknoop 's new $1m ARC prize, which proposes a benchmark that directly tracks François' definitions of AGI and is designed to have a much lower saturation curve than others:
Tweet media one
Tweet media two
0
2
6
@Smol_AI
AI News by Smol AI
4 months
AINews for June 5 2024 - ChatGPT's voice mode is "coming soon" - @leopoldasch launched a 5 part AGI timelines piece - @profTomYeh illustrates llm.c - @willccbb dropped a comprehensive GenAI Handbook - @Cohere completed its $450m raise at $5b valuation
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
5
@Smol_AI
AI News by Smol AI
4 months
[12 Jun 2024] The Last Hurrah of Stable Diffusion? The ~2B param SD3 Medium is finally here, but whither Stability? Will they ever release the 8B model? What's after SD3?
0
1
5
@Smol_AI
AI News by Smol AI
3 months
something we didnt catch: 4o mini uses more tokens for images than 4o. to match the SAME COST (aka a lot more) this leads me to suspect that 4o mini will be a -lot- better at vision tasks, has anyone verified? whats a good benchmark for this? @yitayml vibeeval?
1
0
5
@Smol_AI
AI News by Smol AI
1 year
Incredible story saving $500k a month by switching to finetuned BERT models: with 90% parity to ChatGPT and 15% of the latency!
Tweet media one
0
0
4
@Smol_AI
AI News by Smol AI
3 months
[31 July 2024] Gemma 2 2B, Scope and Shield!
0
0
5
@Smol_AI
AI News by Smol AI
3 months
[15 July 2024] AgentInstruct: Toward Generative Teaching with Agentic Flows The future of synthetic pre- and post- training data could be armies of agents doing macrodata refinement for us.
Tweet media one
@_philschmid
Philipp Schmid
3 months
A recipe for Synthetic Data 2.0? @Microsoft introduced “AgentInstruct” a new way to teach an LLM a new skill or behavior from synthetic data generated by LLM Agents. AgentInstruct improved a 7B (Orca-3) model by ~20% across all benchmarks and matched GPT-4 on RAG. AgentInstruct
Tweet media one
4
59
283
0
0
5
@Smol_AI
AI News by Smol AI
1 year
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Tweet media one
0
0
5
@Smol_AI
AI News by Smol AI
2 months
[5 Aug 2024] How Carlini Uses AI
Tweet media one
Tweet media two
0
0
3
@Smol_AI
AI News by Smol AI
1 month
Full newspaper here: Thanks to @weights_biases for supporting this month's AINews - join their hackathon Sept 21/22 here!
Tweet media one
1
0
4
@Smol_AI
AI News by Smol AI
2 months
Meanwhile in AI Engineer land, @shishirpatil_ updated the Berkeley Function Calling Leaderboard (now commonly known as BFCL) to BFCL V2 • Live, adding 2251 "live, user-contributed function documentation and queries, avoiding the drawbacks of dataset contamination and biased
Tweet media one
Tweet media two
1
0
4
@Smol_AI
AI News by Smol AI
5 months
AINews for 31 May 2024 Anthropic's Tool Use API is GA and very fully featured! - streaming - forced use - vision - 5 architectures for agents - a @CodeColt course on Tool Use! congrats to the team on a great rollout.
Tweet media one
Tweet media two
@alexalbert__
Alex Albert
5 months
Excited to announce that we’re spinning up an AI educational program and we just released our first course on tool use! Let me walk you through what it covers:
Tweet media one
16
79
773
1
0
4
@Smol_AI
AI News by Smol AI
1 year
🐣 The BabyLM Challenge: matching LLMs with 0.01% the size
Tweet media one
Tweet media two
0
1
4
@Smol_AI
AI News by Smol AI
3 months
[2 Aug 2024] Execuhires: Tempting The Wrath of Khan We are suprised as anyone.
Tweet media one
0
1
4
@Smol_AI
AI News by Smol AI
17 days
@LiquidAI_ Experimental podcast version:
0
0
4
@Smol_AI
AI News by Smol AI
1 year
Neeva is shutting down Neeva dot com, and pivoting to smol models for the enterprise 👀
Tweet media one
@Neeva
Neeva
1 year
It is with heavy hearts we announce will shut down over the next few weeks. We appreciate our passionate community of customers & users that have supported us over the past few years. ❤️ We thank you for understanding. Here’s some more information ⤵️🧵
68
42
292
0
0
4
@Smol_AI
AI News by Smol AI
4 months
@HamelHusain @eugeneyan @vboykis Adding this to testimonials
0
0
4
@Smol_AI
AI News by Smol AI
1 month
[4 Sept 2024] $1150m for @ssi , @SakanaAILabs , @youdotcom + @AnthropicAI Claude 500m context!
Tweet media one
3
0
4
@Smol_AI
AI News by Smol AI
4 months
[17 Jun 2024] Is this... Q*?
Tweet media one
Tweet media two
@teortaxesTex
Teortaxes▶️
4 months
Slowly, then suddenly.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
22
270
2K
0
0
3
@Smol_AI
AI News by Smol AI
3 months
full issue here with a big breakdown of /r/localLlama's reactions:
Tweet media one
0
0
3
@Smol_AI
AI News by Smol AI
1 year
"big things ~~can~~ should always start small" 👏
@amasad
Amjad Masad
1 year
More than a year ago, Ghostwriter proof-of-concept took a few hours to prototype. Now it's a flagship product for Replit. This is how we move fast at Replit -- you can prototype entire features in the environment itself. In this case, the PoC worked by hosting a small OSS LLM
Tweet media one
28
52
745
0
1
3
@Smol_AI
AI News by Smol AI
2 years
hello world
1
0
3
@Smol_AI
AI News by Smol AI
3 months
[16 July 2024] SciCode: HumanEval gets a STEM PhD upgrade!
@MinyangTian1
Minyang Tian
3 months
SciCode is our new benchmark that challenges LMs to code solutions for scientific problems from advanced papers. The challenges were crafted by PhDs; ~10% of our benchmark is based on Nobel-winning research. GPT-4 and Sonnet 3.5 get <5% ACC. 🧵 1/6
Tweet media one
10
62
263
0
0
3
@Smol_AI
AI News by Smol AI
4 months
[25 Jun 2024] Claude 3.5 Sonnet wins everybody's hearts and memes
0
0
2
@Smol_AI
AI News by Smol AI
2 months
shh (dm us after for access)
@iamsyriaz
Soumar
2 months
@Smol_AI is there a way to use this but with a summary of my own feed?
0
0
0
1
0
2
@Smol_AI
AI News by Smol AI
29 days
@zimmskal @sroecker email is cheap - skimming and ctrl+f is how you are meant to use it, not read from start to end
1
0
2
@Smol_AI
AI News by Smol AI
5 months
nice complement to reading the recent Chameleon paper with only 2 categories of multimodality:
@rohanpaul_ai
Rohan Paul
5 months
Nice paper surveying Multimodal AI Architectures -- with a comprehensive taxonomy and analysis of their pros/cons & applications in any-to-any modality model development 📌 𝐂𝐨𝐦𝐩𝐫𝐞𝐡𝐞𝐧𝐬𝐢𝐯𝐞 𝐓𝐚𝐱𝐨𝐧𝐨𝐦𝐲: First work to explicitly identify and categorize four broad
Tweet media one
6
155
592
0
0
2
@Smol_AI
AI News by Smol AI
1 month
@ssi @SakanaAILabs @youdotcom @AnthropicAI we are experiencing some very serious email deliverability issues with buttondown and were unable to resolve it today, sorry but the archives at least are still up.
2
0
2
@Smol_AI
AI News by Smol AI
4 months
[18 June 2024] Gemini launches context caching... or does it? Today was a great day for AINews followups: - Nvidia's Nemotron now ranks #1 open model on @LMsysorg and #11 overall (beating Llama-3-70b, which maybe isn't that impressive but perhaps wasnt the point), - Meta's
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
0
2
@Smol_AI
AI News by Smol AI
1 month
@chongdashu we also just realized that a huge % of our mailing list was marked undeliverable without our notification - almost surely a bug. @justinmduke is on it
@Smol_AI
AI News by Smol AI
1 month
@ssi @SakanaAILabs @youdotcom @AnthropicAI we are experiencing some very serious email deliverability issues with buttondown and were unable to resolve it today, sorry but the archives at least are still up.
2
0
2
1
0
2
@Smol_AI
AI News by Smol AI
1 month
@logan_engstrom @Replit @cursor_ai @ValDotTown @CosineAI @honeycombsh @swyx done. interesting that gpt4o memorized (predicted?) your name...
1
0
2