Knut Jägersberg

@JagersbergKnut

6,184
Followers
5,085
Following
4,588
Media
96,147
Statuses

Content Strategy & AI @knutjaegersberg@sigmoid.social

Gronau, Germany
Joined June 2018
@JagersbergKnut
Knut Jägersberg
2 years
declare-lab/flan-alpaca-xl Base model: flan-t5, thus no license probs
13
51
306
@JagersbergKnut
Knut Jägersberg
7 months
@tsarnick weirdly, I think it is existing celebrities. Sure you can make AI influencers, and they can and will compete, but even with interesting personality, the brands of at least some existing people remain valuable. People are interested in people, even with superinteresting AI around.
17
3
285
@JagersbergKnut
Knut Jägersberg
2 years
WizardLM: An Instruction-following LLM Using Evol-Instruct uuh "WizardLM-7B outperforms ChatGPT in the high-complexity instructions... Evol-Instruct is a novel method using LLMs instead of humans to automatically mass-produce open-domain instructions"
9
57
280
@JagersbergKnut
Knut Jägersberg
2 years
TheBloke/galpaca-30B-GPTQ-4bit-128g Tom Jobbins had the kindness to quantize galpaca 30b, it fits in 18gb of vram.
4
58
274
@JagersbergKnut
Knut Jägersberg
2 years
GeorgiaTechResearchInstitute/galpaca-30b Please, somebody quantize this to 4-bit int!
7
50
263
@JagersbergKnut
Knut Jägersberg
11 months
llmware/dragon-mistral-7b-v0 A RAG model
7
19
259
@JagersbergKnut
Knut Jägersberg
10 months
TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
7
33
203
@JagersbergKnut
Knut Jägersberg
2 years
Writer/camel-5b-hf Camel-5b, a state-of-the-art instruction-following large language model
6
36
201
@JagersbergKnut
Knut Jägersberg
1 year
GeneZC/MiniMA-3B A language model distilled from an adapted version of LLaMA2-7B following "Towards the Law of Capacity Gap in Distilling Language Models".
Tweet media one
6
28
196
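The distillation mentioned above trains a small student model to match a larger teacher. A minimal, generic soft-label distillation loss looks roughly like the sketch below; this is the textbook formulation, not necessarily the exact recipe from the cited MiniMA paper.

```python
# Generic knowledge-distillation loss: the student mimics the teacher's softened
# output distribution. Temperature and scaling are illustrative choices.
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # KL(teacher || student) on temperature-softened distributions,
    # scaled by T^2 to keep gradient magnitudes comparable.
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2
```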
@JagersbergKnut
Knut Jägersberg
2 years
TheBloke/alpaca-lora-65B-GGML -4bit -2bit Already with positive community reception: "This is the best model I have tried locally this far. Thank you!"
5
37
186
@JagersbergKnut
Knut Jägersberg
1 year
Recent LLMs:
- StableVicuna 13B
- WizardLM 7B
- GPT4 based Alpaca 30B
- OpenAssistant data llama fine tune 30B
- RWKV Raven 14B with EvolInstruct added
- Replit Code
- FastChat-T5
- GPT4-X-Alpasta-30B
- GPT4-X-AlpacaDente-30B
- llama-30b-supercot
- Chimera-13B
- Alpacino-30B
4
43
178
@JagersbergKnut
Knut Jägersberg
2 years
digitous/Alpacino30b A triple model merge of (Alpaca+(CoT+Storytelling)), resulting in a comprehensive boost in Alpaca's reasoning and story writing capabilities.
4
34
176
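Merges like the one above typically combine same-architecture checkpoints in weight space. Below is a rough sketch of a plain linear merge; the actual Alpacino30b recipe is not given here, so the checkpoint paths and mixing coefficients are placeholders.

```python
# Naive linear weight merge of same-architecture fine-tunes (illustrative only).
import torch
from transformers import AutoModelForCausalLM

paths = ["alpaca-ft", "cot-ft", "storytelling-ft"]  # placeholder checkpoint paths
coeffs = [0.5, 0.25, 0.25]                          # assumed mixing weights

models = [AutoModelForCausalLM.from_pretrained(p, torch_dtype=torch.float32) for p in paths]
state_dicts = [m.state_dict() for m in models]

# Weighted sum of every parameter tensor across the checkpoints.
merged = {k: sum(c * sd[k] for c, sd in zip(coeffs, state_dicts)) for k in state_dicts[0]}
models[0].load_state_dict(merged)
models[0].save_pretrained("merged-model")
```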
@JagersbergKnut
Knut Jägersberg
1 year
LLMs/AlpacaGPT4-LoRA-7B-OpenLLaMA Soon there will be high-performance LLMs with permissive licenses. It won't take long until the fully trained OpenLLaMA gets a fine-tune on OpenAssistant data. Later we will have a 30b RedPajama on that.
3
45
174
@JagersbergKnut
Knut Jägersberg
1 year
TheBloke/wizard-vicuna-13B-HF
Tweet media one
8
28
147
@JagersbergKnut
Knut Jägersberg
10 months
Trelis/Mistral-7B-Instruct-v0.1-Summarize-16k They used a patent dataset to make it. They also have a 60k version, which you can buy for 30 euros. That's the first model I have seen on the hub which you can purchase. I don't think that's a bad idea.
5
18
142
@JagersbergKnut
Knut Jägersberg
6 months
llama3-42b: Pruned llama3-70b
Tweet media one
5
21
144
@JagersbergKnut
Knut Jägersberg
10 months
Q-bert/Mamba-1B
Tweet media one
1
14
142
@JagersbergKnut
Knut Jägersberg
1 year
01-ai/Yi-34B-200K The 200k context model was released.
1
23
143
@JagersbergKnut
Knut Jägersberg
1 year
One guy is really flooding the hub
Tweet media one
19
3
133
@JagersbergKnut
Knut Jägersberg
2 years
More ranking LLMs for distraction:
- Gpt-4
- Gpt-4-distilling-alpaca-30b
- Chatgpt / gpt-3.5
- alpaca-30b
- galpaca-30b
- Llama 30b
- Gpt-4-alpaca-13b
- Vicuna-13b
- Koala-13b
- Alpaca-13b
- Flan-t5-xxl
8
23
133
@JagersbergKnut
Knut Jägersberg
9 months
internlm/internlm2-20b (200K) Claims ChatGPT comparable performance
4
18
123
@JagersbergKnut
Knut Jägersberg
11 months
Made a TinyLlama-based 1b deacon and quantized it to 6-bit. I'm surprised by the quality of the output. Upload might take 20 minutes or so.
5
16
119
@JagersbergKnut
Knut Jägersberg
1 year
BLING: "Best Little Instruction-following No-GPU-required" - BLING is designed for enterprise automation use cases, especially in knowledge-intensive industries - BLING is not designed for 'chat-bot' or 'consumer-oriented' applications
4
15
120
@JagersbergKnut
Knut Jägersberg
9 months
So what's up with this model here?
4
14
116
@JagersbergKnut
Knut Jägersberg
1 year
This model must be crazy fast at inference and still good
3
24
108
@JagersbergKnut
Knut Jägersberg
1 year
globuslabs/ScholarBERT-XL The model is pretrained on a large collection of scientific research articles (221B tokens).
3
18
103
@JagersbergKnut
Knut Jägersberg
1 year
News: "Preparing the 33B version and we expect to empower WizardLM with the ability to perform instruction evolution itself, aiming to evolve your specific data at a low cost. - released 13B version of WizardLM trained with 250k evolved instructions.
2
19
100
@JagersbergKnut
Knut Jägersberg
7 months
Any news on what happened to @TheBlokeAI?
18
2
96
@JagersbergKnut
Knut Jägersberg
1 year
TheBloke/MPT-30B-Dolphin-v2-GGML This and the non-quantized weights might now be one of the best LLMs on HF. Replicating orca, I guess without censorship.
4
21
94
@JagersbergKnut
Knut Jägersberg
10 months
Altman says they brought the cost of operating gpt-3 (I guess that's gpt-3.5 by now) down by a factor of 40. Tell that to anybody who thinks chatgpt has 175 billion parameters.
8
4
90
@JagersbergKnut
Knut Jägersberg
7 months
First Jamba fine tunes are incoming
1
10
92
@JagersbergKnut
Knut Jägersberg
2 years
Effects of different quantization of llama
Tweet media one
2
13
85
@JagersbergKnut
Knut Jägersberg
1 year
seonghyeonye/flipped_11B Generates Instructions
3
18
87
@JagersbergKnut
Knut Jägersberg
10 months
LLM overview
pretrained; below 2b:
- Mamba 1b
- TinyLlama 1.1b
- Qwen-1_8b
- LiteLlama-460M-1T
pretrained; 3b:
- stablelm-3b-4e1t
- Phi-2
- MiniMa-2-3b
- BTLM-3b
4
13
85
@JagersbergKnut
Knut Jägersberg
1 year
TheBloke/MistralLite-7B-AWQ A quantized version of the Mistral variant that follows instructions over contexts of up to 32k tokens.
3
7
77
@JagersbergKnut
Knut Jägersberg
10 months
Open-hermes 2.5 is better than GPT-3.5 in my real-world tests, change my mind
Tweet media one
11
5
78
@JagersbergKnut
Knut Jägersberg
1 year
Ahead of e5 embeddings in MTEB!
4
11
78
@JagersbergKnut
Knut Jägersberg
6 months
Did anybody notice Nvidia published a competitive llama3-70b QA/RAG fine tune?
Tweet media one
8
8
79
@JagersbergKnut
Knut Jägersberg
1 year
amazon/FalconLite2 "By utilizing 4-bit GPTQ quantization and adapted RotaryEmbedding, FalconLite2 is able to process 10x longer contexts while consuming 4x less GPU memory than the original model."
2
15
76
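For context on the tweet above: a 4-bit GPTQ checkpoint is usually loaded straight through transformers once a GPTQ runtime (optimum plus auto-gptq) is installed. The snippet below is a hedged sketch of that general pattern, not necessarily the recommended deployment path for FalconLite2 itself.

```python
# Hedged sketch: loading a 4-bit GPTQ checkpoint with transformers.
# Requires a GPTQ runtime (e.g. optimum + auto-gptq); trust_remote_code is
# needed for models that ship custom modeling code.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "amazon/FalconLite2"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    trust_remote_code=True,
)

prompt = "Summarize the key idea of rotary position embeddings:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=64)[0], skip_special_tokens=True))
```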
@JagersbergKnut
Knut Jägersberg
1 year
dynamofl/mistral-2 What do we have here? Seemingly a pruned Mistral!
9
4
76
@JagersbergKnut
Knut Jägersberg
8 months
abideen/gemma-7b-openhermes openllm average: 73.5%
6
4
75
@JagersbergKnut
Knut Jägersberg
1 year
Become a cognitive engineer or perish.
9
5
72
@JagersbergKnut
Knut Jägersberg
11 months
This is a 600b LLM. But they don't give access to the weights. I could imagine it's falcon-180bs glued together.
9
4
74
@JagersbergKnut
Knut Jägersberg
8 months
Tweet media one
2
7
72
@JagersbergKnut
Knut Jägersberg
10 months
Doctor-Shotgun/TinyLlama-1.1B-32k For speculative decoding
2
13
72
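Speculative (assisted) decoding, the use case named above, pairs a small draft model with a larger target model that verifies the draft's proposed tokens. A minimal sketch using transformers' assisted generation is below; the target model name is an assumption, and draft and target must share a compatible tokenizer/vocabulary.

```python
# Minimal assisted-generation sketch: the tiny model drafts tokens, the big model verifies.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

target_id = "meta-llama/Llama-2-7b-hf"            # assumed target; needs a matching vocab
draft_id = "Doctor-Shotgun/TinyLlama-1.1B-32k"    # draft model from the tweet above

tokenizer = AutoTokenizer.from_pretrained(target_id)
target = AutoModelForCausalLM.from_pretrained(target_id, torch_dtype=torch.float16, device_map="auto")
draft = AutoModelForCausalLM.from_pretrained(draft_id, torch_dtype=torch.float16, device_map="auto")

inputs = tokenizer("Speculative decoding speeds up inference by", return_tensors="pt").to(target.device)
out = target.generate(**inputs, assistant_model=draft, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```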
@JagersbergKnut
Knut Jägersberg
10 months
freecs/ArtificialThinker-Phi2 Adds an explicit reasoning phase in the prompt template, akin to kaist-ai/CoT-Collection
3
7
69
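The "explicit reasoning phase" above is a prompt-template idea: the model is asked to emit its reasoning in a dedicated section before the final answer. The template below is a hypothetical illustration of that pattern, not the model's actual template.

```python
# Hypothetical prompt template with a dedicated reasoning section before the answer.
TEMPLATE = """### Instruction:
{instruction}

### Reasoning:
{reasoning}

### Response:
{response}"""

print(TEMPLATE.format(
    instruction="What is 17 * 23?",
    reasoning="17 * 23 = 17 * 20 + 17 * 3 = 340 + 51 = 391.",
    response="391",
))
```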
@JagersbergKnut
Knut Jägersberg
1 year
LumiOpen/Poro-34B "Poro is a 34B parameter decoder-only transformer pretrained on Finnish, English and code. It is being trained on 1 trillion tokens (300 billion as of this release)." More and more are coming, even for languages of small countries. 🇩🇪?
6
14
69
@JagersbergKnut
Knut Jägersberg
1 year
Oh, so this is probably the first regular 1b model pretrained on over 1 trillion tokens.
3
9
69
@JagersbergKnut
Knut Jägersberg
10 months
I bet true AGI is built before the EU AI Act comes into effect.
17
2
68
@JagersbergKnut
Knut Jägersberg
8 months
dranger003/LWM-Text-Chat-128K-iMat.GGUF "The imatrix Q4-K quant fits with 32K context on 24GB and gives me ~100 t/s inference on a 3090. With IQ3_XXS it seems to fit ~37K context on 24GB (and it is even faster than Q4-K)."
2
10
69
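The quoted numbers above come from running the GGUF quant with llama.cpp. A small sketch of an equivalent llama-cpp-python call is below; the local file name and context size are illustrative.

```python
# Hedged sketch: running a GGUF quant with full GPU offload and a large context window.
from llama_cpp import Llama

llm = Llama(
    model_path="lwm-text-chat-128k-iq4_k.gguf",  # hypothetical local file name
    n_ctx=32768,       # ~32K context, reported above to fit on a 24 GB card
    n_gpu_layers=-1,   # offload all layers to the GPU
)

out = llm("Summarize the following document:\n...", max_tokens=64)
print(out["choices"][0]["text"])
```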
@JagersbergKnut
Knut Jägersberg
11 months
TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T 💪
Tweet media one
4
7
69
@JagersbergKnut
Knut Jägersberg
9 months
OrionStarAI/Orion-14B-Base 2.5T multilingual corpus, including Chinese, English, Japanese, Korean
1
17
66
@JagersbergKnut
Knut Jägersberg
11 months
Open-Orca/Mixtral-SlimOrca-8x7B 👀👀
4
7
66
@JagersbergKnut
Knut Jägersberg
1 year
I didn't know Mistral is that good
Tweet media one
5
3
66
@JagersbergKnut
Knut Jägersberg
11 months
VAGOsolutions/SauerkrautLM-7b-HerO merge of Teknium's OpenHermes-2.5-Mistral-7B and Open-Orca's Mistral-7B-OpenOrca, fine-tuned on the Sauerkraut dataset
Tweet media one
3
10
64
@JagersbergKnut
Knut Jägersberg
7 months
Vezora/Mistral-22B-v0.1 MoE merge, probably something
3
9
63
@JagersbergKnut
Knut Jägersberg
10 months
KnutJaegersberg/Tess-M-34B-2bit 🦾 A QuIP# 2-bit quantization of Tess-M by @migtissera based on Yi-34B-200k by @01AI_Yi. Weights are 10 GB smol, yet model quality is very good! Made with 8k context hessians to enhance long context inference quality.
Tweet media one
4
8
63
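A quick back-of-the-envelope check on the ~10 GB figure above: 34B parameters at 2 bits each come out just under 8 GiB, so roughly 10 GB on disk with quantization overhead (codebooks, scales, unquantized embeddings) is plausible.

```python
# Rough size estimate for a 2-bit quantization of a 34B-parameter model.
params = 34e9
bits_per_weight = 2
size_gib = params * bits_per_weight / 8 / 2**30
print(f"{size_gib:.1f} GiB before overhead")  # ~7.9 GiB
```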
@JagersbergKnut
Knut Jägersberg
1 year
HuggingFaceBR4/falcon-180B-python-sft-logging What's that?
4
7
62
@JagersbergKnut
Knut Jägersberg
3 years
The DriveML and forester ML libraries have a very streamlined workflow for using tuned ML models for explainable ML; forester also has EDA functions. One can expect a good ratio of performance and insights to coding time invested. #rstats #machinelearning #datascience
Tweet media one
1
20
60
@JagersbergKnut
Knut Jägersberg
1 year
There was this interview with @sama that suggested they considered open-sourcing gpt3, but they thought most businesses could not handle it, as it is so big. I could download the model below and set up a system in the cloud with two 80gb vram gpus if I wanted to.
5
6
61
@JagersbergKnut
Knut Jägersberg
9 months
haoranxu/ALMA-13B-R There it is, currently the best open-source option for machine translation.
4
12
61
@JagersbergKnut
Knut Jägersberg
11 months
Tweet media one
4
9
57
@JagersbergKnut
Knut Jägersberg
1 year
openlm-research/open_llama_13b
2
11
58
@JagersbergKnut
Knut Jägersberg
2 years
MBZUAI/LaMini-T5-738M LaMini: A Diverse Herd of Distilled Models from Large-Scale Instructions
Tweet media one
5
15
56
@JagersbergKnut
Knut Jägersberg
11 months
Llama3 as fast as possible
6
2
56
@JagersbergKnut
Knut Jägersberg
25 days
@kimmonismus This wave of AI is indeed disruptive. Entire professions almost disappear: translators, transcribers, narrators, and the field is expanding. Like professional draughtsmen who were replaced by PCs, software and printers. It's that kind of change and it's accelerating.
6
3
55
@JagersbergKnut
Knut Jägersberg
2 years
AI: What is the future of artificial intelligence? - BBC News "I've tried to brief policymakers: it is like explaining particle physics to a chocolate chip cookie"
Tweet media one
11
15
54
@JagersbergKnut
Knut Jägersberg
7 months
Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference
Tweet media one
0
15
53
@JagersbergKnut
Knut Jägersberg
8 months
An Open Source text-to-speech system built by inverting Whisper. Somehow I forgot about that one
3
10
53
@JagersbergKnut
Knut Jägersberg
7 months
Tweet media one
6
11
54
@JagersbergKnut
Knut Jägersberg
10 months
GeneZC/MiniMA-2-3B New more powerful iteration of MiniMA!
Tweet media one
4
10
54
@JagersbergKnut
Knut Jägersberg
1 year
CausalLM/14B Wonderful, they merged llama2 and Qwen and made a llama out of it.
Tweet media one
4
7
54
@JagersbergKnut
Knut Jägersberg
10 months
myshell-ai/OpenVoice instant voice cloning
2
11
52
@JagersbergKnut
Knut Jägersberg
2 years
GPT4Tools: Teaching LLM to Use Tools via Self-instruction
Tweet media one
1
13
50