Say hello to Grok-1's new PyTorch + Hugging Face edition! 314 billion parameters, 3.8x faster inference. Easy to use, open source, and optimized by Colossal-AI. Dive in: #Grok1 #ColossalAI
Download Now:
Exciting News from Open-Sora! They've just made the ENTIRE suite of their video-generation model open source! Dive into the world of cutting-edge AI with access to model weights, comprehensive training source code, and detailed architecture insights. Start building your dream
Build your own video generation model like #Sora! Experience the power of replication without the price tag! Open-Sora delivers a low-cost implementation of Sora, cutting costs by a staggering 46%. Expand your sequences to nearly a million with this innovative open-source
Want to train a model like #Sora? Check out our new project #OpenDiT!
OpenDiT is an easy-to-use, fast, and memory-efficient system for training and deploying DiT models, which are the foundation of models like Sora.
With OpenDiT, you can achieve:
* Up to 80% faster training
Speed up Open-Sora's training by 3x and inference by 2x with our novel DSP (Dynamic Sequence Parallelism)! For 10s 512x512 videos, Open-Sora's inference time:
1xH800: 106s
8xH800: 45s
8xH800+DSP: 22s
DSP can be seamlessly adapted to all multi-dimensional transformers, unlocking
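The general idea behind this kind of dynamic re-sharding can be illustrated with a toy single-process sketch (numpy arrays stand in for real devices and collectives here; the actual DSP implementation is more involved):

```python
import numpy as np

def all_to_all(shards, axis_split, axis_concat):
    """Simulated all-to-all collective: re-shard per-device chunks
    from one tensor dimension onto another."""
    full = np.concatenate(shards, axis=axis_concat)  # gather current shards
    return np.array_split(full, len(shards), axis=axis_split)

# A toy video activation tensor: (time, space, hidden)
x = np.arange(8 * 6 * 4, dtype=float).reshape(8, 6, 4)
devices = 2

# Shard over the spatial dim so each device sees the full time axis
# and can run temporal attention locally ...
shards = np.array_split(x, devices, axis=1)
assert shards[0].shape == (8, 3, 4)

# ... then dynamically re-shard over the time dim for spatial attention.
shards = all_to_all(shards, axis_split=0, axis_concat=1)
assert shards[0].shape == (4, 6, 4)
```

Re-sharding like this lets each attention step run over a full, unsplit dimension without any device ever holding the whole tensor.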
I am happy to share that our paper has been accepted by ICLR as an ORAL paper (1.2% acceptance rate).
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
InfoBatch randomly prunes a portion of less informative samples based on the
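A minimal sketch of the idea (the mean-loss threshold and `prune_ratio` below are illustrative choices, not the paper's exact hyperparameters): samples with below-average loss are dropped with some probability, and the survivors are up-weighted so the expected gradient stays unbiased.

```python
import numpy as np

rng = np.random.default_rng(0)

def infobatch_prune(losses, prune_ratio=0.5):
    """Drop each below-average-loss sample with prob. `prune_ratio`;
    re-weight the kept low-loss samples by 1/(1 - prune_ratio) so the
    expected gradient over the batch is unchanged."""
    losses = np.asarray(losses, dtype=float)
    low = losses < losses.mean()                      # "less informative"
    drop = low & (rng.random(losses.shape) < prune_ratio)
    keep = ~drop
    weights = np.where(low, 1.0 / (1.0 - prune_ratio), 1.0)
    return keep, weights[keep]

keep, w = infobatch_prune([0.1, 0.2, 1.5, 2.0, 0.05, 1.8])
print(int(keep.sum()), "of 6 samples kept")
```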
I'm grateful to graduate from
@Berkeley_EECS
with the Lotfi A. Zadeh Prize. I'm excited to announce that I will join the National University of Singapore as a tenure-track assistant professor at the Department of Computer Science in
@NUSComputing
I am happy to share that our paper won the Outstanding Paper Award of ACL.
We propose CAME to simultaneously achieve two goals: fast convergence as in traditional adaptive methods, and low memory usage as in LLM training.
Get ready for cinematic magic with Open-Sora! It generates 16s & 720p video. Say hello to seamless storytelling, where your vivid imagination comes to life in high-definition with just a prompt! Open-Sora's bucket strategy redefines efficiency, with only 64 GPUs,
Students of @UCBerkeley usually get the Ph.D. lollipop when they submit their dissertations. I could not do that because of COVID-19. However, @GradDivision mailed it to me from 13,590.66 km away! What a great tradition! What a big surprise! Thanks a lot!
@GradDivision
@Berkeley_EECS
Our paper was published on May 26, 2021, and was also accepted by ACL.
We clearly named the method ring self-attention (i.e. ring attention).
I did not find any substantial difference between ring self-attention and ring attention.
To my knowledge, our work is the first
New paper w/ @matei_zaharia @pabbeel on transformers with large context size.
We propose RingAttention, which allows training on sequences that are device-count times longer than the prior state of the art, without attention approximations or additional overhead.
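A toy single-process sketch of what this means (array blocks stand in for devices and a loop stands in for the ring communication; the real system overlaps block transfers with compute):

```python
import numpy as np

def full_attention(q, k, v):
    """Reference: exact softmax attention over the whole sequence."""
    s = q @ k.T / np.sqrt(q.shape[-1])
    p = np.exp(s - s.max(-1, keepdims=True))
    return (p / p.sum(-1, keepdims=True)) @ v

def ring_attention(q, k, v, devices=4):
    """Each 'device' keeps its query block and accumulates exact attention
    with an online (log-sum-exp) update as K/V blocks rotate around the
    ring, so no device ever holds the full sequence. No approximation."""
    d = q.shape[-1]
    qs, ks, vs = (np.array_split(t, devices) for t in (q, k, v))
    outs = []
    for i, qi in enumerate(qs):
        m = np.full(qi.shape[0], -np.inf)          # running row max
        l = np.zeros(qi.shape[0])                  # running normalizer
        acc = np.zeros((qi.shape[0], v.shape[-1])) # running weighted sum
        for step in range(devices):                # ring order of K/V blocks
            j = (i + step) % devices
            s = qi @ ks[j].T / np.sqrt(d)
            m_new = np.maximum(m, s.max(-1))
            scale = np.exp(m - m_new)
            p = np.exp(s - m_new[:, None])
            l = l * scale + p.sum(-1)
            acc = acc * scale[:, None] + p @ vs[j]
            m = m_new
        outs.append(acc / l[:, None])
    return np.concatenate(outs)

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((16, 8)) for _ in range(3))
assert np.allclose(ring_attention(q, k, v), full_attention(q, k, v))
```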
Colossal-AI team just released SwiftInfer - a TensorRT-based implementation of StreamingLLM, boosting inference performance by a whopping 46%! In the scenario of long-text multi-round conversations, StreamingLLM can improve the ability of an LLM to understand and remember context,
I'll attend NeurIPS: please let me know if you want to chat or grab a coffee (or watch the FIFA World Cup)! DMs are open.
Excited to be finally back at an in-person NeurIPS after 3 years!
#NeurIPS2022
Would you like to accelerate AI model training by 10x? Do you want an easy-to-use system that abstracts away all the repetitive nonsense from under the hood?
Fret not, Colossal-AI is now open-source!
All of our 5 paper submissions have been accepted by
#CVPR2022
Congrats to my students! Hopefully, see you in New Orleans! More details can be found here:
How can we use Adversarial Learning to speed up the training process of AI models? I am happy to share our new paper, which was recently accepted by ICLR'22
I'm happy to share that my students and I recently built a tech startup
@HPCAITech
We are working on AI systems. We have raised 4.7 million USD in just 3 months :-)
The former premier of China passed away. He was a visionary leader who dedicated himself to the progress and well-being of his nation. Rising from humble origins, he ascended through his exceptional talent and wisdom to the nation's highest echelons of leadership. Tasked with
Exciting news in AI! A 20% enhancement in training efficiency for LLaMA3 8B and 70B! Colossal-AI offers tailored solutions for LLaMA3 models, significantly boosting training efficiency and setting new standards with exceptional performance. Check out the open-source project on
NUS computer science Ph.D. program (full scholarship) has a Spring intake. The deadline is June 15th. Here is the application information: My research group's information can be found at
I'd like to introduce the Colossal-AI system, which can potentially help you train/deploy super-large AI models quickly without changing your code.
GitHub:
Paper:
I am happy to see our LAMB optimizer was included in MLPerf's BERT implementation.
Google finished BERT training in 24 seconds based on MLPerf.
However, MLPerf used its own convergence metric, which is different from Mr. Jacob Devlin's baseline.
It is my pleasure to be the session chair for ML: Optimization at
#AAAI23
Our session will cover the latest techniques in machine learning optimization.
If you are interested in improving the efficiency of ChatGPT, Stable Diffusion, DALL·E 2, and AlphaFold 2, come talk to us!
Getting Chatbot Arena model rankings with 2000× less time (5 minutes) and 5000× less cost ($0.6), simply by mixing the off-the-shelf benchmarks!
Introducing our MixEval, a revolutionary #LLMs evaluation paradigm that's fast, cheap, and precise! By blending real-world
How to get Chatbot Arena model rankings with 2000× less time (5 minutes) and 5000× less cost ($0.6)?
Maybe simply mix the classic benchmarks.
Introducing MixEval, a new gold-standard LLM evaluation paradigm standing on the shoulders of giants (classic benchmarks).
I will be speaking at the 37th AAAI Conference on Artificial Intelligence on Feb 7th and 8th! I'll be discussing how to efficiently train large AI models like GPT-3 and Stable Diffusion. See you there.
#AAAI23
#AAAI
#AI
#ArtificialIntelligence
@RealAAAI
We are actively seeking talented postdoctoral researchers specializing in LLM and MLSys. If you have a passion for these fields, please click on the links below for more information and to apply.
Berkeley was ranked No. 1 by Forbes on the top US colleges list! Berkeley is the first public university to win Forbes' top ranking. That's amazing! I miss CAL :-)
Congratulations to Prof. Jack Dongarra for winning the Turing Award! Well deserved! BTW, I want to mention that my advisor Prof. James Demmel @Berkeley_EECS also made significant contributions to HPC and numerical libraries. This picture can tell us something :-)
Congratulations to Bill Gropp, who was recently elected IEEE Computer Society president! Bill was the host of my faculty job interview at the UIUC CS department. He gave me a good piece of career advice. He is a very nice person. I wish him the best of luck in the new job!
To have a happy life, we should find more people who love us, instead of minimizing the number of people who hate us. The number of people who hate us really does not matter, but the number of people who love us determines how far we can go :-)
Our two tutorials have been accepted by
@RealAAAI
. It is my privilege to teach AI to top AI experts in the world. See you in Washington DC!
#AAAI23
Tutorial 1: Colossal-AI: Scaling AI Models in Big Model Era
Tutorial 2: Large-scale Deep Learning Optimization Techniques
Based on current techniques, an LLM query will be more expensive than a search engine query.
LLM inference mainly uses matrix-matrix multiplies. A search engine (e.g. the PageRank algorithm) is based on matrix-vector multiplies. Each database query is just a matching operation.
Matrix-matrix
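As a back-of-the-envelope illustration of that gap (FLOP counts only, ignoring memory traffic and batching):

```python
# Multiplying an n x n matrix by another n x n matrix costs ~2*n^3 FLOPs,
# while multiplying it by a single vector costs ~2*n^2 FLOPs, so each
# matrix-matrix operation is roughly n times more work.
def matmat_flops(n):
    return 2 * n ** 3

def matvec_flops(n):
    return 2 * n ** 2

n = 4096
print(matmat_flops(n) / matvec_flops(n))  # -> 4096.0
```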
I'd like to share a paper recently published by Google: "Exploring the Limits of Concurrency in ML Training on Google TPUs". It shows how Google finishes the training of large deep learning models within one minute.
A peaceful protest at
@UCBerkeley
. I am happy to see they drew lines on the ground to implement social distancing. I found that many of them are actually elderly people, who are vulnerable to COVID-19 and violence. I want to thank them for what they did for the community.
Excited to kick off the new semester! There's nothing quite like teaching in a bustling classroom packed with so much talent. Here's to a great term ahead!
Our new work with @quocleix and @tanmingxing.
People can now finish ImageNet training in 1 minute. However, a 75.9% convergence accuracy is probably too low to be practical. We achieve 83% ImageNet top-1 accuracy in 1 hour, which is a world speed record.
Excited to share our
#ICCV2023
paper: Fine-tuning Vision-Language Models without Zero-Shot Transfer Degradation (ZSCL). ZSCL outperforms the pre-trained model on downstream tasks and maintains its zero-shot transferability to other tasks.
paper:
blog:
Introducing
#ICLR2022
Concurrent Adversarial Learning for Large-Batch Training
Motivation: Large-batch training has become a widely used technique when training neural networks with a large number of GPU/TPU processors.
Check our new paper at
#NeurIPS2022
Random Sharpness-Aware Minimization
We propose a novel random smoothing-based SAM (R-SAM) algorithm. R-SAM essentially smooths the loss landscape and improves the approximation of the inner maximization.
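A minimal sketch of the general recipe on a toy quadratic loss (my paraphrase of the idea, not the paper's exact algorithm; `rho`, `sigma`, and the learning rate here are illustrative):

```python
import numpy as np

def loss(w):
    return 0.5 * (w ** 2).sum()   # toy quadratic loss

def grad(w):
    return w                      # its analytic gradient

def rsam_step(w, rng, lr=0.1, rho=0.05, sigma=0.01):
    """One R-SAM-style step: add a small random perturbation to the
    weights (smoothing), then take the SAM inner ascent step and
    descend from that perturbed point."""
    w_s = w + sigma * rng.standard_normal(w.shape)    # random smoothing
    g = grad(w_s)
    eps_adv = rho * g / (np.linalg.norm(g) + 1e-12)   # inner maximization (approx.)
    return w - lr * grad(w_s + eps_adv)

rng = np.random.default_rng(0)
w = np.array([1.0, -2.0])
for _ in range(100):
    w = rsam_step(w, rng)
print(loss(w))  # far below the initial loss of 2.5
```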
Excited to introduce our
#ICCV2023
paper Dataset Quantization (DQ). DQ achieves lossless training performance with a 2% data keep ratio on language tasks and a 60% data keep ratio on vision tasks. Just check out our paper and project:
The major source of the energy cost for training AI models comes from moving the data?
Communication costs 10x more energy than computation. Please correct me if I'm wrong :-)
For GPT-3:
The communication energy cost is 4.7e+26 PJ.
The computation energy cost is 3.6e+25 PJ.
PyTorch implementation of LARS for ImageNet:
PyTorch implementation of LAMB for ImageNet:
Both of them can achieve at least 76.7% accuracy in 90 epochs for both large batch sizes and small batch sizes.
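For reference, the layer-wise LARS update can be sketched like this (a simplified single-layer step without momentum; `trust_coef` and the epsilon are illustrative):

```python
import numpy as np

def lars_update(w, g, lr=0.1, weight_decay=1e-4, trust_coef=0.001):
    """One LARS step for a single layer. The trust ratio rescales the
    learning rate by ||w|| / ||g||, so every layer's step size is
    proportional to its own weight norm -- the key to stable
    large-batch training."""
    g = g + weight_decay * w                     # weight decay folded into g
    w_norm = np.linalg.norm(w)
    g_norm = np.linalg.norm(g)
    local_lr = trust_coef * w_norm / (g_norm + 1e-12) if w_norm > 0 else 1.0
    return w - lr * local_lr * g

w_new = lars_update(np.ones(4), np.full(4, 0.5))
print(w_new)
```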
Our new paper: ONES automatically manages the elasticity of each AI job based on the workload to maximize GPU utilization and improve scheduling efficiency. Experiments on 64 GPUs show great results. This paper will appear at
@Supercomputing
#SC21
I just published Embedding Training With 1% GPU Memory and 10 Times Less Budget, an Open Source Solution for Super-Large Recommendation Model Training on a Single GPU
FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders
abs:
Compared to the previous SOTA, FaceMAE consistently reduces the error rate by at least 50% on LFW, CFP-FP, and AgeDB
Chinese Academy of Sciences released a benchmark for fast AI training. They are not the first team to do this. MLPerf is already a huge success. But they have a good summary of how researchers reduced the ImageNet training time from 29 hours to 1 minute.
Neural Network Diffusion
Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also generate high-performing neural network parameters. Our approach is simple, utilizing an autoencoder and a
Thrilled to share that I will be joining ETH Zurich (
@ETH_en
) as an assistant professor in the CS department (
@CSatETH
). Super excited to move to Switzerland this autumn and work with the amazing students and faculty.