Radek Osmulski 🇺🇦

@radekosmulski

25,844
Followers
579
Following
1,435
Media
8,179
Statuses

LLMs and retrieval by day, training robots on the weekend 🧪 Senior Data Scientist @NVIDIAAI 🏫 @fastdotai trained DL Eng

see my projects
Joined April 2014
Pinned Tweet
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Today is my first day at @NVIDIAAI! 🥳 -From learning to code at 29 -through learning ML @fastdotai -winning a @kaggle competition -jobs at 🔥 startups -moving continents thx to AI -to joining the illustrious Merlin team ❤️ I am beyond grateful 🙏 Will make this one count!
97
77
2K
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Mathematics for Machine Learning -- a 47-page introduction from UC Berkeley 🚀 • Linear Algebra • Calculus and Optimization • Probability A 100% free resource! Source:
Tweet media one
30
679
3K
@radekosmulski
Radek Osmulski 🇺🇦
7 years
Favorite recent jupyter notebook discovery - the %debug magic: 1. Get an exception. 2. Insert a new cell, type %debug and run it. An interactive debugger will open bringing you to where the exception occurred and allowing you to look around!
Tweet media one
17
424
1K
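Outside a notebook, the same post-mortem idea is available from the Python standard library. The following is a minimal sketch, not the tweet's own code: `average` is a hypothetical function that fails on empty input, and `innermost_frame` is an illustrative helper (not part of pdb or IPython). It locates the frame where the exception was raised, which is the frame a post-mortem debugger such as `%debug` or `pdb.post_mortem` starts in.

```python
import sys


def average(values):
    # Hypothetical example function: raises ZeroDivisionError on an empty list.
    return sum(values) / len(values)


def innermost_frame(tb):
    """Walk a traceback to the frame in which the exception was raised."""
    while tb.tb_next is not None:
        tb = tb.tb_next
    return tb.tb_frame


try:
    average([])
except ZeroDivisionError:
    tb = sys.exc_info()[2]
    frame = innermost_frame(tb)
    failing_function = frame.f_code.co_name   # name of the function that raised
    local_names = sorted(frame.f_locals)      # variables you could inspect there
    # In an interactive session, this opens the debugger at that same frame:
    # import pdb; pdb.post_mortem(tb)
```

In a Jupyter notebook none of this is needed: running `%debug` in a new cell after the exception gives the interactive equivalent.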
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I just became a @kaggle Grandmaster! 🥳
102
25
1K
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Ok, this is not something I expected 🤯 itertuples can be 50 times faster than iterrows!
Tweet media one
17
123
1K
@radekosmulski
Radek Osmulski 🇺🇦
2 years
To understand the foundations of NLP (pre-Transformers), where would you go? This 48-page paper is the answer 🤩 ✅ concise and clear explanations ✅ sklearn, spacy, and keras code snippets ✅ all the fundamentals of NLP in a single place
Tweet media one
20
148
779
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Merlin Dataloader is 119x faster than my own PyTorch Dataset + Dataloader combo! This is revolutionary for tabular data 🥳 Let's take a closer look at what is going on.
Tweet media one
19
131
784
@radekosmulski
Radek Osmulski 🇺🇦
5 years
This is probably the best intro to probability (along with the associated lectures), now available online for free ❤
@stat110
Joe Blitzstein
5 years
New edition of my probability book with Jessica Hwang is out! Read it FREE online at
Tweet media one
54
568
2K
0
166
780
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Curious about recommender models? Interested in endowing models from other domains with some of their superpowers? Please join me on a whirlwind tour of 6 recsys architectures! >> a thread 🧵 <<
Tweet media one
8
130
710
@radekosmulski
Radek Osmulski 🇺🇦
1 year
How does LangChain actually work? We see the wonderful things it can do, but what does it send to the model? What does the model send back? How does it all work? I decided to investigate 🕵️‍♂️ Here is how LangChain allows LLMs to perform Google searches:
Tweet media one
14
125
707
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Linear Algebra -- an introductory course on the mathematics at the heart of Machine Learning ✅ 37 bite-sized videos (< 10 minutes) ✅ stellar visualizations ✅ expertly delivered by lecturers from Imperial College London Link:
Tweet media one
15
122
667
@radekosmulski
Radek Osmulski 🇺🇦
2 years
3 ways to speed up your Python/pandas code by up to 10x that I learned from a recent @kaggle notebook:
21
88
650
@radekosmulski
Radek Osmulski 🇺🇦
2 years
How to speed up your tabular data processing by 1053x A tutorial on how to vectorize a complex operation in pandas/cudf using a boolean mask Bonus at the end: how to seamlessly run on the GPU with arbitrarily large data 1/19
Tweet media one
Tweet media two
8
85
651
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Interested in working with sound in Python? 🎶🎙️🥁 now has full colab support - you can run the notebooks at a click of a button 🙂 Also, new addition - how to work with large, multi-gigabyte wav files
Tweet media one
6
182
646
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Two techniques to speed up your Python code by 2 - 200x: 1. Use list comprehensions (they benefit from optimizations that for loops do not). 2. Use sets when checking for membership.
Tweet media one
Tweet media two
11
82
618
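The two techniques in the tweet above can be sketched with the standard library alone. This is an illustrative example rather than the code from the tweet's screenshots: the function names and data sizes are assumptions, and the actual speedup depends on the workload, so no timings are claimed here.

```python
def squares_loop(n):
    """Build a list with an explicit for loop and repeated append calls."""
    result = []
    for i in range(n):
        result.append(i * i)
    return result


def squares_comprehension(n):
    """Build the same list with a list comprehension."""
    return [i * i for i in range(n)]


def count_known(queries, known):
    """Count how many queries are contained in `known` (a list or a set)."""
    return sum(1 for q in queries if q in known)


known_list = list(range(0, 10_000, 2))  # even numbers; `in` scans the list item by item
known_set = set(known_list)             # same items; `in` is a hash lookup
queries = range(1_000)

hits_list = count_known(queries, known_list)
hits_set = count_known(queries, known_set)
```

Both variants return identical results; to compare their speed on your own machine, wrap the calls in `timeit.timeit` from the standard library.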
@radekosmulski
Radek Osmulski 🇺🇦
1 year
Programming has changed forever 😳 Today I used GPT-4 for the first time and I can't imagine coding without it ever again. Here is the project I worked on:
26
71
590
@radekosmulski
Radek Osmulski 🇺🇦
1 year
I switched to vscode for running Jupyter Notebooks so that I can use @github Copilot and wow 🥰 (that is coming from a vim/tmux/jupyter notebook die-hard fan) Here is my experience:
25
40
580
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Summary of BatchNorm, LayerNorm, InstanceNorm and GroupNorm
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
117
554
@radekosmulski
Radek Osmulski 🇺🇦
6 years
This is how little code it takes to implement a siamese net using @fastdotai and @pytorch . I share this because I continue to be amazed.
Tweet media one
12
111
526
@radekosmulski
Radek Osmulski 🇺🇦
6 years
1yr ago I gave up on ML. I didn't know what to learn nor how. After a 5 mths break I decided to give ML one last try. If it would not work out I would need to let it go to not continue to waste my time - maybe I am unable to learn this. I then signed up for the @fastdotai course
Tweet media one
45
66
510
@radekosmulski
Radek Osmulski 🇺🇦
5 years
I created a little resource to help one practice their deep learning skills! Only 1st lecture of part 2 v3 for now but maybe there will be more 🙂
19
91
468
@radekosmulski
Radek Osmulski 🇺🇦
5 years
I think I have discovered a new level of freedom in dealing with files from Python 🙂 Yet another aspect of @ProjectJupyter that is pure 🔥🔥🔥
Tweet media one
16
65
468
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I am launching a new blog -- TabularMusings 🥳 Here is the first blog post: And here is the technology I am using and the reasons for starting the blog:
16
72
422
@radekosmulski
Radek Osmulski 🇺🇦
2 years
3 surprisingly effective techniques for training Computer Vision models I used to win a @kaggle competition Here is how you can apply them in your projects:
8
83
413
@radekosmulski
Radek Osmulski 🇺🇦
2 years
since I started to run things in parallel I cannot stop myself
Tweet media one
5
18
410
@radekosmulski
Radek Osmulski 🇺🇦
1 year
💡 How to use ChatGPT in Jupyter Notebook I created a cell magic that will allow you to talk to the OpenAI API directly from your notebook:
Tweet media one
20
67
404
@radekosmulski
Radek Osmulski 🇺🇦
2 years
The most important skills for Machine Learning: • Python • Linux (env setup, ssh, moving files, editor) • git • pdb • creating good train - val - test splits • ability to scan papers for relevant information • a learner mindset • clear writing Anything I missed?
38
52
390
@radekosmulski
Radek Osmulski 🇺🇦
5 years
Learning embedding space for location?! Turns out the answer is... yes! 🙂 Two absolutely fascinating reads with a lot of good information on approaching DL projects in general. The first one is a blog post by @sentiance.
Tweet media one
Tweet media two
6
83
392
@radekosmulski
Radek Osmulski 🇺🇦
4 years
How to win @kaggle: ✅ join a competition early ✅ read forums daily ✅ make small improvements every day ✅ find validation split that tracks LB ✅ posts by top kagglers will take you 80% of the way ✅ papers, blog posts, creativity 👉 remaining 20% ✅ ensemble results
@radekosmulski
Radek Osmulski 🇺🇦
6 years
1yr ago I gave up on ML. I didn't know what to learn nor how. After a 5 mths break I decided to give ML one last try. If it would not work out I would need to let it go to not continue to waste my time - maybe I am unable to learn this. I then signed up for the @fastdotai course
Tweet media one
45
66
510
7
53
374
@radekosmulski
Radek Osmulski 🇺🇦
6 years
12 months after starting the @fastdotai Deep Learning for Coders course, a little bit less after joining my first competition, I am now a @kaggle competition master.
Tweet media one
15
21
367
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Going From Not Being Able To Code To Deep Learning Expert ✅ what are the notes to hit when learning to program ✅ how to practice Deep Learning effectively A detailed guide based on my experience:
6
79
353
@radekosmulski
Radek Osmulski 🇺🇦
3 years
THREAD: The hardest things about learning deep learning online (based on personal experience): ✅ outmaneuvering thousands of engineers at Reddit, Netflix, Twitter, etc, fighting for your attention ✅ carrying on through long periods of time where you don't notice progress
9
44
352
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I just became a 2x @kaggle Grandmaster! 🥳
24
6
354
@radekosmulski
Radek Osmulski 🇺🇦
3 years
What a cool project! Plus the code is available and there is a step by step walkthrough in the Jupyter NB 🤩 Not sure how it is possible, but the blog post is even better. Model training is done with @fastdotai! What a treasure! Added to my reading list.
@jimmie_munyi
Jimmie Munyi
3 years
I used Deep Learning to create a Real Time Sign Language Classifier that runs on the webcam. The whole project is done from scratch. You can find all the code in my github repo: Here is a blog post on the same topic: 😃😃
10
60
293
3
45
329
@radekosmulski
Radek Osmulski 🇺🇦
3 years
The last job I used a CV to get was a developer role from 6 years ago. I then did the @fastdotai courses. I followed the advice and started to blog. Roles started to find me. And they were outstanding. What makes such a big difference here?
13
43
328
@radekosmulski
Radek Osmulski 🇺🇦
2 years
How to train and deploy an XGBoost model in a single notebook 1/8
9
50
317
@radekosmulski
Radek Osmulski 🇺🇦
5 years
Everything you never knew you wanted to know about Python decorators (and more)!
Tweet media one
Tweet media two
Tweet media three
1
45
309
@radekosmulski
Radek Osmulski 🇺🇦
2 years
nvt op Tuesday! 🤖 Remember those @kaggle kernels where a person goes meticulously through the hundreds of columns in a DataFrame to reduce their size? What if I told you there exists an automated way to do just that? 😎
Tweet media one
8
48
304
@radekosmulski
Radek Osmulski 🇺🇦
3 years
🧵 How to become a Machine Learning Engineer without putting in the work? This thread contains everything I know on the subject. All the tips and tricks I learned over the last 8 years. Here it is.
22
42
302
@radekosmulski
Radek Osmulski 🇺🇦
3 years
What advice would you give yourself if you were starting out learning Machine Learning again?
80
37
298
@radekosmulski
Radek Osmulski 🇺🇦
4 years
This is what overemphasizing math when learning ML can lead to. Not a pretty picture 👇
Tweet media one
Tweet media two
12
32
289
@radekosmulski
Radek Osmulski 🇺🇦
3 years
Meta Learning is out! 🚀🥳 Above all, thank you for the warmth and support that you have shown me here on Twitter. That means the world to me and is completely out of this world 🥰 If you would like to continue helping me, any feedback would be greatly appreciated 😊
Tweet media one
12
32
290
@radekosmulski
Radek Osmulski 🇺🇦
4 years
TIL about log-mean-exp pooling! Max pool can produce very sparse gradients, making it hard for the network to learn. Avg pool is sometimes not applicable to the task. LME pooling allows you to find a spot somewhere between the two extremes.
Tweet media one
Tweet media two
Tweet media three
5
55
294
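The "between the two extremes" behaviour can be checked numerically. The tweet itself gives no formula, so this sketch assumes the commonly used definition LME(x) = (1/r) · log(mean(exp(r · x))), where the sharpness parameter `r` and the function name `lme_pool` are illustrative choices. Small `r` approaches average pooling and large `r` approaches max pooling.

```python
import math


def lme_pool(values, r=1.0):
    """Log-mean-exp pooling over a non-empty sequence of floats.

    Computes (1 / r) * log(mean(exp(r * v))) with a max-shift for
    numerical stability. `r` must be non-zero.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if r == 0:
        raise ValueError("r must be non-zero")
    shift = max(r * v for v in values)  # avoids overflow in exp
    total = sum(math.exp(r * v - shift) for v in values)
    return (shift + math.log(total / len(values))) / r


activations = [0.0, 1.0, 4.0]
near_mean = lme_pool(activations, r=0.001)  # close to the average of the inputs
in_between = lme_pool(activations, r=1.0)   # between average and maximum
near_max = lme_pool(activations, r=100.0)   # close to the maximum of the inputs
```

In a real network the same expression would be applied over the pooled axis with a tensor library; plain Python is used here only to keep the sketch self-contained.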
@radekosmulski
Radek Osmulski 🇺🇦
9 months
It is my great pleasure to share that I am now a 3x @kaggle Grandmaster! 🥳 (Datasets being the most recent addition 🙂)
Tweet media one
33
8
296
@radekosmulski
Radek Osmulski 🇺🇦
3 years
THREAD: Can you start learning cutting-edge deep learning without specialized hardware? 🤖 In this thread, we will train an advanced Computer Vision model on a challenging dataset. 🐕🐈 Training completes in 25 minutes on my 3yrs old Ryzen 5 CPU. Let me show you how...
8
55
282
@radekosmulski
Radek Osmulski 🇺🇦
2 years
How to train your TensorFlow or PyTorch RecSys models 400x faster? Use Merlin Dataloaders 😄 Two new examples just got merged! 🥳 ✅ TensorFlow: ✅ PyTorch:
Tweet media one
8
53
290
@radekosmulski
Radek Osmulski 🇺🇦
3 years
The law of working on machine learning projects: ✅ you are unable to tell if a problem can be solved until you build a baseline ✅ any time estimates you make before building a baseline are fortune-telling
9
31
287
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Are you serving machine learning models in production? How are you deploying your models? What technology do you use?
63
32
276
@radekosmulski
Radek Osmulski 🇺🇦
5 years
This is my favorite tweet of all time 🙂 I got quite a few new followers so thought I'd share it with you. One thing I would add is that `l` and `ll` give you context around the line where the exception was raised.
@radekosmulski
Radek Osmulski 🇺🇦
7 years
Favorite recent jupyter notebook discovery - the %debug magic: 1. Get an exception. 2. Insert a new cell, type %debug and run it. An interactive debugger will open bringing you to where the exception occurred and allowing you to look around!
Tweet media one
17
424
1K
1
56
270
@radekosmulski
Radek Osmulski 🇺🇦
2 years
What does docker give you as a Data Scientist? • reproducibility? • ease of switching between envs? • anything else? I keep meeting people gaslighted into using docker where it doesn't make any sense in their situation. Thinking of writing about this.
60
20
270
@radekosmulski
Radek Osmulski 🇺🇦
2 years
To learn Machine Learning fast you need to use the power of constraints. Here is what they are and how you can use them to learn 12x faster:
10
50
269
@radekosmulski
Radek Osmulski 🇺🇦
2 years
My second week at @NVIDIA is over! The most amazing thing is that everyone I run into (HR, IT, teammates) is both ✅ great at what they do ✅ very friendly I am starting to suspect this is by design 😄 And yeah, there are GPUs 😻
Tweet media one
11
8
264
@radekosmulski
Radek Osmulski 🇺🇦
2 years
A junior data scientist begins working on a project by tweaking the architecture to improve results. A senior data scientist starts by learning about the business problem that the model is attempting to solve.
5
21
256
@radekosmulski
Radek Osmulski 🇺🇦
3 years
Being good at ML vs being good at talking online about ML are two very different things and don't often go hand in hand.
4
19
261
@radekosmulski
Radek Osmulski 🇺🇦
3 years
To learn quickly -- build lots of small projects. To impress hiring managers -- work on one big project that is truly impressive.
4
29
253
@radekosmulski
Radek Osmulski 🇺🇦
2 years
The opportunity cost of not learning AI is very high.
7
25
255
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I am in love with Optuna. This is one of the least BS libraries I have ever come across. "You want your parameters optimized? Good. Because that is what we do" A tutorial on candidate generation (retrieval) with Merlin Models coming soon 🙂
Tweet media one
8
24
252
@radekosmulski
Radek Osmulski 🇺🇦
2 years
so I heard this parquet file format is quite neat
Tweet media one
11
19
255
@radekosmulski
Radek Osmulski 🇺🇦
2 years
ouch
Tweet media one
7
33
251
@radekosmulski
Radek Osmulski 🇺🇦
5 years
I have never been too excited about @spacy_io , but OMG have I been wrong! I'm only starting to use it and it has the nicest human friendly API I have ever come across! Have you seen spacy.explain?! How cool is that! And then there is the amazing course:
Tweet media one
5
50
251
@radekosmulski
Radek Osmulski 🇺🇦
4 years
I finished watching @fastdotai v4 part 1 lectures and it is the best @fastdotai course yet! 🤗 Highlights: ✅ amazing intro to history of DL ✅ THE learning path -> create your own dataset, train a model, build an app, deploy ✅ unbelievably good ethics lecture by @math_rachel
10
28
250
@radekosmulski
Radek Osmulski 🇺🇦
2 years
What is the best way to get started with @PyTorch? This tutorial by @jeremyphoward! ✅ implement a fully-fledged NN from scratch ✅ learn foundational Deep Learning concepts along the way ✅ get exposed to best practices (set_trace, etc) Link:
Tweet media one
6
47
243
@radekosmulski
Radek Osmulski 🇺🇦
2 years
The @kaggle OTTO RecSys competition is underway! 🥳🚀 I share everything that I know. So far: • 13x 🥇 medals for posts • 4x 🥇 medals for kernels If you'd like to jump into RecSys and be guided by me, I've put together this post for you: see you there! 🙌🙂
Tweet media one
9
38
242
@radekosmulski
Radek Osmulski 🇺🇦
2 years
If you were looking for an introduction to modern ML techniques with practical examples, which book should you choose? 🤔 I had the pleasure to review the new book by @rasbt and I believe it to be just the answer 🙂 And a very good one at that! Here is what I found >> a 🧵 <<
Tweet media one
2
30
238
@radekosmulski
Radek Osmulski 🇺🇦
6 years
Object detection using @fastai with only 300 examples in the training set. Total train time: 3 min Please note the model definition - again very few lines of code. Annotated images myself - details in the NB. Shown in red are predictions.
Tweet media one
Tweet media two
0
41
242
@radekosmulski
Radek Osmulski 🇺🇦
2 years
A new @kaggle competition just launched! 🥳 New Playground Series, first in 2023! 🚀 Nice, clean tabular dataset where feature engineering will most likely shine ❤️‍🔥 If you would like to jump right in, I created a starter notebook: Happy Kaggling! 🙂
2
40
240
@radekosmulski
Radek Osmulski 🇺🇦
8 months
Probably the best, most actionable advice for getting started on @kaggle I have ever come across. From a 4x @kaggle GrandMaster and teacher extraordinaire of all ML things and beyond, Chris Deotte:
2
38
239
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I love peering into the future and watching the GTC keynote is my favorite way to do so! Here are my highlights from the keynote from a few hours ago. Lots of amazing developments so buckle up and let's go for a ride! 🙂 1 of 15
Tweet media one
Tweet media two
6
48
236
@radekosmulski
Radek Osmulski 🇺🇦
5 years
This guest lecture by @jeremyphoward was a great watch. A very thorough overview of novel techniques essential to doing deep learning well. Awesome refresher plus I picked up a couple new things that seem extremely useful!
1
42
236
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Information on recommender systems is hard to come by. But did you know that @eugeneyan has put together a list of ✅ 68 RecSys papers and articles ✅ 57 papers and articles on Search and Ranking This is amazing 🥳🍾 thank you so very much for this!!!
Tweet media one
Tweet media two
4
52
232
@radekosmulski
Radek Osmulski 🇺🇦
5 years
let me please share with you a full docker setup for serving a @fastai model. The repository includes: ✅ jupyter NB for training and saving the model ✅ starlette endpoint performing inference ✅ rails frontend all < 40 lines of code (excl HTML + train)
Tweet media one
Tweet media two
Tweet media three
2
44
231
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Would you like to study @fastdotai v4 with me? I created this little companion app while going through the course you can find here: Currently only lec 1 & 2 but will be releasing a new lecture every Monday. Fun fact: the lectures do not end at 7! 😄
Tweet media one
10
50
236
@radekosmulski
Radek Osmulski 🇺🇦
4 years
What I thought becoming employable in deep learning would be: ✅ learning a lot of math What it ended up being: ✅ learning how to talk about your work ✅ being able to point to the things you've done ✅ figuring out how to use social media and limiting the negative footprint it has
7
18
231
@radekosmulski
Radek Osmulski 🇺🇦
2 years
It took me 8 years to learn this truth about learning ML. I'll teach it to you in the next 5 minutes:
8
31
228
@radekosmulski
Radek Osmulski 🇺🇦
2 years
I just gained a superpower 🥳 Thank you so much @fastdotai for execnb! 🙏🙏🙏 I can now: • treat my notebooks as inputs • modify code/set variables ➡️ run experiments • create data flows as I please OMG can't wait to see what I will build with this 😄 A dream come true 🥰🙏
Tweet media one
6
22
226
@radekosmulski
Radek Osmulski 🇺🇦
2 years
What my colleagues have achieved is amazing 🙂 • 683x speedup vs CPU • 43x speedup vs single A100-80GB GPU Key: use the high memory bandwidth of the GPU for embedding lookups Available with just three lines of Python code (TF2) 🙂 Read more here:
Tweet media one
0
51
215
@radekosmulski
Radek Osmulski 🇺🇦
3 years
The world is conspiring to keep you debt-laden and working longer than you physically can. In the process, you miss out on life. Learn a high-value skill (machine learning, programming, design) to set yourself free.
18
25
207
@radekosmulski
Radek Osmulski 🇺🇦
5 years
This is neat 😊 ✅ Train a model ✅ Realize you want some functionality the framework doesn't provide ✅ Define said functionality in your notebook ✅ 🥳🎉💃 (this is using @fastdotai v2)
Tweet media one
2
20
211
@radekosmulski
Radek Osmulski 🇺🇦
2 years
How to gain a competitive edge over 95% of people learning Machine Learning. Time for a thread ☕
6
32
208
@radekosmulski
Radek Osmulski 🇺🇦
1 year
This talk between Jensen Huang and @ilyasut from two days ago is a masterclass on what makes ChatGPT tick! And an awesome first-hand account of Deep Learning history. @ilyasut shared 2 core ideas behind @OpenAI :
Tweet media one
7
25
209
@radekosmulski
Radek Osmulski 🇺🇦
4 years
sweet little function for parallelizing processing of DataFrames 🙂 All credit goes to: Very RAM heavy but useful in some scenarios
Tweet media one
4
26
197
@radekosmulski
Radek Osmulski 🇺🇦
3 years
THREAD: What is SHAP? SHAP = SHapley Additive exPlanations of model predictions on a particular example. They answer the question: how much did each feature contribute to the prediction? 💡 Here is how SHAP does its magic.
Tweet media one
4
41
199
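The idea behind the question "how much did each feature contribute" can be illustrated with exact Shapley values on a toy model. This is a sketch of the underlying concept only, not the `shap` library's API or its approximation algorithms: the model `toy_predict`, the all-zeros baseline standing in for "missing" features, and the helper `shapley_values` are all assumptions made for illustration.

```python
from itertools import permutations


def toy_predict(features):
    # Hypothetical model: one additive term plus one interaction term.
    return 2.0 * features[0] + features[1] * features[2]


def shapley_values(predict, example, baseline):
    """Exact Shapley values by averaging marginal contributions
    over every ordering in which features can be switched on.

    A feature that is "off" takes its value from `baseline`.
    Cost grows factorially, so this only suits a handful of features.
    """
    n = len(example)
    totals = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)
        previous = predict(current)
        for i in order:
            current[i] = example[i]           # switch feature i on
            updated = predict(current)
            totals[i] += updated - previous   # its marginal contribution
            previous = updated
    return [t / len(orderings) for t in totals]


example = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
contributions = shapley_values(toy_predict, example, baseline)
```

The contributions add up to the difference between the prediction for the example and the prediction for the baseline, which is the "additive" part of the name; the interaction term is split evenly between the two features involved.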
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Today I am opening my upcoming book for pre-orders! 🥳 👉👈 20 chapters and 15 000 words so far 🙂 Each chapter received at least a single rewrite. Still a long way to go 🙂 The price will increase from $12.5 to $25 upon release.
Tweet media one
19
19
198
@radekosmulski
Radek Osmulski 🇺🇦
2 years
The @kaggle Amex competition just ended 🥳 With it comes this awesome write-up by @ChrisDeotte who earned (another 😄) 🥇: • only two but very different archs • leak-free ensembling and feature selection And here is the kicker...
6
27
203
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Can one deliver a RecSys masterclass with a focus on • online vs batch predictions • monitoring: data distribution shifts • model deployment in 25 minutes? 🤔 Apparently, @chipro can! 🙂 A must-watch Link to video:
Tweet media one
8
16
200
@radekosmulski
Radek Osmulski 🇺🇦
2 years
If you are a data scientist, introduce yourself below 👇 This is an awesome community of people working with data 😊 Let's say hi and learn from each other!
127
17
200
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Anki cards for @fastdotai v4 course are here! (all 8 lectures) These are based off questions from
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
33
202
@radekosmulski
Radek Osmulski 🇺🇦
4 years
Received my copy today 🥰 Not only the how, but more importantly the why that will take you from your first encounter with deep learning to state of the art across so many domains. What an amazing book!!!
Tweet media one
7
18
189
@radekosmulski
Radek Osmulski 🇺🇦
5 years
Came across this really well written intro to HTML & CSS (covers more advanced concepts like flexbox and responsive design as well). If you are starting to learn web dev or would like to brush up on some of the concepts, this seems like a great resource
3
46
191
@radekosmulski
Radek Osmulski 🇺🇦
5 years
I just created my first ever Python pkg in 50 mins. Pre- @fastdotai I would read docs, not understand much, write 1 line of code, read more docs, run out of time, get discouraged. Now I skim docs, skim tutorial, copy license, copy and edit , hack readme,
3
12
185
@radekosmulski
Radek Osmulski 🇺🇦
5 years
A lot of very valuable thoughts packed into 21 minutes by @martinfowler ✅ what is 'agile' really? ✅ what is the role of a developer in a well functioning org? ✅ what are the ethics of being a programmer? ✅ is software industry a meritocracy?
1
37
182
@radekosmulski
Radek Osmulski 🇺🇦
2 years
The most important Machine Learning skill: 👉 How to create a good validation set. 👈 But most people make a couple of basic mistakes. Read this, and I guarantee you won't be one of them: >> a thread 🧵 <<
6
34
181
@radekosmulski
Radek Osmulski 🇺🇦
4 years
How to build your personal website and serve it using GitHub Pages for free: 👉 👉 My new personal site: 👉 built without writing a single line of CSS using 👇 in around 2 hours
@jeremyphoward
Jeremy Howard
4 years
mvp.css is a nice easy way to quickly make a decent-looking web page, without even needing any class attributes. View the source of this page to see how clean the HTML is.
Tweet media one
3
34
303
3
30
177
@radekosmulski
Radek Osmulski 🇺🇦
4 years
This is so awesome I need to preserve this for posterity (AKA myself a week from now trying to remember how to do this...) 😄 Excel style conditional formatting using pandas 🤩 + font-size control From @fastdotai's fastbook:
Tweet media one
4
26
177
@radekosmulski
Radek Osmulski 🇺🇦
2 years
Taking MOOCs changed my life. • they reignited my passion for learning • gave me the confidence I can learn complex things on my own • taught me marketable skills MOOCs mattered to me. Be careful before dismissing their enormous potential.
6
10
178
@radekosmulski
Radek Osmulski 🇺🇦
3 years
🎥 Modern Artificial Intelligence 1980s - 2021 by @SchmidhuberAI! This talk delivers! 🙂 ✅ starts with the Big Bang (literally) ✅ history of everything explained in the first 10 minutes ✅ only accelerates from there 🙂 👉 Here is what I learned...
Tweet media one
4
33
173
@radekosmulski
Radek Osmulski 🇺🇦
3 years
Extremely big heartfelt thank you to everyone who made this possible ❤️ I started taking @fastdotai courses thinking I could achieve anything I applied myself to on my own. But that was a misconception. You are only as strong as the people around you.
Tweet media one
Tweet media two
29
4
177
@radekosmulski
Radek Osmulski 🇺🇦
2 years
What is the one resource I would recommend for anyone getting into RecSys? This lecture by @xamat. • it covers several foundational methods • more importantly, it will teach you how to think about RecSys problems Here are a couple of highlights:
3
26
172
@radekosmulski
Radek Osmulski 🇺🇦
1 year
I ran into SLURM but its docs are written for MLOps engs with 20 years of experience. So I wrote a blog post from the perspective of someone who does devops only when they need to. SLURM is not scary at all! 😊 Maybe some of this can be of help to you!
5
29
175
@radekosmulski
Radek Osmulski 🇺🇦
4 years
This is an introduction to a book that I am writing. 👉👈 After reading this, what do you really want the rest of the book to cover?
Tweet media one
Tweet media two
Tweet media three
Tweet media four
19
10
169