Maarten van Smeden Profile Banner
Maarten van Smeden Profile
Maarten van Smeden

@MaartenvSmeden

42,224
Followers
836
Following
1,428
Media
20,367
Statuses

Statistician & data scientist • Associate prof • Interested in health research methodology, Julius Center @umcutrecht • own views

Utrecht
Joined December 2010
Don't wanna be here? Send us removal request.
Pinned Tweet
@MaartenvSmeden
Maarten van Smeden
2 years
NEW PAPER I was challenged by @morleycp to write a 1000-word paper about a tweet. It was fun to do, and hopefully useful to some of you Open acces 👉
Tweet media one
28
395
1K
@MaartenvSmeden
Maarten van Smeden
2 years
If I throw my paper into a forest, I can claim on my CV that I have submitted to Nature
153
1K
19K
@MaartenvSmeden
Maarten van Smeden
4 years
How to become a SUCCESSFUL academic: a guide 1/n
172
3K
16K
@MaartenvSmeden
Maarten van Smeden
3 years
Students often ask me: why do so many choose 5% as the threshold for significance (“alpha”)? So let me try to explain the main rationale 1/20
211
2K
14K
@MaartenvSmeden
Maarten van Smeden
1 year
It is the only way
Tweet media one
96
2K
13K
@MaartenvSmeden
Maarten van Smeden
3 years
This is why programming is an acquired skill
215
4K
13K
@MaartenvSmeden
Maarten van Smeden
3 years
COVID has killed a lot of my heroes although none of them have died
127
1K
11K
@MaartenvSmeden
Maarten van Smeden
3 years
“After adjusting for known confounders, the association remained highly significant”
119
1K
11K
@MaartenvSmeden
Maarten van Smeden
3 years
Tweet media one
49
1K
11K
@MaartenvSmeden
Maarten van Smeden
2 years
This is wonderful @xkcd
Tweet media one
16
1K
9K
@MaartenvSmeden
Maarten van Smeden
3 years
Math question: ever wondered what is n^-1? Let me explain... 1/n
119
1K
9K
@MaartenvSmeden
Maarten van Smeden
4 years
Statistical terms: what they really mean Multicolinearity— they all look the same Heteroscedasticity— the variation varies Attenuation— being too modest Overfitting— too good to be true Confounding— nothing is what it seems P-value— it’s complicated
58
2K
8K
@MaartenvSmeden
Maarten van Smeden
3 years
This seems pretty accurate
Tweet media one
70
1K
7K
@MaartenvSmeden
Maarten van Smeden
6 years
The sound of machine learning that is just logistic regression
60
2K
6K
@MaartenvSmeden
Maarten van Smeden
4 years
who made this
Tweet media one
87
463
5K
@MaartenvSmeden
Maarten van Smeden
5 years
After my CV, personal and institution website, Google Scholar, ResearchGate, Publons, LinkedIn, Orchid, Web of Science, Scopus, Pure, Academia, I can’t wait for the next tool to simplify managing my academic profile
63
714
5K
@MaartenvSmeden
Maarten van Smeden
3 years
Have never seen a better way to describe the Dutch response to Covid
53
790
4K
@MaartenvSmeden
Maarten van Smeden
4 years
Who made this
Tweet media one
17
720
4K
@MaartenvSmeden
Maarten van Smeden
3 years
You are not even a real scientist until someone on Twitter says you don’t understand the one thing you have been studying for years
54
310
4K
@MaartenvSmeden
Maarten van Smeden
3 years
Once you realize p-values are probabilities relating to observing data rather than probabilities of an hypothesis being true, you are already doing significantly better than most people
43
418
4K
@MaartenvSmeden
Maarten van Smeden
5 years
Linear regression with bootstrapped standard errors
48
629
3K
@MaartenvSmeden
Maarten van Smeden
4 years
Trying to control the position of figures in MS Word
33
365
3K
@MaartenvSmeden
Maarten van Smeden
3 years
Could not resist
Tweet media one
41
525
3K
@MaartenvSmeden
Maarten van Smeden
4 years
Why did you become a scientist? Wrong answers only
3K
383
3K
@MaartenvSmeden
Maarten van Smeden
3 years
manuscript_v1.docx manuscript_v2.docx manuscript_v4b.docx manuscript_v5_with_comments.docx manuscript_v7_tv_tdl_mvd_omg.docx manuscript_final.docx manuscript_final_v2.docx manuscript_final_v2_with_comments.docx manuscript_final_v3.docx manuscript_final_FINAL.docx
80
199
3K
@MaartenvSmeden
Maarten van Smeden
3 years
Stats question: what is the relative contribution of a single individual in a sample of size n to the sample mean Let me tell you… 1/n
46
287
3K
@MaartenvSmeden
Maarten van Smeden
4 years
THIS
40
909
2K
@MaartenvSmeden
Maarten van Smeden
4 years
I sincerely hope this thread will help YOU become an even more successful academic n/n
94
47
2K
@MaartenvSmeden
Maarten van Smeden
4 years
Personal top 10 fallacies and paradoxes in statistics 1. Absence of evidence fallacy 2. Ecological fallacy 3. Stein’s paradox 4. Lord’s paradox 5. Simpson’s paradox 6. Berkson’s paradox 7. Prosecutors fallacy 8. Gambler’s fallacy 9. Lindsey’s paradox 10. Low birthweight paradox
39
661
2K
@MaartenvSmeden
Maarten van Smeden
3 years
Cannot stop watching this
43
548
2K
@MaartenvSmeden
Maarten van Smeden
1 year
Mandatory periodic reminder to this work of art
Tweet media one
31
217
2K
@MaartenvSmeden
Maarten van Smeden
3 years
Nature: you pay $10k for an open access publication Also Nature:
@Nature
nature
3 years
Study finds open-access papers have fewer lead authors from low-income countries than paywalled articles
132
218
766
24
411
2K
@MaartenvSmeden
Maarten van Smeden
4 years
absence of evidence ≠ evidence of absence measurement error ≠ bias towards null clinical trial ≠ study with patients multivariate ≠ multivariable 95%CI ≠ 95% probability p-value ≠ probability H0 correlation ≠ causation significant ≠ relevant bias ≠ error odds ≠ risk
30
591
2K
@MaartenvSmeden
Maarten van Smeden
5 years
If art were like scientific manuscripts Artist: worked some months on this painting that would fit your gallery I believe. Would you consider? Gallery: fill out these forms A: okay G: please remove the frame and attach it to the bottom A: what? Okay...
23
692
2K
@MaartenvSmeden
Maarten van Smeden
4 years
academics to other academics: stay in your lane the lanes:
Tweet media one
12
232
2K
@MaartenvSmeden
Maarten van Smeden
3 years
Probably the best introductory text on modern statistical learning I have read so far. Great for non-economists (like me) too. Highly recommended h/t @causalinf
Tweet media one
4
368
2K
@MaartenvSmeden
Maarten van Smeden
4 years
T-test, ANOVA and linear regression walk into a bar. Barman asks: are you here alone or are you waiting for someone?
41
193
2K
@MaartenvSmeden
Maarten van Smeden
5 years
First they ignore you, then they laugh at you, then they fight you, then you realize they never actually read your paper.
21
251
2K
@MaartenvSmeden
Maarten van Smeden
4 years
Statisticians, let’s be realistic we are terrible with branding. If it was up to us Deep Learning would have been called Hidden Layers Black Boxy Box
42
140
2K
@MaartenvSmeden
Maarten van Smeden
2 years
Terminology explained - Statistics: we fitted a curve through data points - Data science: we fitted a curve through data points - Machine learning: we fitted a curve through data points - Artificial intelligence: we fitted a curve through data points
40
221
2K
@MaartenvSmeden
Maarten van Smeden
3 years
Hi, I am here to report a dataviz crime
Tweet media one
52
164
2K
@MaartenvSmeden
Maarten van Smeden
2 years
Going to the gym, things are just normal
Tweet media one
29
89
2K
@MaartenvSmeden
Maarten van Smeden
3 years
RR: 0.50 [0.33; 0.99] 🚨🚨HOLY MOTHER OF GOD 🚨🚨treatment reduces cases by 50% RR: 0.50 [0.32; 1.01] treatment ineffective, study shows
26
134
1K
@MaartenvSmeden
Maarten van Smeden
4 years
🚨🚨NEW EVIDENCE PYRAMID🚨🚨
Tweet media one
30
293
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Absence of evidence
Tweet media one
14
250
1K
@MaartenvSmeden
Maarten van Smeden
4 years
1) Be the ultimate collaborator but also don't be Say yes to as many collaborations as physically possible: co-produce papers, LEARN, co-write grants, DISCUSS, it is all about synergy. But also, collaborations slow you down, have your own ideas! Just say no to collaborations
4
27
1K
@MaartenvSmeden
Maarten van Smeden
3 years
TFW you align your figures in MS word
Tweet media one
17
61
1K
@MaartenvSmeden
Maarten van Smeden
2 years
Data from UK or US: this is a very important study Data from anywhere else: this is an interesting study, but the authors should clarify how the data from X generalizes to other countries
27
201
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Tweet media one
10
209
1K
@MaartenvSmeden
Maarten van Smeden
2 years
Stop calling everything AI is a good suggestion
Tweet media one
34
208
1K
@MaartenvSmeden
Maarten van Smeden
3 years
Five simple ways to write a better scientific paper than your colleagues
14
296
1K
@MaartenvSmeden
Maarten van Smeden
2 years
Periodic reminder that for most statistical tests: It’s Just A Linear Model #IJALM
Tweet media one
14
206
1K
@MaartenvSmeden
Maarten van Smeden
3 years
Medical Research Bingo
Tweet media one
16
366
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Sensitivity analysis— tried a bunch of stuff Post-hoc— main analysis not sexy enough Multivariate— oops, meant to say multivariable Normality— a very rare shape for data Dichotomized— data was tortured Extrapolation— just guessing
12
151
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Top 10 statistics things you should NEVER do 1) dichotomize unnecessarily 2) conclude no effect from p>.05 3) use Hosmer–Lemeshow test 4) test normality of covariates 5) impute the mean for missing data 6) confuse correlation for causation 7) make top 10 never do lists
29
198
1K
@MaartenvSmeden
Maarten van Smeden
5 years
1. choose your supervisors carefully 2. choose your supervisors carefully 3. choose your supervisors carefully 4. choose your supervisors carefully ... 10. choose your supervisors carefully
@dynarski
Prof Dynarski
7 years
Earned a PhD? Pay it forward & help the next generation. What is your most important advice for a new PhD student? #phdtips
382
323
1K
24
167
1K
@MaartenvSmeden
Maarten van Smeden
5 years
“A prediction model was developed and externally validated” Footage of the validation:
23
216
1K
@MaartenvSmeden
Maarten van Smeden
5 years
Terminology explained - Regression: we used an algorithm - Machine learning: we used a fancy algorithm - Artificial intelligence: we used a VERY fancy algorithm, please don't ask
19
342
1K
@MaartenvSmeden
Maarten van Smeden
3 years
Police? I would like to report a dataviz murder
Tweet media one
34
131
1K
@MaartenvSmeden
Maarten van Smeden
3 years
Lots of relevant work in epidemiology is *descriptive* in nature and very often such work is NOT improved by "correcting" for a bunch of stuff in a multivariable regression or by doing automated variable selection. Sometimes, averages and percentages is just what you need
21
129
1K
@MaartenvSmeden
Maarten van Smeden
5 years
The biggest difference between statistics and machine learning may be in language! So a few months ago I created this (inspired by @DanielOberski ) but haven't made much progress since. Welcoming suggestions for improvements
Tweet media one
26
318
1K
@MaartenvSmeden
Maarten van Smeden
4 years
I love my job, but at least 35 more years seems terribly long to continue discussing whether a model for X predicting Y is statistics, statistical modeling, machine learning, artificial intelligence, statistical learning, data science, data analytics or just regression
32
104
1K
@MaartenvSmeden
Maarten van Smeden
4 years
2) Be the methods ninja but also don't be Science is only as good as its weakest link: don't be satisfied by applying the default analyses in the field. But also, don't let perfect be the enemy of the good and don't confuse reviewers. Just apply the default analyses in the field
2
26
1K
@MaartenvSmeden
Maarten van Smeden
4 years
3) Be the superstar teacher but also don't be Professor means teacher, it is LITERALLY in the name. Being a good professor means being a superstar teacher. But also, focus on the science and minimize the hours of teaching, don't try to become a superstar teacher
5
29
1K
@MaartenvSmeden
Maarten van Smeden
4 years
We thank the reviewers for their thoughtful comments. Below we will provide a point by point response
14
134
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Still consult this much more often than I like to admit
Tweet media one
18
168
1K
@MaartenvSmeden
Maarten van Smeden
2 years
This is my *top 10* favorite methods papers of 2022 Appearing in a single thread and in random order
12
252
1K
@MaartenvSmeden
Maarten van Smeden
4 years
Remember folks, it is only causal if it originates in the causalité region of France. Otherwise, it is just sparkling correlation
11
139
1K
@MaartenvSmeden
Maarten van Smeden
3 years
Making a meme within 3 minutes Twitter: ❤️ 2k, 🔁 500 Sharing work that costs 2 years of my life with never ending discussions, 56 drafts, 20 rounds of peer review, blood, sweat, tears and a kidney Twitter: ❤️ 12, 🔁 3, only one kidney?
25
85
1K
@MaartenvSmeden
Maarten van Smeden
4 years
I have had single rounds of peer review that took longer than it took to develop, test and get approval for a vaccine
20
74
963
@MaartenvSmeden
Maarten van Smeden
3 years
Periodic reminder of the best place on the whole internet
Tweet media one
18
90
983
@MaartenvSmeden
Maarten van Smeden
4 years
Too real...
Tweet media one
13
144
950
@MaartenvSmeden
Maarten van Smeden
2 years
When your simulation runs without error
8
89
947
@MaartenvSmeden
Maarten van Smeden
4 years
protocol, submitted manuscript, published article
9
110
942
@MaartenvSmeden
Maarten van Smeden
10 months
Do not understand why not every PI is hiring a statistician. There is a wealth of data showing that statisticians are very effective in making research slower, more difficult to understand for non-statisticians, analyses more expensive, results less impressive and more boring
42
78
936
@MaartenvSmeden
Maarten van Smeden
4 years
Pretty nervous about sharing my FIRST single author paper (still a preprint). Comments welcome Link:
Tweet media one
43
92
919
@MaartenvSmeden
Maarten van Smeden
2 years
Correlation vs causation
Tweet media one
16
135
913
@MaartenvSmeden
Maarten van Smeden
4 years
Statistics is hard. That’s it. That’s the tweet
32
78
911
@MaartenvSmeden
Maarten van Smeden
4 years
5) Be the literature addict but also don't be READ YOUR LITERATURE. Be the literature addict and know what is out there to prioritize your own science and become THE EXPERT. But also, there is just too much! Invest time spend on reading in writing your own stuff! DON'T READ
5
31
903
@MaartenvSmeden
Maarten van Smeden
4 years
R: could not find function "lenght" me: alright then... lenght <- length
25
43
893
@MaartenvSmeden
Maarten van Smeden
3 years
When people ask you to explain publication bias
9
154
896
@MaartenvSmeden
Maarten van Smeden
4 years
Linear regression— line through data points t-test— linear regression correlation— linear regression ANOVA— linear regression ANCOVA— linear regression Chi-square test— logistic regression Deep learning— bunch of regressions
5
161
882
@MaartenvSmeden
Maarten van Smeden
4 years
Statistical things to worry about *less* 1) significance of univariable associations 2) significant model goodness-of-fit tests 3) imbalance in randomized trials 4) non-normality of observations 5) multicollinearity
10
220
860
@MaartenvSmeden
Maarten van Smeden
2 years
Why are so few clinical prediction models actually implemented in medical practice? This leaky model implementation pipeline summarizes some of the reasons
Tweet media one
16
215
858
@MaartenvSmeden
Maarten van Smeden
2 years
Should a prediction model be developed?
Tweet media one
15
173
842
@MaartenvSmeden
Maarten van Smeden
4 years
4) Be the open science practitioner but also don't be A modern scientist is an open scientist. Open up your code, your data and your publications. But also, your code is messy, the data isn't yours to share and you should save the APC of open publishing to hire new lab members
5
19
832
@MaartenvSmeden
Maarten van Smeden
3 years
Why do we continue to focus on *doing* stats instead of stats comprehension and critical appraisal in the medical curriculum? I don’t care whether or not my doctor knows SPSS but I surely want them to be able to critically read their literature
38
95
833
@MaartenvSmeden
Maarten van Smeden
4 years
If you are looking for a short paper to read today, let it be this lovely 2-pager article by @ADAlthousePhD
Tweet media one
22
212
833
@MaartenvSmeden
Maarten van Smeden
4 years
6) Be the supreme knowledge sponge but also don't be Become the best in the world by borrowing knowledge from different scientific disciplines and by working in multidisciplinary teams. But also, be THE SPECIALIST. Focus on your own discipline and team, your CV is begging you
1
21
827
@MaartenvSmeden
Maarten van Smeden
4 years
10) Be the family person but also don't be Don't forget to live while becoming successful: family time should always be the number 1 priority. But also, all of the above should be number 1 priority
5
20
818
@MaartenvSmeden
Maarten van Smeden
4 years
How do I know how to become a successful academic? I don't, but I have received plenty of advice. As a good academic, I will just summarize what I have learned from listening
8
35
811
@MaartenvSmeden
Maarten van Smeden
5 years
1. Google for code 2. Google for code 3. Google for code 4. Google for code 5. Google for code 6. Google for code 7. Google for code
@Datasciencectrl
Data Science Central
5 years
Learning R in Seven Simple Steps
0
92
276
4
111
805
@MaartenvSmeden
Maarten van Smeden
4 years
WHAT??? h/t @swvanderlaan
Tweet media one
55
76
798
@MaartenvSmeden
Maarten van Smeden
4 years
Was asked for personal favorite resources for improving methods and statistics skills. I promised to make it a thread, so here it is 1/n
13
255
799
@MaartenvSmeden
Maarten van Smeden
3 years
Hahahahahahahahaha *breath* hahahahahahahahaha
Tweet media one
24
49
790
@MaartenvSmeden
Maarten van Smeden
5 years
I am moving from R to STATA, tips anyone?
@HackettKate
Kate Hackett
5 years
You've been kidnapped. Your kidnappers allow you to keep tweeting to pretend everything is alright. What would you tweet that would alarm your followers without the kidnappers knowing you're asking for help? "And then I put in the exact amount of garlic the recipe called for."
6K
1K
16K
35
50
785
@MaartenvSmeden
Maarten van Smeden
4 years
Based on my experience with simulation and advanced modeling, let me summarize my thoughts on this: HAHAHAHAHAHAHAHAHAHAHAHAHA
@nxthompson
nxthompson
4 years
Ray Kurzweil: "I believe that by the end of the decade we will be able realistically model all biology and simulate interventions for diseases without the need for human trials."
27
31
93
22
93
780
@MaartenvSmeden
Maarten van Smeden
4 years
7) Be the social media rockstar but also don't be Outreach! Show you can and will communicate with the public to explain your science. But also, TIME DRAIN! Surely your tenure track committee is not impressed by your 30k SoMe followers half of whom are bots anyway
4
19
782
@MaartenvSmeden
Maarten van Smeden
4 years
Besides statistics and programming, there seems to be no other profession that people will try to master in a weekend
70
67
769