David Sumpter Profile Banner
David Sumpter Profile
David Sumpter

@Soccermatics

33,665
Followers
1,911
Following
1,814
Media
10,338
Statuses

Professor of Applied Maths and Author. Co-founder and data scientist at Twelve football.

Uppsala, Sweden
Joined January 2015
Don't wanna be here? Send us removal request.
Pinned Tweet
@Soccermatics
David Sumpter
1 month
Wordalisations: automated explanations of data. It is my great pleasure to announce that @twelve_football have created an open-source educational version of our Twelve GPT product. Now you can learn how to create football and other wordalisations!
1
7
42
@Soccermatics
David Sumpter
2 years
Do you want to learn football analytics? 🤔 You have come to the right place! ✅ My course 'Soccermatics: mathematical modelling of football' is available now. AND IT IS FREE.🤗
71
537
3K
@Soccermatics
David Sumpter
2 years
Look in the bottom right corner of this graph. 👀 The two teams with the lowest xG/shot against and highest xG/shot for are owned by professional gamblers who made their fortunes using expected goals. Football analytics really does work...
Tweet media one
26
195
2K
@Soccermatics
David Sumpter
2 years
Expected Threat (xT) is one of the most important (but least well-understood) statistics in football. ⚽️⚽️ It was used by Liverpool (they call it Goals Added) to do some of their best scouting. 📈 And now, YOU can learn all about it...🧵
22
208
2K
@Soccermatics
David Sumpter
2 years
Arsenal are very good this season. Here is a thread with the numbers which explain why... (Yes, of course being 1st in the league is also an important number😉) Before today they have outperformed their opponents on xG in every match.
Tweet media one
11
213
1K
@Soccermatics
David Sumpter
2 years
A thread on Liverpool using numbers. Six games is great for using expected goals and expected threat to find out what is going on. First of all they have 'won' all six on xG.
Tweet media one
49
201
1K
@Soccermatics
David Sumpter
8 months
Hammarby are doing something of a first in football analytics. The club has set up metrics for their style of play, which will be communicated throughout the season with fans, the board and players alike.
Tweet media one
14
83
935
@Soccermatics
David Sumpter
4 years
The complete guide course to football analytics is now available for free. Covering everything from getting started in python, through expected goals, simulating matches and evaluating players, to tracking data and expected possession value. Enjoy!
6
87
608
@Soccermatics
David Sumpter
2 years
The Future of Life Institute is a problem. Being in same age-group (lower end maybe😃)/cultural background (8-bit programming📼⌨️) as these men, makes me feel uncomfortable and embarrassed for them. Here they (Musk, Hassabis, Tegmark, Bostrom etc.) are in 2017.
Tweet media one
14
130
501
@Soccermatics
David Sumpter
2 years
How should we do mathematical modelling? With more and more strange ideas flying about AI (It can read minds! it is truly intelligent and might take over the world!) we need to refocus. This is what I do in this paper with Linnea Gyllingberg and @Abebab .
Tweet media one
7
97
477
@Soccermatics
David Sumpter
3 years
Almost twenty thousand fans welcome @Hammarbyfotboll women’s team on to the pitch.
6
39
477
@Soccermatics
David Sumpter
4 years
Mathematical modelling in football course starts 31st of August. Free to all EU citizens. Full syllabus and details how to register here:
32
76
475
@Soccermatics
David Sumpter
3 years
You might not be interested in football or maths, but you HAVE to watch @maramperninety here. A 20-year-old Muslim woman, she is one of the absolute best at combining these two (otherwise) male-dominated areas. She talks so eloquently about her journey.
6
81
476
@Soccermatics
David Sumpter
3 years
If we are going to be able to scout players using data, we have to use an understanding of the game. I explain how to do this using Professor Paco Seirul·lo’s zonal method.
Tweet media one
3
54
430
@Soccermatics
David Sumpter
3 years
It never ceases to amaze me how close goal distributions are to Poisson. This is distribution of number of goals in a match since 2005 in the Premier League. Bars are histogram. Line is Poisson distribution. 0-0 draws are the only outlier.
Tweet media one
17
30
404
@Soccermatics
David Sumpter
6 years
News from me. I am now working as data scientist at @Hammarbyfotboll football club. I will be active at all levels of the club: from working with players to communicating with fans. Bringing top quality analytics to #bajen
46
19
405
@Soccermatics
David Sumpter
2 years
They are best in league at getting the ball in to the final third and in to the box per match.
Tweet media one
5
42
369
@Soccermatics
David Sumpter
4 years
Here is my list, based on some research I made during my course, who should get credit for first application of things in football analytics. Expected goals, Expected Threat, Pitch Control etc.....
5
52
357
@Soccermatics
David Sumpter
4 years
The Masterclass is Ready! Mathematical Modelling Course in Football videos now in a start to finish playlist. With amazing lectures by @EightyFivePoint , @the_spearman , @suds_g , @lucpappalard , @JanVanHaaren and @petermckeever .
Tweet media one
6
49
335
@Soccermatics
David Sumpter
2 years
The first video lecture in 'Soccermatics: mathematical modelling of football' is here! 🤗 Learn how to make fantastic pitch visualisations. 💡
4
26
306
@Soccermatics
David Sumpter
6 years
The genius of Barcelona players. Xavi takes in information very fast. Iniesta creates unique solutions using the info. Dotted line is population average.
Tweet media one
5
72
279
@Soccermatics
David Sumpter
9 months
An innovation in latest release of @twelve_football TwelveGPT is a transfer model. Here is how players perform as they move between leagues in terms of passing quality. Players coming from La Liga can hold the same level of passing after a move to the Premier League. But...
Tweet media one
9
33
295
@Soccermatics
David Sumpter
2 years
Want to learn how to build a neural network model that uses @statsbomb 360 data to create an expected goals model which accounts for player positions when shot is taken? @aleksander_and has created a step by step implementation of exactly that!
3
22
284
@Soccermatics
David Sumpter
3 years
Applications now open for Mathematical Modelling of Football. This course is taught by me in Sep-Oct for 8 weeks (10-15 hours work per week). Deadline for application is 15th April. Application here: Course website 2021:
10
72
279
@Soccermatics
David Sumpter
10 months
When a football player peaks in performance depends very strongly on what you want from them. Model fit on event data from seven seasons of Premier League, La Liga, Bundesliga, Ligue 1 and Series A.
Tweet media one
4
30
272
@Soccermatics
David Sumpter
2 years
It takes attention from the real research that has been done in this area, by Gebru, Mitchell, Bender, O'Neil and many others who have worked on good data practices, the costs associated with AI and the real dangers involved.
Tweet media one
4
59
269
@Soccermatics
David Sumpter
10 months
When scouting players we want measurements to be repeatable over seasons: that a player can do what they did previous season in the coming season. Finishing does not have that property, even over a season. Each dot below compares the finishing of a player between two seasons...
Tweet media one
13
43
263
@Soccermatics
David Sumpter
7 months
We (myself and @KozlovaNicole at @twelve_football ) have modeled how players perform in different metrics when they move between leagues. Here is the overall ranking of leagues, looking at how (on average) metrics change when players transfer. Red is worse, green better.
Tweet media one
14
32
265
@Soccermatics
David Sumpter
5 months
Great project. Look forward to seeing more from this.
@skalskip92
SkalskiP
5 months
almost fully functional version of my football AI project today, I added player tracking using ByteTrack and projection of players onto the map code coming soon:
52
209
2K
3
19
260
@Soccermatics
David Sumpter
8 months
Det finns en klubb som jag jobbar med som kommer alltid ligger närmast min 💚 Tack att jag fick vara med ikväll, träffa så många av er och prata ”nyckeltal”
Tweet media one
6
0
261
@Soccermatics
David Sumpter
7 years
Arsenal just sold their 2nd best player to Liverpool. More defensive impact that Xhaka, twice the xG of Giroud and passing that rivals Özil!
Tweet media one
28
162
234
@Soccermatics
David Sumpter
2 years
And they completely dominated the game.
Tweet media one
1
17
238
@Soccermatics
David Sumpter
2 years
Their most dangerous combination is Ben White to Saka, closely followed by Odegaard to Saka. Jesus is receiving the ball well from everyone. Plot shows expected threat (value added by passes) between pairs of players.
Tweet media one
2
28
231
@Soccermatics
David Sumpter
2 years
Lesson 2 of 'Soccermatics: the course' is live! ⚽️ ✅Learn how to make an expected goals model. ✅Does possession influence match outcome? ✅Test whether your favourite football myth is true. Get started with this video:
5
23
237
@Soccermatics
David Sumpter
5 years
How Swedes were fooled by one of the biggest scientific bluffs of our time... Think that people can be classified as colours? Think again. Over New Year I translated and edited Dan Katz investigation in the 'Surrounded by Idiots'.
14
83
236
@Soccermatics
David Sumpter
2 years
The Soccermatics course: ✅All teaching materials free forever ✅Covers all coding, machine learning, maths and data science skills ✅Given in Student, Professional and Enthusiast versions ✅ Includes graded individual & project work. Take a look:
3
27
231
@Soccermatics
David Sumpter
1 year
What is the best way to learn football analytics? Out on the pitch, of course! I have just come back from a course with the Polish federation for analysts, where we started exactly there.
Tweet media one
2
17
229
@Soccermatics
David Sumpter
2 years
And Odegaard is alsovery important. He is involved in pass chains that create an average of 1.12xG per match! These are his passes with an xT value of over 0.05. Most of them directly forward in to the box.
Tweet media one
2
32
217
@Soccermatics
David Sumpter
4 years
It is live! How to build an expected goals model. First part of my step by step guide using @wyscout freely available football event data. Everything you need to understand and build your own model in Python.
4
27
212
@Soccermatics
David Sumpter
2 years
As I always say ... you have to wait at least five matches to trust expected goals to give a picture of how good a team really is.
Tweet media one
7
14
207
@Soccermatics
David Sumpter
2 years
Their strength is in converting final third entries in to box entries (this is where Jesus is key).
Tweet media one
1
14
192
@Soccermatics
David Sumpter
3 years
Mathematical modelling of football course is up and running again. All the material (lectures, tutorials etc.) is available here: You can't register now but you can follow the progress. And you are welcome next year!
2
29
201
@Soccermatics
David Sumpter
7 years
Should you write about real goals or expected goals? A guide for journalists.
5
56
190
@Soccermatics
David Sumpter
2 years
Arsenal deserve to be where they are in the league (xG before todays match). It is going to be an exciting season for the Gunners!
Tweet media one
3
16
188
@Soccermatics
David Sumpter
2 years
@santvm7 @wr_ghty You won’t. But it could mean that things aren’t as bad as they look according to the table.
3
1
192
@Soccermatics
David Sumpter
4 years
I have written a book about Ten Equations that can change your life for the better. Out 1st of October with @AllenLaneBooks , but you can get a head start here: What Are The Ten Equations?
8
36
192
@Soccermatics
David Sumpter
5 years
Hammarby dominated the match from start to finish. 2.55 against 0.75 in expected goals. Worthy winners of the Stockholm Derby.
Tweet media one
1
35
191
@Soccermatics
David Sumpter
4 years
Dear Jurgen Klopp @LFC . I have been watching your team play and compared it to some value models. It is straightforward. Balls out wide to TAA (by Thiago for example, shown below) do have value. (1/n)
Tweet media one
14
37
181
@Soccermatics
David Sumpter
5 years
Advice for anyone who wants to become a football data scientist. All the most important information from last night's broadcast. via @YouTube
6
25
179
@Soccermatics
David Sumpter
5 months
Is the model wrong or is it everyone else? Phil Foden is obviously a great player. But TwelveGPT puts a lot of this season's excitement about him down to some good finishing.
Tweet media one
13
14
177
@Soccermatics
David Sumpter
5 years
If the league was decided by simulating it 10,000 times using xG then today Manchester City would have won by 8.9 points while sending Brighton down. Wolves would have qualified for the Champions League at Anfield. Meanwhile, Cardiff just secured survival against Man United.
Tweet media one
10
42
173
@Soccermatics
David Sumpter
5 years
"I've never been in a team that dominated a match like this", said Tankovic afterwards, "we rolled over them." Top down look at how Hammarby's press produces goals. Data from @signality . Pitch control by @JaviOnData . Work as part of Fran Peralta's masters thesis.
6
24
175
@Soccermatics
David Sumpter
4 years
It is on! If you would like to learn football analytics and get University credits then this is the course for you. FREE TO EU/EEA CITIZENS. #FoT
14
30
173
@Soccermatics
David Sumpter
2 years
Of course, the league position and the results are important statistics too! But overall, Liverpool are (together with Arsenal and Manchester City) one of the strongest three teams in terms of underlying numbers. (Figure not updated for today's results)
Tweet media one
8
18
168
@Soccermatics
David Sumpter
2 years
Today was no exception. Spurs total includes a 0.75xG penalty!
Tweet media one
2
13
167
@Soccermatics
David Sumpter
5 years
After seven games played there is one team that stands out in the quality and quantity of chances created in Allsvenskan. The darker the shading the higher the average xG. The darkest green in #bajen .
Tweet media one
6
21
176
@Soccermatics
David Sumpter
4 years
Want something to do to take your mind off things? I have put up all my lectures for Statistical Machine Learning. The course is built by the excellent machine learning group in Uppsala. Google Colab notebooks and pen and paper exercises throughout.
3
21
172
@Soccermatics
David Sumpter
2 years
Lesson 3 of Soccermatics: mathematical modelling of football is available now! Learn how to make player radars and rank players. What to include, how to correct for possession and to get the right context.
2
25
168
@Soccermatics
David Sumpter
4 years
This evening the Friends of Tracking Youtube channel will be announcing an exciting new collaboration with @SkillCorner . Free tracking data of the best teams in the world, freely available to all! Watch this space #FoT .
7
15
170
@Soccermatics
David Sumpter
2 years
And those box entries result in shots. (The Bournemouth result influences these results somewhat but the numbers still look good).
Tweet media one
2
20
159
@Soccermatics
David Sumpter
2 years
But the single most dangerous pairing is TAA to Salah. This matrix shows expected threat of passes between players.
Tweet media one
1
9
159
@Soccermatics
David Sumpter
10 months
New for me for 2024. This year I will work half-time for @twelve_football . I have always had Twelve as a 'hobby', but now interest amongst clubs has really taken off.🚀 We will build the most comprehensive tool for data analysis - TwelveGPT: football in words and visuals.🤖⚽️
Tweet media one
10
2
163
@Soccermatics
David Sumpter
5 years
The most exciting league in the world and this morning we are top of it. Feels very good to go in to work today. What a team!
Tweet media one
5
7
160
@Soccermatics
David Sumpter
5 years
I have the best job in the world, talking space creation with the best attacking team in Sweden. And better than that I get to tag @leobengtsson10 . Good discussion. Thanks boys.
Tweet media one
2
2
159
@Soccermatics
David Sumpter
3 years
Another individual skill radar. Saka is effective because he does things quickly. He ranks highly in making rapid passes (top 15%), producing value in xT (top 10%) and dribbling at high speed (top 20%). Most of all, he can pass the ball effectively when play is moving fast.
Tweet media one
5
22
145
@Soccermatics
David Sumpter
5 years
The next Friends of Tracking Youtube live talk on Thursday will look at "Open resources for getting started in football data science". I'll start a list in the thread below. Feel free to add to help me prepare the talk.
1
33
152
@Soccermatics
David Sumpter
2 years
Little of what football commentators say seems motivated by facts. One narrative is Salah is playing wider this season. Below is xT for 3 PL games, compared to same opponents last season. Make up your own mind🤔 It would be so easy to show things like this on TV.🤩
Tweet media one
5
14
153
@Soccermatics
David Sumpter
5 years
Ni är bästa fans i hela världen💚
0
4
155
@Soccermatics
David Sumpter
5 years
How Bojanic, Tankovic, Kacaniklic and Djurdjic open up defences. No wonder we caught the interest of Ibrahimovic. Green are areas where pass success probability is higher, red are less probable.
2
9
153
@Soccermatics
David Sumpter
2 years
Want to learn how the maths and coding needed to build a betting model for the World Cup? I have got you covered! I have created a step by step guide to The Betting Equation from my book The Ten Equations That Rule The World.
3
18
149
@Soccermatics
David Sumpter
5 years
Kacaniklic goal against Elfsborg. Amazing team-buildup and control of opponent's space. Especially Tankovic (22) first involvement and then run for the assist. The finish was all Alex, though!
5
9
154
@Soccermatics
David Sumpter
5 years
Sometimes the numbers just make sense!
@Ibra_official
Zlatan Ibrahimović
5 years
340
900
3K
8
5
150
@Soccermatics
David Sumpter
2 years
Idag tog jag gula bussen till jobbet🚌🚌
Tweet media one
Tweet media two
2
2
146
@Soccermatics
David Sumpter
3 years
Is Pedri equal to Kante plus Jorginho? A bit of a cheeky title, but it is interesting to look at how tracking data can be used to show why Pedri was so good in La Liga last season.
6
33
141
@Soccermatics
David Sumpter
3 years
Explaining Expected Threat Nice to see everyone getting in to a method we have been using at @twelve_football and made publicly available for the last 4 years 😎 But even more important to remember that it came from @srudd_ok over a decade ago 😱😎😎
Tweet media one
6
23
146
@Soccermatics
David Sumpter
8 years
The Human Histogram. Demonstrating probability distributions using the power of crowds.
0
111
141
@Soccermatics
David Sumpter
4 years
This 360 thing from @StatsBomb could be a real game changer for several reasons...
1
13
144
@Soccermatics
David Sumpter
5 years
Richard Magyar. A hero in so many ways and also the statistically best player today.
0
11
147
@Soccermatics
David Sumpter
2 years
How do you turn your football analytics in to a beautiful app? I explain it all step-by-step in this video: Thanks to @streamlit for your amazing product and @jernejfl at @twelve_football for creating beautiful visuals.
3
10
144
@Soccermatics
David Sumpter
2 years
How should we measure the performance of a machine learning model? 😕 When I was teaching ML for the first time last year, I was surprised to find there was no agreed upon single number which measures model performance. 🤯 So I decided to look at the question myself... 🧵
5
19
141
@Soccermatics
David Sumpter
6 years
I just talked to an AI bot about football. How cool is that! 🤖🤖🤖
Tweet media one
10
28
137
@Soccermatics
David Sumpter
2 years
I was so busy after finishing the Soccermatics free football analytics course I never got round to sharing all the different parts we created. I think I stopped at expected threat, but I will recap. Here is the concept explained:
3
6
143
@Soccermatics
David Sumpter
2 years
Pitch control is one of the most important ways of understanding movement of football players. In this video I give an overview of the concept.
2
21
140
@Soccermatics
David Sumpter
4 years
I never dreamed that one day I'd make it in to 'Roy of the Rovers', but life takes some strange turns.
Tweet media one
5
9
137
@Soccermatics
David Sumpter
6 years
Mitt ny arbetsplats!
Tweet media one
1
2
137
@Soccermatics
David Sumpter
5 years
Paulinho was per minute one of the absolute best players in Allsvenskan last season. Here is why... (1/5)
Tweet media one
3
18
136
@Soccermatics
David Sumpter
4 years
The lectures for the 2nd week of the Modelling Football course are up now. We develop expected goals and other statistical models of football.
2
19
136
@Soccermatics
David Sumpter
2 years
I made this as an 'exercise' for my course. Wanted to think about how KPIs can help at this stage in the season to give some perspective.
13
5
133
@Soccermatics
David Sumpter
3 years
Yesterday really was freedom day for me. I got my second dose of vaccine. But most importantly I got this... Not a freedom I asked for, but one I am happy to receive.
Tweet media one
5
0
134
@Soccermatics
David Sumpter
7 years
Turns out that @theresa_may was only slightly more influential than @LordBuckethead during election Twitter.
Tweet media one
3
36
127
@Soccermatics
David Sumpter
8 years
Where Chelsea regained possession (line thickness=num actions) @ChelseaStats . Oscar showing Kante how its done!
Tweet media one
8
154
126
@Soccermatics
David Sumpter
3 years
Using analytics to create patterns of play for @BarcaInnoHub . I look at how Manchester City implement the "5-second-rule" using pitch control. It is all about where you are before you lose the ball.
Tweet media one
Tweet media two
4
16
129