David Sumpter Profile picture
Sep 18 12 tweets 5 min read
Expected Threat (xT) is one of the most important (but least well-understood) statistics in football. ⚽️⚽️

It was used by Liverpool (they call it Goals Added) to do some of their best scouting. 📈

And now, YOU can learn all about it...🧵
The basic idea of xT is to assign a value to a position on the pitch or an action based on the probability it will lead to a goal.
It was proposed (ten years ago) by @srudd_ok who had the idea of using Markov chains to measure how much different parts of the pitch are worth.
This was a mathematically elegant solution and in this video I explain how xT (or probability of a goal) can be calculated step by step.
The name Expected Threat (xT) came form @karun1710, who also described how it works in this amazing interactive blog post. karun.in/blog/expected-…
Lots of different names --- EPV, OBV, G+, VAEP etc. --- have been given to the xT approach.

I distinguish between position-based xT (where place on the pitch is assigned value) and action-based xT (where passes, dribbles etc. are assigned value).

soccermatics.readthedocs.io/en/latest/less…
I start by explaining the position-based version of xT. By measuring transitions between different parts of the pitch and the probability of scoring from them, we can iteratively value different places on the pitch.
Then I turn to action-based xT, which better captures the dynamic nature of football.

We can assign value, not just to position, but also how we move between positions.
The action-based version of xT has not been presented in so much detail before, although it is better for professional applications.

I try to put this right in this video...
And @aleksander_and has done a wonderful step-by-step implementation here. soccermatics.readthedocs.io/en/latest/gall…
And I use the @twelve_football implementation of action-based xT to give application examples in this video.
I hope learning about xT is as much fun as it has been to write and make videos about.

And that you can make your own version🤗🤗

Enjoy!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with David Sumpter

David Sumpter Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Soccermatics

Sep 4
A thread on Liverpool using numbers.

Six games is great for using expected goals and expected threat to find out what is going on.

First of all they have 'won' all six on xG.
They are best in league at getting the ball in to the final third and in to the box per match.
And those box entries result in shots.

(The Bournemouth result influences these results somewhat but the numbers still look good).
Read 8 tweets
Jun 20
How should we measure the performance of a machine learning model? 😕

When I was teaching ML for the first time last year, I was surprised to find there was no agreed upon single number which measures model performance. 🤯

So I decided to look at the question myself... 🧵
Here is the question. Imagine you have created an algorithm which assigns a score to an image based on how likely it is to contain a cat. 🐈‍⬛

The pictures and the scores from your algorithm might look something like the following.
Now think about setting a threshold for saying 'this is a cat'. For example, in the image above a threshold of r=67 will correctly identify 5 cats (87, 85, 82, 79, 68), but wrongly include 2 non-cats (74, 73) in the images you have labelled as cats. 🐏🐇
Read 16 tweets
Mar 9
In the new spirit of me writing about what I am working on each day, here is a random thread about the 'bespoke football analytics' which we are working on at @twelve_football and featured in the @AnalyticsFC podcast.
The image below is a player radar for Adama Traore from this season in PL before he went to Barcelona. It shows that he is top rated for high-speed dribbles, creating his own chances and maintaining possession. He is also good at cutbacks and through balls.
These kind of stats are important because they move on from simply counting number of actions, to assigning on-ball-value to actions (xT) in terms of ball progression to counting the type of actions a player does and how much value they give (which is what we measure).
Read 10 tweets
Feb 8, 2021
I have spent much of today studying the Stochastic Parrots paper by @emilymbender, @timnitGebru, @mcmillan_majora and @mmitchell_ai. (faculty.washington.edu/ebender/papers…) This paper is important for many reasons...
It explains why language models like GPT3 are in large part gimmicks: stochastic parrots. They are just generating randomish sequences of sentences from the data put in to them. (1/n)
These can be a bit of fun. But they don't communicate with us: they just repeat random fragments of Reddit, Wikipedia and sensationalised newspaper articles. So it's not clear how they are useful in a wider setting. (2/n)
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(