Це відео не доступне.
Перепрошуємо.

How to Build An Expected Goals Model 2: Statistical fitting

Поділитися
Вставка
  • Опубліковано 5 тра 2020
  • David Sumpter goes through the steps needed to create an expected goals model. This lecture covers:
    - Fitting straight lines using linear regression
    - The logistic function
    - Fitting shot angle and distance separately
    - Fitting shot angle and distance separately
    - Working with regression tools
    The code for this lecture can be found here:
    github.com/Fri....
    Specifically, I use: 4LinearRegression.py and 5xGModelFit.py
    Using Wyscout data from: figshare.com/c....

КОМЕНТАРІ • 12

  • @4lex355
    @4lex355 3 роки тому +2

    based on these videos i may have a nice ideia for a project of mine. thank you.

  • @Sam-ue2ry
    @Sam-ue2ry Рік тому

    I just wanted to share something here - if you look at the shot angle function. You'll see an initial descrease in scoring probability that david attributes to being an anomaly when in fact if you think intuitively about this and i have verified with data. The likelyhood is that despite the angle being low the Distance is also probably low too which possesses significant explanatory power and hence the higher goal probability is due to this rather than any direct effect of shot angle on goal scoring probability. Always check for Multicolinearity because i bet there is some interaction here. Are shots with lower angles typically from lower distances?

  • @lucaspecina
    @lucaspecina 4 роки тому

    Very interesting!! Thank you

  • @danijara
    @danijara 4 роки тому +2

    Hi David! Nice tutorial! Congrats. I have a question. How do we know how predictible our model would be? P value of intercept? or just each variable independently? Thanks in advance!!!!

    • @victorantunes3357
      @victorantunes3357 2 роки тому

      I think neither. You should first divide your dataset into two parts, training data, and test data. Once you fit your model, you can test the accuracy using the Confusion Matrix and the ROC Curve. Usually, more than 70% accuracy is considered very good. This way you know your model predicts well, and you are safer to input new data to predict the probabilities of scoring goals.

  • @viniciusmelo2907
    @viniciusmelo2907 Рік тому

    I was thinking, if using distance as log and angle as cos, wouldn't be better. Then you could make a linear regresion.
    Also, you would have: 0 < xG < 1 | 1 < log(d+1) < 2 | -1 < cosø < 1
    This suposing that a field has 99 m, on average, and you cannot shoot over 180°

    • @viniciusmelo2907
      @viniciusmelo2907 Рік тому

      Appears to be a smoothier function either. Because you have similar ranges of values to all variables.

  • @roro479
    @roro479 4 роки тому +2

    Hey, for the 2D model for expected goals I am only getting an empty pitch. I am not getting the various coloured areas.

    • @roro479
      @roro479 4 роки тому +1

      Nevermind I figured it out. It was because I was creating a model with angle, distance, X and C. If I remove the variables X and C it works.

  • @rizkyzulkarnaen6594
    @rizkyzulkarnaen6594 3 роки тому

    Hi David, I tried to open your dataset, but I got Page not found, could you please provide the new link?

    • @abdelmalekelkasabi4778
      @abdelmalekelkasabi4778 2 роки тому

      check out the link on part 1, it seems there's something work with the link in this description
      here it is anyway:
      figshare.com/collections/Soccer_match_event_dataset/4415000/5

  • @neilrohra7374
    @neilrohra7374 Рік тому

    Hi, Is the data no longer available to download?