SOME STATISTICAL ANALYSES OF NBA BASKETBALL
EDWARD N. TORRES
TOPICS OF FOCUS
• Importance of a Lead in the NBA
• Overdispersion?
• Free-Throw Momentum?
• Linear Regression: Predicting Wins
IMPORTANCE OF A LEAD IN THE NBA
WIENER PROCESS
• Stochastic Process: A collection of random variables indexed by t (time).
• Wiener Process: A type of Stochastic Process with the following properties:
i. W(0) = 0
ii. $W(t) - W(s)$ has a normal distribution with mean 0 and variance $\sigma^2 (t - s)$, for $s \le t$
iii. $W(t_2) - W(t_1),\ W(t_3) - W(t_2),\ \ldots,\ W(t_n) - W(t_{n-1})$ are independent for $t_1 \le t_2 \le \cdots \le t_n$
• Let W(t) be a Wiener process that represents the score differential at time t, where
t is the proportion of the game that has been played.
FINDING $E[W(t)]$ UNDER THE ASSUMPTION THAT BOTH TEAMS ARE EQUAL IN ABILITY
• What this means:
• What is the average point differential at time t, given both teams are equal in ability?
• $E[W(t)] = E[W(t) - W(0)] = 0$ (tied)
FINDING $\mathrm{Var}[W(t)]$, GIVEN $\mathrm{Var}[W(1)] = \sigma^2$
(VARIANCE OF THE FINAL SCORE DIFFERENTIAL)
• $\mathrm{Var}[W(t)] = \mathrm{Var}[W(t) - W(0)] = \sigma^2 (t - 0) = \sigma^2 t$
• Note: the variance of the score differential at time t is proportional to the variance of the final score differential, with proportionality constant t
• Ex: $\mathrm{Var}[W(1/2)] = \frac{1}{2}\sigma^2$ (variance of the score differential at half-time)
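As a quick numerical check of $\mathrm{Var}[W(t)] = \sigma^2 t$, the sketch below builds the score differential from four independent quarter increments, each distributed N(0, σ²/4), and compares the sample variances at half-time and at the final buzzer with σ²/2 and σ². The value σ = 12, the seed, and the function name are illustrative assumptions, not values from the slides.

```python
import random
import statistics

def simulate_quarter_paths(n_games, sigma, seed=0):
    """Simulate W at t = 1/4, 1/2, 3/4, 1 from independent normal increments."""
    rng = random.Random(seed)
    step_sd = sigma * (0.25 ** 0.5)  # each quarter increment ~ N(0, sigma^2 / 4)
    paths = []
    for _ in range(n_games):
        w, path = 0.0, []
        for _ in range(4):
            w += rng.gauss(0.0, step_sd)
            path.append(w)
        paths.append(path)
    return paths

sigma = 12.0  # hypothetical standard deviation of the final score differential
paths = simulate_quarter_paths(20000, sigma)
var_half = statistics.pvariance([p[1] for p in paths])   # sample Var[W(1/2)]
var_final = statistics.pvariance([p[3] for p in paths])  # sample Var[W(1)]
print(round(var_half / sigma**2, 3), round(var_final / sigma**2, 3))
```

The two printed ratios should be close to 0.5 and 1.0, up to simulation noise.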
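FINDING $P(W(1) > 0 \mid W(t) > 0)$
• Decompose W(1) into two components and express the components in terms of t and standard normal variables $Z_1$ and $Z_2$.
$W(1) = W_1(t) + W_2(1 - t)$
$W_1(t) = W(t) - W(0) \sim N(0, \sigma^2 (t - 0))$
$\Rightarrow W_1(t) \sim N(0, \sigma^2 t)$
$\Rightarrow W_1(t) = Z_1 \sigma \sqrt{t}$
$W_2(1 - t) = W(1) - W(t) \sim N(0, \sigma^2 (1 - t))$
$\Rightarrow W_2(1 - t) = Z_2 \sigma \sqrt{1 - t}$
• By property iii (independent increments), $Z_1$ and $Z_2$ are independent.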
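FINDING $P(W(1) > 0 \mid W(t) > 0)$
(CONTINUED)
• Substitute the two components for W(1)
$P(W(1) > 0 \mid W(t) > 0) = P(W_1(t) + W_2(1 - t) > 0 \mid W(t) > 0)$
$= P(Z_1 \sigma \sqrt{t} + Z_2 \sigma \sqrt{1 - t} > 0 \mid Z_1 \sigma \sqrt{t} > 0)$
$= P(Z_1 \sqrt{t} + Z_2 \sqrt{1 - t} > 0 \mid Z_1 > 0)$
$= \dfrac{P(Z_1 \sqrt{t} + Z_2 \sqrt{1 - t} > 0,\ Z_1 > 0)}{P(Z_1 > 0)}$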
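FINDING $P(W(1) > 0 \mid W(t) > 0)$
(CONTINUED)
$= \dfrac{P(Z_1 \sqrt{t} + Z_2 \sqrt{1 - t} > 0,\ Z_1 > 0)}{P(Z_1 > 0)}$
$= 2 \iint_{R} \rho(z_1)\, \rho(z_2)\, dz_1\, dz_2$, where $\rho$ is the standard normal density and $R = \{(z_1, z_2) : z_1 > 0,\ z_1 \sqrt{t} + z_2 \sqrt{1 - t} > 0\}$
$= 2 \int_0^{\infty} \int_{-z_1 \sqrt{t} / \sqrt{1 - t}}^{\infty} \frac{e^{-(z_1^2 + z_2^2)/2}}{2\pi}\, dz_2\, dz_1$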
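FINDING $P(W(1) > 0 \mid W(t) > 0)$
(CONTINUED)
• Change to polar coordinates ($z_1 = r \cos\theta$, $z_2 = r \sin\theta$)
$2 \int_{\tan^{-1}\left(-\sqrt{t/(1-t)}\right)}^{\pi/2} \int_0^{\infty} \frac{e^{-r^2/2}}{2\pi}\, r\, dr\, d\theta$
$= \frac{1}{\pi} \int_{\tan^{-1}\left(-\sqrt{t/(1-t)}\right)}^{\pi/2} 1\, d\theta$
$= \frac{1}{\pi} \left[ \frac{\pi}{2} - \tan^{-1}\left(-\sqrt{\frac{t}{1 - t}}\right) \right]$
$= \frac{1}{2} + \frac{1}{\pi} \tan^{-1}\sqrt{\frac{t}{1 - t}}$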
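The closed form can be evaluated directly. The sketch below assumes the result in its simplified form, $P(W(1) > 0 \mid W(t) > 0) = \frac{1}{2} + \frac{1}{\pi} \tan^{-1}\sqrt{t/(1-t)}$, which gives 2/3, 3/4 and 5/6 at the ends of the first three quarters. The function name is illustrative.

```python
import math

def p_win_given_lead(t):
    """P(W(1) > 0 | W(t) > 0) for equal-ability teams, with 0 < t < 1."""
    if not 0.0 < t < 1.0:
        raise ValueError("t must be strictly between 0 and 1")
    return 0.5 + math.atan(math.sqrt(t / (1.0 - t))) / math.pi

for quarter, t in [("1Q", 0.25), ("2Q", 0.50), ("3Q", 0.75)]:
    print(quarter, f"{p_win_given_lead(t):.1%}")
```

Note that σ cancels out of the conditional probability, so the function needs only the fraction of the game played.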
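WHAT ARE THE CHANCES OF A TEAM WINNING A GAME AFTER LEADING AT THE END OF THE 1Q? 2Q? 3Q?
Lead After   Calculation   P(Win)
1Q   $P(W(1) > 0 \mid W(\tfrac{1}{4}) > 0) = \frac{1}{\pi} \left[ \frac{\pi}{2} - \tan^{-1}\left(-\sqrt{\frac{1/4}{3/4}}\right) \right]$   66.7% = 2/3
2Q   $P(W(1) > 0 \mid W(\tfrac{1}{2}) > 0) = \frac{1}{\pi} \left[ \frac{\pi}{2} - \tan^{-1}\left(-\sqrt{\frac{1/2}{1/2}}\right) \right]$   75% = 3/4
3Q   $P(W(1) > 0 \mid W(\tfrac{3}{4}) > 0) = \frac{1}{\pi} \left[ \frac{\pi}{2} - \tan^{-1}\left(-\sqrt{\frac{3/4}{1/4}}\right) \right]$   83.3% = 5/6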
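The tabulated probabilities can also be checked by simulation without any integration. The sketch below draws independent standard normals $Z_1$ and $Z_2$ and estimates $P(Z_1\sqrt{t} + Z_2\sqrt{1-t} > 0 \mid Z_1 > 0)$. The number of draws, the seed, and the function name are assumptions chosen for illustration.

```python
import math
import random

def mc_p_win_given_lead(t, n_draws=200_000, seed=1):
    """Estimate P(Z1*sqrt(t) + Z2*sqrt(1-t) > 0 | Z1 > 0) by simulation."""
    rng = random.Random(seed)
    a, b = math.sqrt(t), math.sqrt(1.0 - t)
    leading = wins = 0
    for _ in range(n_draws):
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        if z1 > 0:                    # team leads at time t
            leading += 1
            if a * z1 + b * z2 > 0:   # and is still ahead at the end
                wins += 1
    return wins / leading

estimates = {t: mc_p_win_given_lead(t) for t in (0.25, 0.50, 0.75)}
for t, p in estimates.items():
    print(t, round(p, 3))
```

The estimates should land near 2/3, 3/4 and 5/6, up to simulation noise.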
COMPARING MODEL TO REALITY
Lead After   BVN     Actual   Model   Difference (Actual - Model, pct. points)
1Q           64.5%   65.1%    66.7%   -1.6
2Q           72.4%   72.5%    75.0%   -2.5
3Q           81.1%   82.0%    83.3%   -1.3
• Conclusions:
• The model is quite accurate: its win probabilities are within 2.5 percentage points of the actual percentages, and slightly above them after every quarter.
OVERDISPERSION
• Definition: “In statistics, overdispersion is the presence of greater variability (statistical dispersion) in a data set than would be expected based on a given statistical model.”
• In our case we would like to check for overdispersion in game-by-game free-throw success for certain players, based on the model $x \sim \mathrm{Bin}(n, \pi)$
• Where: n = number of free throws attempted
π = free-throw percentage
x = number of makes
• $\mathrm{Std}(x) = \sqrt{n \pi q}$, where $q = 1 - \pi$
• We collected free-throw data from the 2018-19 NBA season on a few players who averaged a large number of free-throw attempts per game.
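To make the binomial benchmark concrete, the sketch below computes the expected number of makes and $\mathrm{Std}(x) = \sqrt{n \pi q}$ for a single game. The 10-attempt, 80% shooter is a hypothetical example, not a player from the data, and the function name is illustrative. Overdispersion would show up as game-to-game variation in makes that is larger than this benchmark.

```python
import math

def binomial_sd(n_attempts, ft_pct):
    """Standard deviation of makes under x ~ Bin(n, pi): sqrt(n * pi * q)."""
    q = 1.0 - ft_pct
    return math.sqrt(n_attempts * ft_pct * q)

# Hypothetical game: 10 attempts by an 80% free-throw shooter.
expected_makes = 10 * 0.80
sd_makes = binomial_sd(10, 0.80)
print(expected_makes, round(sd_makes, 3))
```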
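CHI-SQUARED TEST
• $\chi^2 = \sum_i \sum_j \frac{(O_{i,j} - E_{i,j})^2}{E_{i,j}}$, where $O_{i,j}$ is the observed count for game i and column j (made or missed)
• Where $E_{i,j} = \frac{(\text{row total})(\text{col total})}{\text{grand total}}$
• $= (\text{total FTs in game } i) \times \hat{\pi}$ for the made column
• Test of homogeneity
• $H_0$: P(ft) is the same each game
• $H_0: \pi_1 = \pi_2 = \cdots = \pi_n = \pi$
• $\hat{\pi}$ = MLE of FT% (season)
• A large $\chi^2$ is evidence against our $H_0$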
James Harden FTs (18’/19’ season)
Game     FT made   FT miss   FTA
1        3         1         4
2        11        4         15
3        5         1         6
4        6         3         9
5        6         2         8
6        9         0         9
7        1         2         3
⋮        ⋮         ⋮         ⋮
50       9         2         11
Total:   513       79        592
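The test of homogeneity can be sketched in a few lines of standard-library Python. The example uses only the eight games printed in the table (not the full data set), so its result is illustrative and is not the p-value reported in the slides. Expected counts come from the pooled proportion of those eight games via (row total)(col total)/(grand total), and the degrees of freedom are (number of games - 1). The helper `chi2_sf` is a hand-written upper-tail function for integer degrees of freedom, and all names are assumptions. Several expected miss counts are below 5 here, so the chi-squared approximation is rough.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-squared variable with integer df >= 1."""
    half = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-half)              # closed form for df = 2
    else:
        a, q = 0.5, math.erfc(math.sqrt(half))   # closed form for df = 1
    while a < df / 2.0:                          # Q(a+1, h) = Q(a, h) + h^a e^-h / Gamma(a+1)
        q += half ** a * math.exp(-half) / math.gamma(a + 1.0)
        a += 1.0
    return q

def homogeneity_test(made, missed):
    """Chi-squared test that the make probability is the same in every game."""
    total_made, total_missed = sum(made), sum(missed)
    grand = total_made + total_missed
    stat = 0.0
    for m, miss in zip(made, missed):
        row = m + miss
        for observed, col_total in ((m, total_made), (miss, total_missed)):
            expected = row * col_total / grand
            stat += (observed - expected) ** 2 / expected
    df = len(made) - 1
    return stat, df, chi2_sf(stat, df)

# Only the eight games shown in the table above.
made = [3, 11, 5, 6, 6, 9, 1, 9]
missed = [1, 4, 1, 3, 2, 0, 2, 2]
stat, df, p_value = homogeneity_test(made, missed)
print(round(stat, 2), df, round(p_value, 3))
```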
CHI-SQUARED TEST
(RESULTS)
• We chose players who averaged a high number of free-throw attempts per game.
• The data are consistent with the null hypothesis
• No player has a significant p-value
Player                  P-value
James Harden            0.1
Joel Embiid             0.86
Giannis Antetokounmpo   0.58
Blake Griffin           0.77
Kevin Durant            0.66
Damian Lillard          0.74
Paul George             0.34
Anthony Davis           0.20
FREE-THROW MOMENTUM
WALD-WOLFOWITZ RUNS TEST
• We want to test free-throw dependency within a game
• How one free throw affects the next
• $H_0$: the order of makes and misses in a game is random.
• To do this we use the Wald-Wolfowitz runs test
• Unusually long runs (that is, fewer runs than expected under randomness) are evidence against $H_0$
• Data come from the 16’/17’ season.
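A minimal sketch of the runs test, using the usual normal approximation for the number of runs R: with $n_1$ makes and $n_2$ misses, $E[R] = \frac{2 n_1 n_2}{n_1 + n_2} + 1$ and $\mathrm{Var}[R] = \frac{2 n_1 n_2 (2 n_1 n_2 - n_1 - n_2)}{(n_1 + n_2)^2 (n_1 + n_2 - 1)}$. The sequence below is hypothetical, and the slides do not say whether an exact or approximate version of the test was used. For the short sequences typical of a single game the normal approximation is crude.

```python
import math

def runs_test(sequence):
    """Wald-Wolfowitz runs test (normal approximation) on a two-symbol sequence."""
    symbols = sorted(set(sequence))
    if len(symbols) != 2:
        raise ValueError("sequence must contain exactly two distinct symbols")
    n1 = sum(1 for s in sequence if s == symbols[0])
    n2 = len(sequence) - n1
    # A new run starts wherever two neighbouring outcomes differ.
    runs = 1 + sum(1 for a, b in zip(sequence, sequence[1:]) if a != b)
    n = n1 + n2
    mean = 2.0 * n1 * n2 / n + 1.0
    var = 2.0 * n1 * n2 * (2.0 * n1 * n2 - n) / (n ** 2 * (n - 1))
    z = (runs - mean) / math.sqrt(var)
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return runs, mean, var, z, p_two_sided

# Hypothetical free-throw sequence: "M" = make, "X" = miss.
runs, mean, var, z, p = runs_test("MMMMXXXX")
print(runs, mean, round(var, 3), round(z, 2), round(p, 3))
```

A strongly negative z (too few runs) points toward streakiness, while a strongly positive z (too many runs) points toward alternation.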
PREDICTING TOTAL SEASON WINS FROM TEAM STATISTICS
SIMPLE LINEAR REGRESSION
• Data Used:
• 14’/15’ NBA Team Stats
• 15’/16’ NBA Team Stats
• 16’/17’ NBA Team Stats
• 17’/18’ NBA Team Stats
• 18’/19’ NBA Team Stats
• Response Variable:
• won (total games won during the season)
• Predictor Variables (each used on its own):
• 3pa (three-point attempts)
• 3pm (three-pointers made)
• 3p% (three-point percentage)
• fg% (field-goal percentage)
SIMPLE LINEAR REGRESSION: $R^2$ VALUES
Season    3pa      3pm      3p%      fg%
18’/19’   9.74%    23.44%   29.74%   36.89%
17’/18’   5.79%    11.52%   23.21%   42.39%
16’/17’   7.61%    19.88%   40.34%   44.38%
15’/16’   6.23%    15.80%   41.54%   42.62%
14’/15’   22.25%   37.27%   48.10%   55.51%
• Of the four predictors, fg% gives the highest $R^2$ value in every season
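For reference, the $R^2$ of a one-predictor least-squares fit can be computed as $S_{xy}^2 / (S_{xx} S_{yy})$. The sketch below shows that calculation on made-up numbers. The data are hypothetical and do not reproduce any value in the table, and the function name is an assumption.

```python
def simple_linear_regression(x, y):
    """Least-squares fit y = b0 + b1 * x; returns (b0, b1, r_squared)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    syy = sum((yi - mean_y) ** 2 for yi in y)
    b1 = sxy / sxx                      # slope
    b0 = mean_y - b1 * mean_x           # intercept
    r_squared = sxy ** 2 / (sxx * syy)  # share of variance in y explained by x
    return b0, b1, r_squared

# Hypothetical team data (not from the slides): field-goal fraction and wins.
fg_pct = [0.44, 0.45, 0.46, 0.47, 0.48, 0.49]
wins = [30, 41, 35, 48, 44, 57]
b0, b1, r2 = simple_linear_regression(fg_pct, wins)
print(round(b1, 1), round(r2, 3))
```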
SIMPLE LINEAR REGRESSION USING FG% TO PREDICT WINS (16’/17’ SEASON)
SIMPLE LINEAR REGRESSION USING PT. DIFFERENTIAL AS A PREDICTOR VARIABLE (16’/17’ SEASON)
MULTIPLE LINEAR REGRESSION
• Data Used:
• 2016-2017 NBA Team Stats
• Response Variable:
• won (total games won during the season)
• Predictor Variables:
• 2p%, 3p%, to (turnovers), tr (total rebounds), bk (blocks), O 2p%, O 3p%, O tr, O to
Regression Analysis: won versus 2p%, 3p%, tr, to, bk, O 2p%, O 3p%, O tr, O to
Analysis of Variance
Source       DF   Adj SS    Adj MS   F-Value   P-Value
Regression    9   3393.07   377.01     31.82     0.000
  2p%         1    293.05   293.05     24.74     0.000
  3p%         1    119.74   119.74     10.11     0.005
  tr          1    133.91   133.91     11.30     0.003
  to          1    249.82   249.82     21.09     0.000
  bk          1     79.57    79.57      6.72     0.017
  O 2p%       1    184.10   184.10     15.54     0.001
  O 3p%       1    207.14   207.14     17.49     0.000
  O tr        1    161.41   161.41     13.63     0.001
  O to        1    166.80   166.80     14.08     0.001
Error        20    236.93    11.85
Total        29   3630.00
Model Summary
S R-sq R-sq(adj) R-sq(pred)
3.44187 93.47% 90.54% 85.32%
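The summary statistics follow directly from the sums of squares and degrees of freedom in the Analysis of Variance table. The short check below recomputes them. The variable names are illustrative, and the input numbers are copied from the output above.

```python
import math

# Values copied from the Analysis of Variance table above.
ss_regression, df_regression = 3393.07, 9
ss_error, df_error = 236.93, 20
ss_total, df_total = 3630.00, 29

ms_regression = ss_regression / df_regression
ms_error = ss_error / df_error
f_value = ms_regression / ms_error
s = math.sqrt(ms_error)                            # residual standard error S
r_sq = 1.0 - ss_error / ss_total                   # R-sq
r_sq_adj = 1.0 - ms_error / (ss_total / df_total)  # R-sq(adj)

print(f"F = {f_value:.2f}, S = {s:.4f}, R-sq = {r_sq:.2%}, R-sq(adj) = {r_sq_adj:.2%}")
```

These should agree with the reported F-Value, S, R-sq and R-sq(adj) up to rounding. R-sq(pred) requires the leave-one-out residuals and cannot be recovered from this table alone.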
Coefficients
Term       Coef       SE Coef   T-Value   P-Value   VIF
Constant   86.5       65.2       1.33     0.200
2p%        238.7      48.0       4.97     0.000     2.00
3p%        168.0      52.8       3.18     0.005     2.23
tr         0.02197    0.00654    3.36     0.003     2.05
to         -0.03498   0.00762   -4.59     0.000     1.59
bk         -0.0418    0.0161    -2.59     0.017     2.11
O 2p%      -256.6     65.1      -3.94     0.001     1.80
O 3p%      -283.3     67.8      -4.18     0.000     1.94
O tr       -0.01723   0.00467   -3.69     0.001     1.19
O to       0.0387     0.0103     3.75     0.001     1.70
Regression Equation
won = 86.5 + 238.7 2p% + 168.0 3p% + 0.02197 tr - 0.03498 to - 0.0418 bk - 256.6 O 2p%
- 283.3 O 3p% - 0.01723 O tr + 0.0387 O to
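The fitted equation can be wrapped in a small function for prediction. The coefficients below are copied from the regression equation. The example team is hypothetical, and the units (percentages entered as fractions such as 0.51, counting statistics entered as season totals) are an assumption inferred from the size of the coefficients, since the slides do not state them.

```python
# Coefficients copied from the regression equation above.
COEFFICIENTS = {
    "2p%": 238.7, "3p%": 168.0, "tr": 0.02197, "to": -0.03498, "bk": -0.0418,
    "O 2p%": -256.6, "O 3p%": -283.3, "O tr": -0.01723, "O to": 0.0387,
}
INTERCEPT = 86.5

def predict_wins(team_stats):
    """Evaluate the fitted equation for one team's season statistics."""
    missing = set(COEFFICIENTS) - set(team_stats)
    if missing:
        raise KeyError(f"missing predictors: {sorted(missing)}")
    return INTERCEPT + sum(coef * team_stats[name] for name, coef in COEFFICIENTS.items())

# Hypothetical team (assumed units: fractions for percentages, season totals for counts).
example_team = {
    "2p%": 0.51, "3p%": 0.36, "tr": 3600, "to": 1100, "bk": 400,
    "O 2p%": 0.50, "O 3p%": 0.35, "O tr": 3500, "O to": 1150,
}
print(round(predict_wins(example_team), 1))
```

Under the assumed units, a one-percentage-point increase in 2p% (0.01) changes the prediction by about 2.4 wins, holding the other predictors fixed.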
Correlation: 2p%, 3p%, tr, to, bk, O 2p%, O 3p%, O tr, O to
Correlations
         2p%      3p%      tr       to       bk       O 2p%    O 3p%    O tr
3p%      0.478
tr       0.046   -0.332
to       0.283   -0.178    0.262
bk       0.132    0.172    0.204    0.120
O 2p%    0.008   -0.261   -0.136    0.148   -0.560
O 3p%   -0.413   -0.313   -0.121    0.010   -0.546    0.396
O tr    -0.141   -0.256   -0.126    0.075    0.078    0.065    0.020
O to     0.134   -0.002   -0.352    0.288    0.170    0.137   -0.171    0.136
Cell Contents: Pearson correlation
OVERVIEW
• Importance of a lead in the NBA.
• It’s important to get off to a good start in a game.
• You don’t want to be behind going into the 4th quarter.
• Overdispersion?:
• Found no evidence that suggested overdispersion for game-by-game free-throw percentages
• Free-throw Momentum?:
• Found no evidence that suggested that one free-throw affects the next during a game.
• Predicting wins using team data (regression)
• fg% had the highest $R^2$ value when using simple linear regression on our data.
• Multiple linear regression equation:
• won = 86.5 + 238.7 2p% + 168.0 3p% + 0.02197 tr - 0.03498 to - 0.0418 bk - 256.6 O 2p% - 283.3 O 3p% - 0.01723 O tr + 0.0387 O to
Thank you!
