SlideShare une entreprise Scribd logo
1  sur  33
PROBABILITY ANALYSIS : Sachin
 Tendulkar’s Test Cricket Records




            Presented by :
                             Sandipan Maiti
            Lal Bahadur Shastri Institute of Management
FLOWLINE

   Introduction
   Data Summarization
   Century Analysis
   Half century analysis
   Not out Analysis
   Series Total
   Key Learnings
INTRODUCTION

 Statistics: Game of data.
 Probability: Game of chances.
 Cricket: Game close to Indian hearts.


 Attempt to link the 3 games to reach to some
  meaningful conclusions.
WHY SACHIN !

 Most number of test played: Better applicability of
  statistics.
 Huge achievements: Solid base for discussion on
  probability techniques.
 Variety of data: Scope to cover different probability
  techniques.
 Finally: Because he is “THE SACHIN – GOD OF
  CRICKET”
DATA SUMMARIZATION
DATA SUMMARIZATION

 Data Source: http://www.cricketarchive.com/
 Raw data : Details of matches played against each
  team in each season from 1989 to 2011. (Excluding
  ongoing Test Series).
 Using cross tabulations to structure the data in
  organized manner.
OPPONENT-WISE RECORDS
                                                                       Half     Caught
 Team       Matches Innings Not out   Runs   HS     Avg     Century
                                                                      Century    out
Pakistan      18      27       2      1057   194   42.28       2         7        8
Newzeal
              22      36       5      1532   217   49.42       4         8        10
  and
England       24      39       4      2150   193   61.43       7        10        19

Srilanka      25      36       3      1995   203   60.45       9         6        12

Australia     31      59       7      3151   241   60.60      11        13        19
Banglade
               7       9       3      820    248   136.67      5         0        6
   sh
Zimbabw
               9      14       2      918    201   76.50       3         3        5
    e
  West
              16      25       2      1328   179   57.74       3         7        14
 Indies
 South
              25      45       4      1741   169   42.46       7         5        13
 Africa
  Total       177     290     32      1469   248   56.95      51        59       106
SEASON-WISE RECORDS
                                                               Half Caught
Season Matches Innings Not out Runs   HS    Average Century
                                                              Century out
1989-90   7     10      0      332    88     33.20    0          3     2
1990-91   4      6      1      256    119    51.20    1         1       3
1991-92   5      9      1      368    148    46.00    2         0       5
1992-93   9     12      1      566    165    51.45    2         4       8
1993-94   7      8      2      501    142    83.50    2         2       5
1994-95   3      6      0      402    179    67.00    1         2       5
1995-96   3      4      2      58     52     29.00    0         1       3
1996-97   15    25      1     1134    177    47.25    3         5       11
1997-98   8     12      1      935    177    85.00    5         1       5
1998-99   7     13      1      625    227    52.08    3         2       1
1999-00   8     16      2      859    217    61.36    3         3       4


                                                                        ..
                                                                    Contd
SEASON-WISE RECORDS…cntd..
                                                               Half Caught
Season Matches Innings Not out Runs   HS    Average Century
                                                              Century out
2000-01    6    10      1      684    201    76.00    3          2     4
2001-02    14   23      2     1284    176    61.14    4         6     8
2002-03    9    15      1      807    193    57.64    2         3     4
2003-04    9    15      3      659    241    54.92    2         2     4
2004-05    9    14      2      664    248    55.33    1         4     5
2005-06    9    13      1      335    189    27.92    1         0     5
2006-07    3     6      0      199    64     33.17    0         2     3
2007-08    12   21      3     1114    154    61.89    4         6     13
2008-09    12   23      2      991    160    47.19    3         4     4
2009-10    7    10      2      674    143    84.25    5         1     2
2010-11    11   19      3     1245    214    77.81    4         5     2
 Total    177   290     32    14692   248    56.95    51        59   106
CENTURY ANALYSIS
PROBABILITY OF CENTURY IN A
              MATCH
               MATCHES   CENTURIES   PROBABILITY OF
  TEAM
                PLAYED    SCORED        CENTURY
 Pakistan         18         2            0.111
Newzealand       22          4            0.182
 England         24          7            0.292
  Srilanka       25          9            0.360
 Australia       31         11            0.355
Bangladesh        7          5            0.714
Zimbabwe          9          3            0.333
West Indies      16          3            0.188
South Africa     25          7            0.280
OVERALL          177        51            0.288
PROBABILITY OF CENTURY IN A
             TEST MATCH
0.8

0.7

0.6

0.5

0.4

0.3                          PROBABILITY OF CENTURY


0.2

0.1

 0
IF A CENTURY, PROBABLE TEAM
  TEAM         CENTURIES SCORED   PROBABILITY
 Pakistan             2              0.039
Bangladesh            4              0.078
  England             7              0.137
  Srilanka            9              0.176
 Australia            11             0.216
Bangladesh            5              0.098
Zimbabwe              3              0.059
West Indies           3              0.059
South Africa          7              0.137
  TOTAL               51             1.000
IF A CENTURY, PROBABLE TEAM
0.25




 0.2




0.15



                                                                                                     PROBABILITY
 0.1




0.05




  0
       Pakistan Bangladesh England   Srilanka   Australia Bangladesh Zimbabwe West Indies   South
                                                                                            Africa
OBSERVATIONS

 If Sachin plays a test match, then he is most likely to
  score a century when opponent is Bangladesh.
 If Sachin scores a century in a test match it is most
  likely that the opponent is Australia.
 In both type of above situations Sachin is least likely
  to score a century against Pakistan.
HALF CENTURY ANALYSIS:
DISCRETE PROBABILITY
Fifties   Number of series           f(x)
                              X

  0             29            0      0.433


  1             19            1      0.284


  2             17            2      0.254


  3              2            3      0.030


Total           67           ∑f(x)   1.000
Probability distribution for half                          Cumulative probability
                         centuries                                           distribution
                                                                                               1.200

              0.450


              0.400                                                                            1.000


              0.350

                                                                                               0.800
              0.300
Probability




              0.250

                                                                                               0.600
              0.200                                           f(x)                                     f(x)


              0.150
                                                                                               0.400

              0.100


              0.050                                                                            0.200


              0.000
                      0           1          2            3
                          Number of fifties in a series                                        0.000
                                                                     0    1        2       3
VARIATION AND STD. DEVIATION

X   x-µ     (x-µ)^2    f(x)    f(x)*(x-µ)^2
                                               Variation of random
                                                variable x (fifties in a
0   -0.88   0.775451   0.433     0.3356         series) is 0.7917 squared
                                                fifties
1   0.12    0.014257   0.284     0.0040
                                               Standard deviation in the
2   1.12    1.253063   0.254     0.3179         number of fifties in a
                                                series (σ) is 0.8898
3   2.12    4.491869   0.030     0.1341
                                                fifties
                                 0.7917
                                  =σ^2
EXPECTED FIFTIES IN A SERIES

X     f(x)    xf(x)
                       Thus the expected value
                        E(x) for Sachin scoring a
0     0.433   0.00
                        half century in a series is
1     0.284   0.28      0.88 or almost 1
                       In every test series he
2     0.254   0.51      plays, he is expected to
                        score a half century
3     0.030   0.09

      1.000   0.88
SERIES TOTAL ANALYSIS:
NORMAL DISTRIBUTION
TOTAL RUNS IN A SERIES
                                                                      Histogram
                                                 14


Bin(Total in a series)   Frequency
                                                 12
          0                  2
                                                 10
         60                  7
         120                 8                    8




                                     Frequency
         180                 7
                                                  6
         240                13                                                                          Frequency


         300                11                    4


         360                 7
                                                  2
         420                 8
                                                  0
   More than 420             4                        0   60   120   180   240   300   360   420 More
                                                                                                 than
                                                                                                 420
                                                                           Bin
TOTAL RUNS IN A SERIES..
      Statistical Summary        Slightly skewed towards
      Mean           219.2836
                                  right.
     Median            213
      Mode             199       Skewness is just 0.07
Standard Deviation   126.9055     approximately.
 Sample Variance     16104.99
                                 Data can be considered
    Skewness         0.068918
      Range            493        to      be     normally
    Minimum             0         distributed for analysis
    Maximum            493        purpose.
      Sum             14692
      Count             67
PROBABILITY OF A TOTAL OF 250 IN
           A SERIES


 Std Deviation σ = 127
  Mean µ = 219
  x = 400
 f(x) = 0.00314* exp-(250-219)^2)/(2*127^2)
       = 0.003
       = 0.3 %
PROBABILITY OF A TOTAL OF 100 IN
            A SERIES


 Std Deviation σ = 127
  Mean µ = 219
  x = 400
 f(x) = 0.00314* exp-(100-219)^2)/(2*127^2)
       = 0.002
       = 0.2 %
PROBABILITY OF A TOTAL OF 400 IN
           A SERIES


 Std Deviation σ = 127
  Mean µ = 219
  x = 400
 f(x) = 0.00314* exp-(400-219)^2)/(2*127^2)
       = 0.001
       = 0.1 %
NOT OUT INNINGS :
BINOMIAL DISTRIBUTION
NOT OUT INNINGS: BINOMIAL
           DISTRIBUTION
 Only two possibilities in an innings - out or not out.
 Remaining not out in any match is independent of
  being out or not out in any other match.
 Probability of remaining not out = 1- Probability of
  being                                                 out.
  =>      Binomial         Probability        distribution
                   n!                 n x
      P( X )               p (1  p)
                             x
               x !(n  x )!
NOT OUT INNINGS..
 Probability of being not out in a match
  P(N) = Total not out innings/ Total matches
        = 32/290
        = 0.11
 Probability of being out in a match
  P(O) = 1 - Probability of being not out in a match
       = 1 – 0.11
       = 0.89
PROBABILLITY OF SINGLE NOT OUT IN
           10 MATCHES
 X=1
  n = 10
  p = 0.11
  1 – p = 0.89

  Thus,                n!
          P( X )               p x (1  p)n  x
                   x !(n  x )!
                 = (10!/(1! * 9!)) * (0.11^1)*(0.89^9)
                 = 0.39
PROBABILLITY OF TWO NOT OUTS IN
           20 MATCHES
 X=2
  n = 20
  p = 0.11
  1 – p = 0.89

  Thus,                n!
          P( X )               p x (1  p)n  x
                   x !(n  x )!
                 = (20!/(2! * 18!)) * (0.11^2)*(0.89^18)
                 = 0. 28
KEY LEARNINGS

1. Trends contrasting to the preconceived notions.
    (Binomial Probability Distribution)
2. Proves the general statements.(Half Century every
    match)
3. Great tool for analysis.
4. Easy to use, apply and understand.
Thank You…

Contenu connexe

Tendances (14)

Tabel Nilai Kritis Distribusi Chi-Square
Tabel Nilai Kritis Distribusi Chi-SquareTabel Nilai Kritis Distribusi Chi-Square
Tabel Nilai Kritis Distribusi Chi-Square
 
Valores para calcular chi cuadrado crítico
Valores para calcular chi cuadrado críticoValores para calcular chi cuadrado crítico
Valores para calcular chi cuadrado crítico
 
Tabla Chi cuadrado PSPP
Tabla Chi cuadrado PSPPTabla Chi cuadrado PSPP
Tabla Chi cuadrado PSPP
 
Tablaschi
TablaschiTablaschi
Tablaschi
 
Tablas, estadistica
Tablas, estadisticaTablas, estadistica
Tablas, estadistica
 
Tabla f05
Tabla f05Tabla f05
Tabla f05
 
Us20170260029 a1
Us20170260029 a1Us20170260029 a1
Us20170260029 a1
 
京都大学図書館機構概要2009
京都大学図書館機構概要2009京都大学図書館機構概要2009
京都大学図書館機構概要2009
 
Tabela qui quadrado 2012
Tabela qui quadrado 2012Tabela qui quadrado 2012
Tabela qui quadrado 2012
 
Tabelas do teste f, 10, 5, 1%
Tabelas do teste f, 10, 5, 1%Tabelas do teste f, 10, 5, 1%
Tabelas do teste f, 10, 5, 1%
 
Tablaf
TablafTablaf
Tablaf
 
Termocupla k
Termocupla kTermocupla k
Termocupla k
 
Chi square table
Chi square tableChi square table
Chi square table
 
Keynote technicals intraday future levels for 110213
Keynote technicals intraday future levels for 110213Keynote technicals intraday future levels for 110213
Keynote technicals intraday future levels for 110213
 

En vedette

2012 EURIB - Brand Management
2012 EURIB - Brand Management2012 EURIB - Brand Management
2012 EURIB - Brand Management
Joost Augusteijn
 
my goals presentation
my goals presentationmy goals presentation
my goals presentation
Manuel Rubio
 

En vedette (20)

Garage Door Opener Sammamish
Garage Door Opener SammamishGarage Door Opener Sammamish
Garage Door Opener Sammamish
 
Management accounting accounting
Management accounting accountingManagement accounting accounting
Management accounting accounting
 
Federal Legislative and Regulatory Update
Federal Legislative and Regulatory UpdateFederal Legislative and Regulatory Update
Federal Legislative and Regulatory Update
 
2012 EURIB - Brand Management
2012 EURIB - Brand Management2012 EURIB - Brand Management
2012 EURIB - Brand Management
 
Actividades quandary
Actividades quandaryActividades quandary
Actividades quandary
 
My biography
My biographyMy biography
My biography
 
Media Placement Portfolio Lindsay Krupa
Media Placement Portfolio  Lindsay KrupaMedia Placement Portfolio  Lindsay Krupa
Media Placement Portfolio Lindsay Krupa
 
互联网产品监测报告(第五十二期)
互联网产品监测报告(第五十二期)互联网产品监测报告(第五十二期)
互联网产品监测报告(第五十二期)
 
Les choristes
Les choristesLes choristes
Les choristes
 
20100729.atlassian
20100729.atlassian20100729.atlassian
20100729.atlassian
 
บุคลากรครูที่เกษียณ
บุคลากรครูที่เกษียณบุคลากรครูที่เกษียณ
บุคลากรครูที่เกษียณ
 
囲碁のお勉強(社内勉強会資料)
囲碁のお勉強(社内勉強会資料)囲碁のお勉強(社内勉強会資料)
囲碁のお勉強(社内勉強会資料)
 
บุคลากรครูที่เกษียณ12
บุคลากรครูที่เกษียณ12บุคลากรครูที่เกษียณ12
บุคลากรครูที่เกษียณ12
 
Asociados Agosto 2011
Asociados Agosto 2011Asociados Agosto 2011
Asociados Agosto 2011
 
my goals presentation
my goals presentationmy goals presentation
my goals presentation
 
ท่องเที่ยวเชิงประวัติศาสตร์11111
ท่องเที่ยวเชิงประวัติศาสตร์11111ท่องเที่ยวเชิงประวัติศาสตร์11111
ท่องเที่ยวเชิงประวัติศาสตร์11111
 
Principales monedas
Principales monedasPrincipales monedas
Principales monedas
 
FM&P - VISA Europe
FM&P - VISA EuropeFM&P - VISA Europe
FM&P - VISA Europe
 
ท่องเที่ยวเชิงประวัติศาสตร์
ท่องเที่ยวเชิงประวัติศาสตร์ท่องเที่ยวเชิงประวัติศาสตร์
ท่องเที่ยวเชิงประวัติศาสตร์
 
Describing People
Describing PeopleDescribing People
Describing People
 

Similaire à Probability analysis of sachin's batting

6 dimension and properties table of ipe shape
6 dimension and properties table of ipe shape6 dimension and properties table of ipe shape
6 dimension and properties table of ipe shape
Chhay Teng
 
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdfR and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
ssuserc0e014
 
Actividad 1 funcion lineal
Actividad 1 funcion linealActividad 1 funcion lineal
Actividad 1 funcion lineal
juandiego2011
 
League standing report 03.24.2012 iiii
League standing report 03.24.2012 iiiiLeague standing report 03.24.2012 iiii
League standing report 03.24.2012 iiii
Santiago Ramos
 
Tabla pt amonia
Tabla pt amoniaTabla pt amonia
Tabla pt amonia
orlandoes
 
League standing report 10.06.2012
League standing report 10.06.2012League standing report 10.06.2012
League standing report 10.06.2012
Santiago Ramos
 
League standing report 03.31.2012
League standing report 03.31.2012League standing report 03.31.2012
League standing report 03.31.2012
Santiago Ramos
 

Similaire à Probability analysis of sachin's batting (20)

6 dimension and properties table of ipe shape
6 dimension and properties table of ipe shape6 dimension and properties table of ipe shape
6 dimension and properties table of ipe shape
 
GOLD
GOLDGOLD
GOLD
 
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdfR and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
R and B SOR FOR BUILDING WORKS AHMEDABAD CITY 2021_22.pdf
 
Actividad 1 funcion lineal
Actividad 1 funcion linealActividad 1 funcion lineal
Actividad 1 funcion lineal
 
Ενδέκατη Εβδομάδα Τιμώμενου Μέλους
Ενδέκατη Εβδομάδα Τιμώμενου ΜέλουςΕνδέκατη Εβδομάδα Τιμώμενου Μέλους
Ενδέκατη Εβδομάδα Τιμώμενου Μέλους
 
Gsom1
Gsom1Gsom1
Gsom1
 
Excel Básico 1
Excel Básico 1Excel Básico 1
Excel Básico 1
 
The Science and Practice of Cartographic Interaction
The Science and Practice of Cartographic InteractionThe Science and Practice of Cartographic Interaction
The Science and Practice of Cartographic Interaction
 
League standing report 03.24.2012 iiii
League standing report 03.24.2012 iiiiLeague standing report 03.24.2012 iiii
League standing report 03.24.2012 iiii
 
Besavne cevi din_2448
Besavne cevi din_2448Besavne cevi din_2448
Besavne cevi din_2448
 
Tabla pt amonia
Tabla pt amoniaTabla pt amonia
Tabla pt amonia
 
캐주얼 게임 마케팅의 과거와 미래
캐주얼 게임 마케팅의 과거와 미래캐주얼 게임 마케팅의 과거와 미래
캐주얼 게임 마케팅의 과거와 미래
 
Human Movement 2009: Snapshots and Trends
Human Movement 2009: Snapshots and TrendsHuman Movement 2009: Snapshots and Trends
Human Movement 2009: Snapshots and Trends
 
Sample excel spreadsheets, charts
Sample excel spreadsheets, chartsSample excel spreadsheets, charts
Sample excel spreadsheets, charts
 
Catalogo tubos colmena
Catalogo tubos colmenaCatalogo tubos colmena
Catalogo tubos colmena
 
Trigonotabel
TrigonotabelTrigonotabel
Trigonotabel
 
League standing report 10.06.2012
League standing report 10.06.2012League standing report 10.06.2012
League standing report 10.06.2012
 
League standing report 03.31.2012
League standing report 03.31.2012League standing report 03.31.2012
League standing report 03.31.2012
 
Box Plots
Box PlotsBox Plots
Box Plots
 
Limites de control para gráficos xr xs
Limites de control para gráficos xr xsLimites de control para gráficos xr xs
Limites de control para gráficos xr xs
 

Dernier

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Dernier (20)

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 

Probability analysis of sachin's batting

  • 1. PROBABILITY ANALYSIS : Sachin Tendulkar’s Test Cricket Records Presented by : Sandipan Maiti Lal Bahadur Shastri Institute of Management
  • 2. FLOWLINE  Introduction  Data Summarization  Century Analysis  Half century analysis  Not out Analysis  Series Total  Key Learnings
  • 3. INTRODUCTION  Statistics: Game of data.  Probability: Game of chances.  Cricket: Game close to Indian hearts.  Attempt to link the 3 games to reach to some meaningful conclusions.
  • 4. WHY SACHIN !  Most number of test played: Better applicability of statistics.  Huge achievements: Solid base for discussion on probability techniques.  Variety of data: Scope to cover different probability techniques.  Finally: Because he is “THE SACHIN – GOD OF CRICKET”
  • 6. DATA SUMMARIZATION  Data Source: http://www.cricketarchive.com/  Raw data : Details of matches played against each team in each season from 1989 to 2011. (Excluding ongoing Test Series).  Using cross tabulations to structure the data in organized manner.
  • 7. OPPONENT-WISE RECORDS Half Caught Team Matches Innings Not out Runs HS Avg Century Century out Pakistan 18 27 2 1057 194 42.28 2 7 8 Newzeal 22 36 5 1532 217 49.42 4 8 10 and England 24 39 4 2150 193 61.43 7 10 19 Srilanka 25 36 3 1995 203 60.45 9 6 12 Australia 31 59 7 3151 241 60.60 11 13 19 Banglade 7 9 3 820 248 136.67 5 0 6 sh Zimbabw 9 14 2 918 201 76.50 3 3 5 e West 16 25 2 1328 179 57.74 3 7 14 Indies South 25 45 4 1741 169 42.46 7 5 13 Africa Total 177 290 32 1469 248 56.95 51 59 106
  • 8. SEASON-WISE RECORDS Half Caught Season Matches Innings Not out Runs HS Average Century Century out 1989-90 7 10 0 332 88 33.20 0 3 2 1990-91 4 6 1 256 119 51.20 1 1 3 1991-92 5 9 1 368 148 46.00 2 0 5 1992-93 9 12 1 566 165 51.45 2 4 8 1993-94 7 8 2 501 142 83.50 2 2 5 1994-95 3 6 0 402 179 67.00 1 2 5 1995-96 3 4 2 58 52 29.00 0 1 3 1996-97 15 25 1 1134 177 47.25 3 5 11 1997-98 8 12 1 935 177 85.00 5 1 5 1998-99 7 13 1 625 227 52.08 3 2 1 1999-00 8 16 2 859 217 61.36 3 3 4 .. Contd
  • 9. SEASON-WISE RECORDS…cntd.. Half Caught Season Matches Innings Not out Runs HS Average Century Century out 2000-01 6 10 1 684 201 76.00 3 2 4 2001-02 14 23 2 1284 176 61.14 4 6 8 2002-03 9 15 1 807 193 57.64 2 3 4 2003-04 9 15 3 659 241 54.92 2 2 4 2004-05 9 14 2 664 248 55.33 1 4 5 2005-06 9 13 1 335 189 27.92 1 0 5 2006-07 3 6 0 199 64 33.17 0 2 3 2007-08 12 21 3 1114 154 61.89 4 6 13 2008-09 12 23 2 991 160 47.19 3 4 4 2009-10 7 10 2 674 143 84.25 5 1 2 2010-11 11 19 3 1245 214 77.81 4 5 2 Total 177 290 32 14692 248 56.95 51 59 106
  • 11. PROBABILITY OF CENTURY IN A MATCH MATCHES CENTURIES PROBABILITY OF TEAM PLAYED SCORED CENTURY Pakistan 18 2 0.111 Newzealand 22 4 0.182 England 24 7 0.292 Srilanka 25 9 0.360 Australia 31 11 0.355 Bangladesh 7 5 0.714 Zimbabwe 9 3 0.333 West Indies 16 3 0.188 South Africa 25 7 0.280 OVERALL 177 51 0.288
  • 12. PROBABILITY OF CENTURY IN A TEST MATCH 0.8 0.7 0.6 0.5 0.4 0.3 PROBABILITY OF CENTURY 0.2 0.1 0
  • 13. IF A CENTURY, PROBABLE TEAM TEAM CENTURIES SCORED PROBABILITY Pakistan 2 0.039 Bangladesh 4 0.078 England 7 0.137 Srilanka 9 0.176 Australia 11 0.216 Bangladesh 5 0.098 Zimbabwe 3 0.059 West Indies 3 0.059 South Africa 7 0.137 TOTAL 51 1.000
  • 14. IF A CENTURY, PROBABLE TEAM 0.25 0.2 0.15 PROBABILITY 0.1 0.05 0 Pakistan Bangladesh England Srilanka Australia Bangladesh Zimbabwe West Indies South Africa
  • 15. OBSERVATIONS  If Sachin plays a test match, then he is most likely to score a century when opponent is Bangladesh.  If Sachin scores a century in a test match it is most likely that the opponent is Australia.  In both type of above situations Sachin is least likely to score a century against Pakistan.
  • 17. Fifties Number of series f(x) X 0 29 0 0.433 1 19 1 0.284 2 17 2 0.254 3 2 3 0.030 Total 67 ∑f(x) 1.000
  • 18. Probability distribution for half Cumulative probability centuries distribution 1.200 0.450 0.400 1.000 0.350 0.800 0.300 Probability 0.250 0.600 0.200 f(x) f(x) 0.150 0.400 0.100 0.050 0.200 0.000 0 1 2 3 Number of fifties in a series 0.000 0 1 2 3
  • 19. VARIATION AND STD. DEVIATION X x-µ (x-µ)^2 f(x) f(x)*(x-µ)^2  Variation of random variable x (fifties in a 0 -0.88 0.775451 0.433 0.3356 series) is 0.7917 squared fifties 1 0.12 0.014257 0.284 0.0040  Standard deviation in the 2 1.12 1.253063 0.254 0.3179 number of fifties in a series (σ) is 0.8898 3 2.12 4.491869 0.030 0.1341 fifties 0.7917 =σ^2
  • 20. EXPECTED FIFTIES IN A SERIES X f(x) xf(x)  Thus the expected value E(x) for Sachin scoring a 0 0.433 0.00 half century in a series is 1 0.284 0.28 0.88 or almost 1  In every test series he 2 0.254 0.51 plays, he is expected to score a half century 3 0.030 0.09 1.000 0.88
  • 22. TOTAL RUNS IN A SERIES Histogram 14 Bin(Total in a series) Frequency 12 0 2 10 60 7 120 8 8 Frequency 180 7 6 240 13 Frequency 300 11 4 360 7 2 420 8 0 More than 420 4 0 60 120 180 240 300 360 420 More than 420 Bin
  • 23. TOTAL RUNS IN A SERIES.. Statistical Summary  Slightly skewed towards Mean 219.2836 right. Median 213 Mode 199  Skewness is just 0.07 Standard Deviation 126.9055 approximately. Sample Variance 16104.99  Data can be considered Skewness 0.068918 Range 493 to be normally Minimum 0 distributed for analysis Maximum 493 purpose. Sum 14692 Count 67
  • 24. PROBABILITY OF A TOTAL OF 250 IN A SERIES  Std Deviation σ = 127 Mean µ = 219 x = 400  f(x) = 0.00314* exp-(250-219)^2)/(2*127^2) = 0.003 = 0.3 %
  • 25. PROBABILITY OF A TOTAL OF 100 IN A SERIES  Std Deviation σ = 127 Mean µ = 219 x = 400  f(x) = 0.00314* exp-(100-219)^2)/(2*127^2) = 0.002 = 0.2 %
  • 26. PROBABILITY OF A TOTAL OF 400 IN A SERIES  Std Deviation σ = 127 Mean µ = 219 x = 400  f(x) = 0.00314* exp-(400-219)^2)/(2*127^2) = 0.001 = 0.1 %
  • 27. NOT OUT INNINGS : BINOMIAL DISTRIBUTION
  • 28. NOT OUT INNINGS: BINOMIAL DISTRIBUTION  Only two possibilities in an innings - out or not out.  Remaining not out in any match is independent of being out or not out in any other match.  Probability of remaining not out = 1- Probability of being out. => Binomial Probability distribution n! n x P( X )  p (1  p) x x !(n  x )!
  • 29. NOT OUT INNINGS..  Probability of being not out in a match P(N) = Total not out innings/ Total matches = 32/290 = 0.11  Probability of being out in a match P(O) = 1 - Probability of being not out in a match = 1 – 0.11 = 0.89
  • 30. PROBABILLITY OF SINGLE NOT OUT IN 10 MATCHES  X=1 n = 10 p = 0.11 1 – p = 0.89 Thus, n! P( X )  p x (1  p)n  x x !(n  x )! = (10!/(1! * 9!)) * (0.11^1)*(0.89^9) = 0.39
  • 31. PROBABILLITY OF TWO NOT OUTS IN 20 MATCHES  X=2 n = 20 p = 0.11 1 – p = 0.89 Thus, n! P( X )  p x (1  p)n  x x !(n  x )! = (20!/(2! * 18!)) * (0.11^2)*(0.89^18) = 0. 28
  • 32. KEY LEARNINGS 1. Trends contrasting to the preconceived notions. (Binomial Probability Distribution) 2. Proves the general statements.(Half Century every match) 3. Great tool for analysis. 4. Easy to use, apply and understand.