SlideShare une entreprise Scribd logo
1  sur  3
Télécharger pour lire hors ligne
Measures of Dispersion                              Variance and the standard deviation
  •    Concept and objective                                      Variance is the average squared deviation from the mean
  •    Range, inter-quartile range (IQR =Q3 – Q1)
  •    Variance and standard deviation (sd)                                 1    N

  •    Computation
                                                                   σ2 =
                                                                            N
                                                                                ∑(X
                                                                                i =1
                                                                                        i   − µ )2
                                                                                                          x    x                x       x
  •    Chebyshev’s Inequality                                                                             X1   Xi    µ          X2      XN
  •    Relative dispersion -- the coefficient of
                                                                       σ, the standard deviation is the (+ve) square root of the variance
       variation
                                                                  In the sample variance calculation, use the denominator (n-1)
                           σ
                  C.V. =     × 100%
                           µ                                                     1 n
                                                                        S2 =         ∑ ( X i − X )2
                                                                               n − 1 i =1




            Chebyshev’s Inequality
                                                                          Chebyshev’s Inequality:
      • At least (1 - 1/k2) proportion of the data must be
        within k standard deviation of the mean.                               Illustration
      • Here k is a number (not necessarily an integer)
        greater than 1.
      • The statement is valid for ANY distribution,
        discrete or continuous, symmetric or otherwise.
      • If a r.v. X has a mean µ and a standard deviation σ
      then a equivalent probability statement is :
                                                                µ-3σ         µ-2σ       µ-σ           µ    µ+σ           µ+2σ        µ+3σ
                                           1
             P [ | X − µ | > kσ ] ≤                                                            At least 75%
                                           k2
                                                                                       At least 88.89%




       Understanding Standard deviation
                                                                    The Coefficient of Variation:
294 MLA of WB has an average wealth of 68 Lakh                     A measure of relative dispersion
and s.d. = 10 Lakh. What does that tell you?                                                   σ    S
                                                                                                 or                 × 100 %
  In particular, what can you say about % of MLAs                                              µ    X
    having wealth                                             • Unit free
  • between 58L and 78L ?                                     • Amenable to comparison
                                                              • Often expressed in terms of percentages
  • between 53L and 83L ?
  • Less than 53 L or more than 83L?
  • between 50L and 1 crore ?
  • More than 1 crore?




                                                                                                                                             1
Box Plot                                                                           Box Plot
                                                                                                                Elements of a Box Plot
                                                                                                     Smallest data                          Largest data point
                                                                                                     point not below                        not exceeding        Suspected
                                                                                 Outlier             inner fence                            inner fence          outlier




  o                X                                                 X    *      o               X                                                 X
                                                                                                                                                                  *


                                                                                                 Inner                 Q1   Median
                                                                                 Outer                                               Q3             Inner                    Outer
                                                                                 Fence           Fence                                              Fence                    Fence
                                                                                                Q1-1.5(IQR)                 Interquartile         Q3+1.5(IQR)
                                                                                                                             Range
                                                                                 Q1-3(IQR)                                                                            Q3+3(IQR)




           Review Descriptive Statistics
                                                                                              Poll Forecasting – Exit polls
• Graphical representations, frequency
  distribution                                                                             • Exit Polls in US election 2000 in the critical
                                                                                             state of Florida
• Measures of central tendency and dispersion
      – Computation and interpretation                                                     • Indian Election 2004, 2009,
      – Chebychev’s result                                                                 • UP 2012
• Skewness and Kurtosis                                                                    • Karnataka 2008
• Outliers
      – What are they? How to detect?
      – What to do if there are outliers?




             A simple example: Overview                                                        Some of the questions to be
  Population = all projects undertaken by a company                                                    answered
  An unknown proportion π of them took longer than scheduled
                                          *                                                • To what degree, the randomness in Y can be
   * Delay                                         *
                                *      ** *                  *                               attributed to sampling fluctuations?
                                          *         *
                            *
                                        * *
                                                                                           • How close is Y = p to π ?
                                 *              *                                                           n
                                                        **
                            *          * *      *                *                         • If we want p to be within ±0.05 of π , how
                                                                                             many projects do we need to look at?
A random sample of n projects are selected. Y of them are found to be delayed.

  (sample outcome) Y is random, but the randomness depends on π.

  Given the value of Y, one can make an objective inference about π.




                                                                                                                                                                                     2
Myth and Mystery of Probability
                                                        Overview of Probability
• What is chance of getting any rain today in   • Approaches for defining probability
  the campus?
• What is the probability that India will win   • Basic Probability rules
  WC2015?
• What is the chance that India’s space         • Conditional probability and notion of
  mission will send a human being to moon         independence
  by 2020?

                                                • Bayes’ rule




      Approaches for defining
                                                              Probability Laws
           probability
• Classical approach                            •   0 ≤ P[A] ≤1;
                                                •   P[impossible event]=0; P[Sure event]=1
• (Asymptotic) Relative frequency approach      •   P[A or B] = P[A] + P[B] - P[AB]
                                                •   In particular, P[not A] = 1- P[A]
• Subjective probability                        •   Look at the Venn diagram and write down
                                                    other formulae like
                                                       • P[A] = P[A and B] + P[A and (not B)]




                                                                                                3

Contenu connexe

Tendances

UT Austin - Portugal Lectures on Portfolio Choice
UT Austin - Portugal Lectures on Portfolio ChoiceUT Austin - Portugal Lectures on Portfolio Choice
UT Austin - Portugal Lectures on Portfolio Choiceguasoni
 
Fundamental Theorem of Asset Pricing
Fundamental Theorem of Asset PricingFundamental Theorem of Asset Pricing
Fundamental Theorem of Asset Pricingguasoni
 
9. administración y pronóstico de la demanda
9. administración y pronóstico de la demanda9. administración y pronóstico de la demanda
9. administración y pronóstico de la demandahackerdaddy
 
Who Should Sell Stocks?
Who Should Sell Stocks?Who Should Sell Stocks?
Who Should Sell Stocks?guasoni
 
Detecting Drug Effects in the Brain
Detecting Drug Effects in the BrainDetecting Drug Effects in the Brain
Detecting Drug Effects in the Brainhtstatistics
 
Parameter Estimation for Semiparametric Models with CMARS and Its Applications
Parameter Estimation for Semiparametric Models with CMARS and Its ApplicationsParameter Estimation for Semiparametric Models with CMARS and Its Applications
Parameter Estimation for Semiparametric Models with CMARS and Its ApplicationsSSA KPI
 
Pr 2-bayesian decision
Pr 2-bayesian decisionPr 2-bayesian decision
Pr 2-bayesian decisionshivamsoni123
 
Transactional Data Mining
Transactional Data MiningTransactional Data Mining
Transactional Data MiningTed Dunning
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tenconDr. Jai Sakthi
 
AP Calculus - Semester Review
AP Calculus - Semester ReviewAP Calculus - Semester Review
AP Calculus - Semester ReviewChris Wilson
 
Statistics lecture 10(ch10)
Statistics lecture 10(ch10)Statistics lecture 10(ch10)
Statistics lecture 10(ch10)jillmitchell8778
 
Chapter 3 projection
Chapter 3 projectionChapter 3 projection
Chapter 3 projectionNBER
 
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...Antoine Savine
 
Statistics lecture 9 (chapter 8)
Statistics lecture 9 (chapter 8)Statistics lecture 9 (chapter 8)
Statistics lecture 9 (chapter 8)jillmitchell8778
 

Tendances (20)

Interview Preparation
Interview PreparationInterview Preparation
Interview Preparation
 
UT Austin - Portugal Lectures on Portfolio Choice
UT Austin - Portugal Lectures on Portfolio ChoiceUT Austin - Portugal Lectures on Portfolio Choice
UT Austin - Portugal Lectures on Portfolio Choice
 
Fundamental Theorem of Asset Pricing
Fundamental Theorem of Asset PricingFundamental Theorem of Asset Pricing
Fundamental Theorem of Asset Pricing
 
9. administración y pronóstico de la demanda
9. administración y pronóstico de la demanda9. administración y pronóstico de la demanda
9. administración y pronóstico de la demanda
 
Who Should Sell Stocks?
Who Should Sell Stocks?Who Should Sell Stocks?
Who Should Sell Stocks?
 
Detecting Drug Effects in the Brain
Detecting Drug Effects in the BrainDetecting Drug Effects in the Brain
Detecting Drug Effects in the Brain
 
Parameter Estimation for Semiparametric Models with CMARS and Its Applications
Parameter Estimation for Semiparametric Models with CMARS and Its ApplicationsParameter Estimation for Semiparametric Models with CMARS and Its Applications
Parameter Estimation for Semiparametric Models with CMARS and Its Applications
 
Risk08a
Risk08aRisk08a
Risk08a
 
Pr 2-bayesian decision
Pr 2-bayesian decisionPr 2-bayesian decision
Pr 2-bayesian decision
 
Slides edf-1
Slides edf-1Slides edf-1
Slides edf-1
 
Transactional Data Mining
Transactional Data MiningTransactional Data Mining
Transactional Data Mining
 
Final
FinalFinal
Final
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
 
Black scholas theory for venu(RVSKVK)
Black scholas theory for venu(RVSKVK)Black scholas theory for venu(RVSKVK)
Black scholas theory for venu(RVSKVK)
 
AP Calculus - Semester Review
AP Calculus - Semester ReviewAP Calculus - Semester Review
AP Calculus - Semester Review
 
Statistics lecture 10(ch10)
Statistics lecture 10(ch10)Statistics lecture 10(ch10)
Statistics lecture 10(ch10)
 
Fuzzy logic in ATM
Fuzzy logic in ATMFuzzy logic in ATM
Fuzzy logic in ATM
 
Chapter 3 projection
Chapter 3 projectionChapter 3 projection
Chapter 3 projection
 
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
60 Years Birthday, 30 Years of Ground Breaking Innovation: A Tribute to Bruno...
 
Statistics lecture 9 (chapter 8)
Statistics lecture 9 (chapter 8)Statistics lecture 9 (chapter 8)
Statistics lecture 9 (chapter 8)
 

En vedette

M2 l1 relations functions
M2 l1 relations functionsM2 l1 relations functions
M2 l1 relations functionsRDemolina
 
ProjectX-workbook.Jan27
ProjectX-workbook.Jan27ProjectX-workbook.Jan27
ProjectX-workbook.Jan27Kristen Deyo
 
MRC Presentation Community Outreach
MRC Presentation Community OutreachMRC Presentation Community Outreach
MRC Presentation Community OutreachRenee Sheriff
 
Linea
LineaLinea
Lineamau1
 
3D Visualization Techniques
3D Visualization Techniques3D Visualization Techniques
3D Visualization TechniquesSheldon Jessup
 
Shahid Lecture-12- MKAG1273
Shahid Lecture-12- MKAG1273Shahid Lecture-12- MKAG1273
Shahid Lecture-12- MKAG1273nchakori
 
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)Sehan Lee
 
Agrobacterium mediated transformation
Agrobacterium mediated transformationAgrobacterium mediated transformation
Agrobacterium mediated transformationDeepika Rana
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationVasant Kothari
 

En vedette (14)

M2 l1 relations functions
M2 l1 relations functionsM2 l1 relations functions
M2 l1 relations functions
 
ProjectX-workbook.Jan27
ProjectX-workbook.Jan27ProjectX-workbook.Jan27
ProjectX-workbook.Jan27
 
Clustering - Scheda corso LEN
Clustering - Scheda corso LENClustering - Scheda corso LEN
Clustering - Scheda corso LEN
 
Tqm: total quality management - Scheda corso LEN
Tqm: total quality management - Scheda corso LENTqm: total quality management - Scheda corso LEN
Tqm: total quality management - Scheda corso LEN
 
MRC Presentation Community Outreach
MRC Presentation Community OutreachMRC Presentation Community Outreach
MRC Presentation Community Outreach
 
Linea
LineaLinea
Linea
 
MS PowerPoint 2003 - Scheda corso LEN
MS PowerPoint 2003 - Scheda corso LENMS PowerPoint 2003 - Scheda corso LEN
MS PowerPoint 2003 - Scheda corso LEN
 
3D Visualization Techniques
3D Visualization Techniques3D Visualization Techniques
3D Visualization Techniques
 
Shahid Lecture-12- MKAG1273
Shahid Lecture-12- MKAG1273Shahid Lecture-12- MKAG1273
Shahid Lecture-12- MKAG1273
 
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
GoogleCTF 2016 [No Big Deal] Write-Up (ver.korean)
 
VNMC Mekong Dam Study: Modelling
VNMC Mekong Dam Study: ModellingVNMC Mekong Dam Study: Modelling
VNMC Mekong Dam Study: Modelling
 
Agrobacterium mediated transformation
Agrobacterium mediated transformationAgrobacterium mediated transformation
Agrobacterium mediated transformation
 
Coastal regulation zone
Coastal regulation   zoneCoastal regulation   zone
Coastal regulation zone
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlation
 

Similaire à Session 2

Basic statistics
Basic statisticsBasic statistics
Basic statisticsdhwhite
 
Randomness conductors
Randomness conductorsRandomness conductors
Randomness conductorswtyru1989
 
Hypergeometric Distribution
Hypergeometric DistributionHypergeometric Distribution
Hypergeometric Distributionmathscontent
 
Lecture on solving1
Lecture on solving1Lecture on solving1
Lecture on solving1NBER
 
Exchange confirm
Exchange confirmExchange confirm
Exchange confirmNBER
 
Perceptron Arboles De Decision
Perceptron Arboles De DecisionPerceptron Arboles De Decision
Perceptron Arboles De DecisionESCOM
 
Statr sessions 9 to 10
Statr sessions 9 to 10Statr sessions 9 to 10
Statr sessions 9 to 10Ruru Chowdhury
 
Ssp notes
Ssp notesSsp notes
Ssp notesbalu902
 
アーベル論理でセッション型
アーベル論理でセッション型アーベル論理でセッション型
アーベル論理でセッション型Yoichi Hirai
 
Probability, Discrete Probability, Normal Probabilty
Probability, Discrete Probability, Normal ProbabiltyProbability, Discrete Probability, Normal Probabilty
Probability, Discrete Probability, Normal ProbabiltyFaisal Hussain
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysisrik0
 
Mhf4 U Trig
Mhf4 U TrigMhf4 U Trig
Mhf4 U Trigmaxchem
 

Similaire à Session 2 (20)

Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Randomness conductors
Randomness conductorsRandomness conductors
Randomness conductors
 
Hypergeometric Distribution
Hypergeometric DistributionHypergeometric Distribution
Hypergeometric Distribution
 
Hypergeometric Distribution
Hypergeometric DistributionHypergeometric Distribution
Hypergeometric Distribution
 
Lecture on solving1
Lecture on solving1Lecture on solving1
Lecture on solving1
 
Exchange confirm
Exchange confirmExchange confirm
Exchange confirm
 
Perceptron Arboles De Decision
Perceptron Arboles De DecisionPerceptron Arboles De Decision
Perceptron Arboles De Decision
 
Models
ModelsModels
Models
 
Statr sessions 9 to 10
Statr sessions 9 to 10Statr sessions 9 to 10
Statr sessions 9 to 10
 
Ssp notes
Ssp notesSsp notes
Ssp notes
 
Simplex3
Simplex3Simplex3
Simplex3
 
6.4
6.46.4
6.4
 
Modeling and interpretation
Modeling and interpretationModeling and interpretation
Modeling and interpretation
 
Normal distribution stat
Normal distribution statNormal distribution stat
Normal distribution stat
 
アーベル論理でセッション型
アーベル論理でセッション型アーベル論理でセッション型
アーベル論理でセッション型
 
Probability, Discrete Probability, Normal Probabilty
Probability, Discrete Probability, Normal ProbabiltyProbability, Discrete Probability, Normal Probabilty
Probability, Discrete Probability, Normal Probabilty
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Mhf4 U Trig
Mhf4 U TrigMhf4 U Trig
Mhf4 U Trig
 
Chi sqr
Chi sqrChi sqr
Chi sqr
 
T tests anovas and regression
T tests anovas and regressionT tests anovas and regression
T tests anovas and regression
 

Plus de vivek_shaw

Lecture 11 market structure- perfect competition
Lecture 11  market structure- perfect competitionLecture 11  market structure- perfect competition
Lecture 11 market structure- perfect competitionvivek_shaw
 
Lecture 9 costs
Lecture 9  costsLecture 9  costs
Lecture 9 costsvivek_shaw
 
Lecture 8 production, optimal inputs
Lecture 8  production, optimal inputsLecture 8  production, optimal inputs
Lecture 8 production, optimal inputsvivek_shaw
 
Lecture 8 production, optimal inputs (1)
Lecture 8  production, optimal inputs (1)Lecture 8  production, optimal inputs (1)
Lecture 8 production, optimal inputs (1)vivek_shaw
 
Lecture 3 dds sand elasticity
Lecture 3  dds sand elasticityLecture 3  dds sand elasticity
Lecture 3 dds sand elasticityvivek_shaw
 
Consumertheory1
Consumertheory1Consumertheory1
Consumertheory1vivek_shaw
 
Consumer theory 2
Consumer theory 2Consumer theory 2
Consumer theory 2vivek_shaw
 
Class2 market, demand and supply
Class2  market, demand and supplyClass2  market, demand and supply
Class2 market, demand and supplyvivek_shaw
 
Policy implications
Policy implicationsPolicy implications
Policy implicationsvivek_shaw
 
Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012vivek_shaw
 
Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012vivek_shaw
 
Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012vivek_shaw
 
Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012vivek_shaw
 
Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012vivek_shaw
 

Plus de vivek_shaw (20)

Lecture 11 market structure- perfect competition
Lecture 11  market structure- perfect competitionLecture 11  market structure- perfect competition
Lecture 11 market structure- perfect competition
 
Lecture 9 costs
Lecture 9  costsLecture 9  costs
Lecture 9 costs
 
Lecture 8 production, optimal inputs
Lecture 8  production, optimal inputsLecture 8  production, optimal inputs
Lecture 8 production, optimal inputs
 
Lecture 8 production, optimal inputs (1)
Lecture 8  production, optimal inputs (1)Lecture 8  production, optimal inputs (1)
Lecture 8 production, optimal inputs (1)
 
Lecture 3 dds sand elasticity
Lecture 3  dds sand elasticityLecture 3  dds sand elasticity
Lecture 3 dds sand elasticity
 
Game theory 3
Game theory 3Game theory 3
Game theory 3
 
Game theory 1
Game theory 1Game theory 1
Game theory 1
 
Game theory 2
Game theory 2Game theory 2
Game theory 2
 
Ford motors
Ford motorsFord motors
Ford motors
 
Consumertheory1
Consumertheory1Consumertheory1
Consumertheory1
 
Consumer theory 2
Consumer theory 2Consumer theory 2
Consumer theory 2
 
Class2 market, demand and supply
Class2  market, demand and supplyClass2  market, demand and supply
Class2 market, demand and supply
 
Auctions 1
Auctions 1Auctions 1
Auctions 1
 
Costs2
Costs2Costs2
Costs2
 
Policy implications
Policy implicationsPolicy implications
Policy implications
 
Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012
 
Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012
 
Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012
 
Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012
 
Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012
 

Session 2

  • 1. Measures of Dispersion Variance and the standard deviation • Concept and objective Variance is the average squared deviation from the mean • Range, inter-quartile range (IQR =Q3 – Q1) • Variance and standard deviation (sd) 1 N • Computation σ2 = N ∑(X i =1 i − µ )2 x x x x • Chebyshev’s Inequality X1 Xi µ X2 XN • Relative dispersion -- the coefficient of σ, the standard deviation is the (+ve) square root of the variance variation In the sample variance calculation, use the denominator (n-1) σ C.V. = × 100% µ 1 n S2 = ∑ ( X i − X )2 n − 1 i =1 Chebyshev’s Inequality Chebyshev’s Inequality: • At least (1 - 1/k2) proportion of the data must be within k standard deviation of the mean. Illustration • Here k is a number (not necessarily an integer) greater than 1. • The statement is valid for ANY distribution, discrete or continuous, symmetric or otherwise. • If a r.v. X has a mean µ and a standard deviation σ then a equivalent probability statement is : µ-3σ µ-2σ µ-σ µ µ+σ µ+2σ µ+3σ 1 P [ | X − µ | > kσ ] ≤ At least 75% k2 At least 88.89% Understanding Standard deviation The Coefficient of Variation: 294 MLA of WB has an average wealth of 68 Lakh A measure of relative dispersion and s.d. = 10 Lakh. What does that tell you? σ S or × 100 % In particular, what can you say about % of MLAs µ X having wealth • Unit free • between 58L and 78L ? • Amenable to comparison • Often expressed in terms of percentages • between 53L and 83L ? • Less than 53 L or more than 83L? • between 50L and 1 crore ? • More than 1 crore? 1
  • 2. Box Plot Box Plot Elements of a Box Plot Smallest data Largest data point point not below not exceeding Suspected Outlier inner fence inner fence outlier o X X * o X X * Inner Q1 Median Outer Q3 Inner Outer Fence Fence Fence Fence Q1-1.5(IQR) Interquartile Q3+1.5(IQR) Range Q1-3(IQR) Q3+3(IQR) Review Descriptive Statistics Poll Forecasting – Exit polls • Graphical representations, frequency distribution • Exit Polls in US election 2000 in the critical state of Florida • Measures of central tendency and dispersion – Computation and interpretation • Indian Election 2004, 2009, – Chebychev’s result • UP 2012 • Skewness and Kurtosis • Karnataka 2008 • Outliers – What are they? How to detect? – What to do if there are outliers? A simple example: Overview Some of the questions to be Population = all projects undertaken by a company answered An unknown proportion π of them took longer than scheduled * • To what degree, the randomness in Y can be * Delay * * ** * * attributed to sampling fluctuations? * * * * * • How close is Y = p to π ? * * n ** * * * * * • If we want p to be within ±0.05 of π , how many projects do we need to look at? A random sample of n projects are selected. Y of them are found to be delayed. (sample outcome) Y is random, but the randomness depends on π. Given the value of Y, one can make an objective inference about π. 2
  • 3. Myth and Mystery of Probability Overview of Probability • What is chance of getting any rain today in • Approaches for defining probability the campus? • What is the probability that India will win • Basic Probability rules WC2015? • What is the chance that India’s space • Conditional probability and notion of mission will send a human being to moon independence by 2020? • Bayes’ rule Approaches for defining Probability Laws probability • Classical approach • 0 ≤ P[A] ≤1; • P[impossible event]=0; P[Sure event]=1 • (Asymptotic) Relative frequency approach • P[A or B] = P[A] + P[B] - P[AB] • In particular, P[not A] = 1- P[A] • Subjective probability • Look at the Venn diagram and write down other formulae like • P[A] = P[A and B] + P[A and (not B)] 3