SlideShare une entreprise Scribd logo
1  sur  42
Télécharger pour lire hors ligne
Outline
                                      Topics
Probability, sample space, random variable
                    Probability distribution
                            Expected value
                                   Variance
                                  Moments
Linear transformations of random variables
                        Joint distributions




         Applied Statistics for Economics
      2. Introduction to Probability Theory

                      SFC - juliohuato@gmail.com


                                    Spring 2012



              SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                            Topics
      Probability, sample space, random variable
                          Probability distribution
                                  Expected value
                                         Variance
                                        Moments
      Linear transformations of random variables
                              Joint distributions



Topics
Probability, sample space, random variable
Probability distribution
Expected value
Variance
Moments
Linear transformations of random variables
Joint distributions

                    SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                                Topics
          Probability, sample space, random variable
                              Probability distribution
                                      Expected value
                                             Variance
                                            Moments
          Linear transformations of random variables
                                  Joint distributions


Topics



   The main topics in this chapter are:
         random variables and probability distributions,
         expected values: mean and variance, and
         Two random variables jointly considered.




                        SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Probability
   The world in motion is viewed as a set of random processes or
   random experiments.
   Randomness means that, no matter how much our understanding
   of the world may advance, there is always an element of ignorance
   or uncertainty in such understanding. In other words: given
   specific causes, we don’t know fully which states of the world will
   result. Or, given specific states of the world, we don’t know fully
   what specifically caused such states of the world.
   In other words, we are uncertain or – more plainly said – ignorant
   about the specific causes or, alternatively, effects involved in these
   processes.
                       SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability


   Examples of random processes: Your meeting the next person, SFC
   students commuting to school, residents of the U.S. producing new
   goods in a given year, etc.
   Why are they random? Because we are uncertain about the gender
   or the age of the next person you’ll meet, the commuting time of
   SFC students or the means of transportation they will use, the
   annual gross domestic product in the U.S. or its composition, etc.



                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability



   The mutually exclusive possible results of these experiments are
   called the outcomes. E.g. the next person you’ll meet could be
   female or male, young or old; SFC students may take a few or
   many minutes to commute to school; U.S. annual GDP may go up
   or down by some amount compared to the previous year, etc.




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                                Topics
          Probability, sample space, random variable
                              Probability distribution
                                      Expected value
                                             Variance
                                            Moments
          Linear transformations of random variables
                                  Joint distributions


Probability
   Probability: the degree of belief that the outcome of an
   experiment will be a particular one.
   How to decide which probability to assign to a particular outcome of an
   experiment (e.g. that if you meet another person, the gender of such
   person will be female)? How to make this decision in a well-informed,
   disciplined, scientific way?1
   One can only use experience – individual or collective – i.e. history. We
   may keep record of the gender of the people we meet over time and use
   the data compiled to inform our belief or look at records on the gender
   composition of the local population, etc.
      1
        In a sense, the whole purpose of statistics is to determine probabilities or,
   alternatively, expectations based on probabilities.
                        SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                                  Topics
            Probability, sample space, random variable
                                Probability distribution
                                        Expected value
                                               Variance
                                              Moments
            Linear transformations of random variables
                                    Joint distributions


Sample space, event


   Sample space or population: the set of all the possible outcomes of
   the experiment. E.g. the sample space of the experiment of
   flipping a coin once is: S = {H, T }.2
   Event: a subset of the sample space, i.e. a set of one or more
   outcomes. E.g. the event (M ≤ 1) that your car will “need one
   repair at most” includes “no repairs” (M = 0) and “one repair”
   (M = 1).


     2
         We rule out ‘freak’ possibilities, like the coin landing on its edge.
                          SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Random variables


   Random variable (r.v.): a numerical summary of a random
   outcome. For example, G = g , where (e.g.) g is 0 if ‘male’ and 1
   if ‘female’. The number of times your car needs repair during a
   given month: M = m, where m = 0, 1, 2, 3, . . .. The time it takes
   for SFC students to commute to school: T = t, where t is time in
   minutes.




                       SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Random variables


   There are discrete and continuous random variables. Gender,
   summarized as a 0 if ‘male’ and 1 if ‘female’, and the number of
   car repairs in a month are discrete random variables. The
   commuting time, if recorded in fractions of an hour – or even
   fractions of minutes and seconds, etc. can be regarded as a
   continuous r.v.




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Probability distribution of a discrete r.v.

  The probability distribution of a discrete r.v. is a list of all the values of
  the r.v. and the probabilities associated to each value of the r.v. By
  convention, the probabilities are a number between 0 and 1, where 0
  means impossibility and 1 means full certainty; the probabilities over the
  sample space must add up to 1. E.g. let G = 0, 1 be the r.v. ‘gender of
  the next person you’ll meet’. Then:

                                              G         Pr(G = g )
                                             0                0.45
                                             1                0.55




                       SFC - juliohuato@gmail.com           Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability distribution of a discrete r.v.


  Using the information in the probability distribution, you can compute
  the probability of a given event. E.g. the probability that you’ll meet ‘a
  male or a female’:


  Pr(G = 0 or G = 1) = Pr(G = 0)+Pr(G = 1) = 0.45+0.55 = 1 = 100%.

  In words, we are completely certain that you’ll meet either a male or a
  female the next time you meet a person.



                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability distribution of a discrete r.v.
  Admittedly, the previous example is trivial. But consider the probability
  distribution of your car needing repair(s) in a given month. The r.v.
  ‘number of repairs’ needed is denoted as M:

                                             M         Pr(M = m)
                                            0                0.80
                                            1                0.10
                                            2                0.06
                                            3                0.03
                                            4                0.01



  What’s the probability that the the car will need one or two repairs in a
  month? Answer:
  Pr(M = 1 or M = 2) = Pr(M = 1)+Pr(M = 2) = 0.10+0.06 = 0.16 = 16%.

                      SFC - juliohuato@gmail.com          Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability distribution of a discrete r.v.

  The cumulative probability distribution (also known as a
  ‘cumulative distribution function’ or c.d.f.) is the probability that
  the random variable is less than or equal to a particular value. The
  first two columns of the following table are the same as in the
  previous table. The last column gives the c.d.f.:

                                    M       Pr(M = m)       Pr(M ≤ m)
                                   0              0.80            0.80
                                   1              0.10            0.90
                                   2              0.06            0.96
                                   3              0.03            0.99
                                   4              0.01            1.00




                      SFC - juliohuato@gmail.com         Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Probability distribution of a discrete r.v.


  A binary discrete r.v. (e.g. G = 0, 1) is called a Bernoulli r.v. The
  Bernoulli distribution is:
                                       1 with probability p
                         G=
                                       0 with probability 1 − p

  where p is the probability of the next person being ‘female’.




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                                  Topics
            Probability, sample space, random variable
                                Probability distribution
                                        Expected value
                                               Variance
                                              Moments
            Linear transformations of random variables
                                    Joint distributions


Probability distribution of a continuous r.v.

  The cumulative probability distribution of a continuous r.v. is also
  the probability that the random variable is less than or equal to a
  particular value.
  The probability density function (p.d.f.) of a continuous random
  variable summarizes the probabilities for each value of the random
  variable.
  The mathematical description of the p.d.f. of a continuous variable
  requires that you’re familiar with calculus. So, we’ll skip it for now.
  NB: Strictly speaking, the probability that a continuous random variable has a particular value is zero. We can only
  speak of the probability of the random variable falling in a range (between two given values).
  NB2: The p.d.f. and the c.d.f. show the same information in different formats.


                           SFC - juliohuato@gmail.com          Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Characteristics of a r.v. distribution



  In the practice of statistics, two basic measures are used
  extensively to characterize the distribution of a r.v.:
      the expected value or mean (or average) and
      the variance.




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Expected value
  The expected value of a r.v. X or E (X ) is the average value of the
  variable over many repeated trials.
  It is computed as a weighted average of the possible outcomes,
  where the weights are the probabilities of the outcomes. It is also
  called the mean of X and denoted by µX . For a discrete r.v.:
                                                                              k
                E (X ) = x1 p1 + x2 p2 + · · · + xk pk =                          x i pi
                                                                            i=1

  E.g.: You loan $100 to your friend for a year at 10% interest.
  There’s a 99% chance he’ll repay the loan and 1% he won’t.
  What’s the expected value of your loan at maturity?
                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Expected value


  E.g.: You loan $100 to your friend for a year at 10% interest.
  There’s a 99% chance he’ll repay the loan and 1% he won’t.
  What’s the expected value of your loan at maturity?
  Answer:
                ($110 × 0.99) + ($0 × .01) = $108.90
  E.g.: What’s the expected value or average of the number of car
  repairs per month? See the table above.



                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Expected value

  E.g.: What’s the expected value or average of the number of car
  repairs per month (M)? See the table above.
  Answer:

              E (M) = (0 × 0.80) + (1 × 0.10) + (2 × 0.06)+

                            (3 × 0.03) + (4 × 0.01) = 0.35
  What does that mean?
  E.g.: In general, what’s the expected value of a Bernoulli r.v. with
  Pr(G = 1) = p and Pr(G = 0) = 1 − p?

                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Expected value

  E.g.: In general, what’s the expected value of a Bernoulli r.v. with
  Pr(G = 1) = p and Pr(G = 0) = 1 − p?
  Answer:
                  E (G ) = (1 × p) + (0 × (1 − p)) = p
  Note 1: Think of the operator E (.) as a function that transforms
  data on a variable by multiplying each value of the variable by its
  probability and then adding up all the products.
  Note 2: The formula for the expected value of a continuous r.v.
  requires calculus. So we’ll skip it for now.


                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Variance and standard deviation
  The variance of a r.v. Y is:

                            var(Y ) = σY = E [(Y − µY )2 ]
                                       2


  The standard deviation is the positive square root of the variance
  σY :
                                                     2
                                  s.d.(Y ) = σY = + σY
  Basically, the s.d. gives the same information as the variance, but
  in units that are easier to understand. The units of the standard
  deviation are the same units as Y and µY .
  What is the intuition behind the variance and/or the standard
  deviation?
                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Variance and standard deviation

  For a discrete r.v.:
                                                                    k
            var(Y ) = σY = E [(Y − µY )2 ] =
                       2
                                                                        (yi − µY )2 pi
                                                                  i=1

                                                        k
                        s.d.(Y ) = σY =                      (yi − µY )2 pi
                                                       i=1

  E.g.: What are the var. and s.d. of the number of car repairs per
  month (M)?

                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Variance and standard deviation

  E.g.: What are the var. and s.d. of the number of car repairs per
  month (M)?
  Answer:

  var(M) = [(0−0.35)2 ×0.80]+[(1−0.35)2 ×0.10]+[(2−0.35)2 ×0.06]

        +[(3 − 0.35)2 × 0.03] + [(4 − 0.35)2 × 0.01] = 0.6475
                                √
                     s.d.(M) = 0.6475 ∼ 0.80
                                         =
  E.g.: What are the var. and s.d. of a Bernoulli r.v.?

                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Variance
  E.g.: What are the var. and s.d. of a Bernoulli r.v.?
  Answer:
       var(G ) = [(0 − p)2 × (1 − p)] + [(1 − p)2 × p] = p(1 − p)
                                     s.d.(G ) =         p(1 − p)
  Note 1: Think of the operator var(.) as a function that transforms
  data on a variable by taking the distance or difference between
  each value of the variable and its mean, squaring that difference,
  multiplying it by the respective probability, and then adding up all
  the products.
  Note 1: Think of the operator s.d.(.) as a function that transforms
  data on a variable by taking the distance or difference between
  each value of the -variable and its mean, Statistics for Economicsdifference, to Probability
                  SFC juliohuato@gmail.com Applied squaring that 2. Introduction
Outline
                                             Topics
       Probability, sample space, random variable
                           Probability distribution
                                   Expected value
                                          Variance
                                         Moments
       Linear transformations of random variables
                               Joint distributions


Moments

 More formally, in statistics, the characteristics of the distribution of
 a r.v. are called moments.
 E (Y ) is the first moment, E (Y 2 ) is the second moment, and
 E (Y r ) is the r th moment of Y . The first moment is the mean and
 it is a measure of the center of the distribution, the second
 moment is a measure of its dispersion or spread, and r -th moments
 for r > 2 measure other aspects of the distribution’s shape.
 Clearly, the second moment of the distribution is intimately related
 to the variance. How?


                     SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Moments
 Two other measures of the shape (using higher moments) of a
 distribution are:
 Skewness:
                                       E [(Y − µY )3 ]
                                 Skewness =    3
                                             σY
 For a symmetric distribution, the skewness is zero. If the distribution has
 a long left tail, the skewness is negative. If the distribution has a long
 right tail, the skewness is positive.
 Kurtosis:

                                             E [(Y − µY )4 ]
                                  Kurtosis =           4
                                                    σY
 For a distribution with heavy tails (outliers are likely), the kurtosis is to Probability
                  SFC - juliohuato@gmail.com     Applied Statistics for Economics 2. Introduction
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Mean of a linear function of a r.v.
  Consider the income tax schedule:

                                        Y = 2, 000 + 0.8X
  where X is pre-tax earnings and Y is after-tax earnings. What is the
  marginal tax rate?
  Suppose an individual’s next year pre-tax earnings are a r.v. with mean
                     2
  µX and variance σX . Since her pre-tax earnings are random, her after-tax
  earnings are random as well. With the following mean:

                               E (Y ) = µY = 2, 000 + 0.8µX
  Why? Remember that the operator E (Y ) means “multiply each value of
  Y by its probability and add up the results.”
                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Variance of a linear function of a r.v.
  In turn, the variance of Y is:

                              var(Y ) = σY = E [(Y − µY )2 ].
                                         2


  Since Y = 2, 000 + 0.8X , then
  (Y − µY ) = (2, 000 + 0.8X ) − (2, 000 + 0.8µX ) = 0.8(X − µX ).
  Therefore:

          E (Y − µY )2 = E {[0.8(X − µX )2 ]} = 0.64E [(X − µX )2 ].
            2         2
  That is: σY = 0.64σX .
  And taking the positive square root of that number:

                                             σY = 0.8σX

                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Mean and var. of a linear function of a r.v.


  More generally, if X and Y are r.v.’s related by Y = a + bX , then:

                                           µY = a + bµX
                                               2
                                              σY = b 2 σY
                                                        2


                                               σY = bσY




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Two random variables


  We now deal with the distribution of two random variables
  considered together.
  The joint probability distribution of two random variables X and Y
  is the probability that the random variables take certain values at
  once or Pr (X = x, Y = y ).
  The marginal probability distribution of a random variable Y is its
  probability distribution in the context of its relationship with
  (an)other variable(s).



                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Multi-variate distributions
  The following table shows relative frequencies (probabilities):
                        Joint distribution of weather conditions and commuting times
                                                Rain (X = 0)      No rain (X = 1)    Total
                  Long commute (Y = 0)              0.15                0.07         0.22
                  Short commute (Y = 1)             0.15                0.63         0.78
                  Total                             0.30                0.70         1.00


  The cells show the joint probabilities. The marginal probabilities
  (the marginal distribution) of Y can be calculated from the joint
  distribution of X and Y . If X can take l different values x1 , . . . , xl ,
  then:
                                                  l
                          Pr (Y = y ) =                Pr (X = xi , Y = y )
                                                i=1

                      SFC - juliohuato@gmail.com         Applied Statistics for Economics 2. Introduction to Probability
Outline
                                             Topics
       Probability, sample space, random variable
                           Probability distribution
                                   Expected value
                                          Variance
                                         Moments
       Linear transformations of random variables
                               Joint distributions


Conditional distribution


  The conditional probability that Y takes the value y when X is
  known to take the value x is written Pr (Y = y |X = x).
  The conditional distribution of Y given X = x is:
                                                      Pr (X = x, Y = y )
                      Pr (Y = y |X = x) =
                                                          Pr (X = x)




                     SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Conditional mean
  Consider the following table:
                                Joint and conditional distribution of M and A
                                     M =0        M =1        M =2       M =3    M =4       Total
         Joint distribution
         Old car (A = 0)              0.35       0.065      0.05       0.025      0.01      0.50
         New car (A = 1)              0.45       0.035      0.01       0.005      0.00      0.50
         Total                        0.8         0.1       0.06        0.03      0.01      1.00
         Conditional distribution
         Pr(M | A = 0)                0.70        0.13      0.10       0.05       0.02      1.00
         Pr(M | A = 1)                0.90        0.07      0.02       0.01       0.00      1.00

  The conditional expectation of Y given X (or conditional mean of
  Y given X ) is the mean of the conditional distribution of Y given
  X.
                                                  k
                       E (Y |X = x) =                  yi Pr (Y = yi |X = x).
                                                i=1
                      SFC - juliohuato@gmail.com         Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Law of iterated expectations

  The mean height of adults is the weighted average of the mean
  height of men and the mean height of women, weighted by the
  proportions of men and women. More generally:
                                         l
                         E (Y ) =            E (Y |X = xi )Pr (X = xi ).
                                       i=1

  In other terms:
                                      E (Y ) = E [E (Y |X )].
  This is called the law of iterated expectations. If E (Y |X ) = 0 then
  E (Y ) = 0.

                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                             Topics
       Probability, sample space, random variable
                           Probability distribution
                                   Expected value
                                          Variance
                                         Moments
       Linear transformations of random variables
                               Joint distributions


Conditional variance


  The variance of Y conditional on X is the variance of the
  conditional distribution of Y given X :
                                    k
        var(Y |X = x) =                  [yi − E (Y |X = x)]2 Pr (Y = yi |X = x).
                                   i=1

  Example.




                     SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Independence

   Two r.v.’s X and Y are independently distributed (i.e.
   independent) if knowing the value of one of them gives no
   information about the other, that is, if the conditional distribution
   of Y given X equals the marginal distribution of Y . Formally, X
   and Y are independent if, for all values x and y ,

                            Pr(Y = y |X = x) = Pr(Y = y ) or
                     Pr(X = x, Y = y ) = Pr(X = x) Pr(Y = y )
   In other words, the joint distribution of X and Y is the product of their
   marginal distributions.

                       SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Covariance
  The covariance between two r.v.’s X and Y measures the extent to
  which they move together. The covariance is the expected value of
  the product of the deviations of the variables from their expected
  values. The first equation below is the general formula of the
  covariance. The second equation is specific to discrete r.v.’s and it
  assumes that X can take on l values and Y can take on k values:

                     cov(X , Y ) = σXY = E [(X − µX )(Y − µY )]
                              k      l
        cov(X , Y ) =                    (xj − µX )(yi − µY ) Pr(X = xj , Y = yi ).
                             i=1 j=1
  Note that −∞ < σXY < +∞. How do you interpret the covariance
  formula?
                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Correlation

  The problem with the covariance is that it is not bounded. Its size
  depends on the units of X and Y and is, thus, hard to interpret.
  The correlation between X and Y is another measure of their
  covariation. But, unlike the covariance, the correlation eliminates
  the ‘units’ problem. Its formula is:
                                                  cov(X , Y )              σXY
                      corr(X , Y ) =                                 =
                                                var(X ) var(Y )           σX σY

  Note that −1 ≤ corr(X , Y ) ≤ 1.



                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                              Topics
        Probability, sample space, random variable
                            Probability distribution
                                    Expected value
                                           Variance
                                          Moments
        Linear transformations of random variables
                                Joint distributions


Correlation and conditional mean



  If E (Y |X = x) = E (Y ) = µY , then X and Y are uncorrelated.
  That is,

                           cov(X , Y ) = 0 and cov(X , Y ) = 0.
  This follows from the law of iterated expectations.




                      SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                                Topics
          Probability, sample space, random variable
                              Probability distribution
                                      Expected value
                                             Variance
                                            Moments
          Linear transformations of random variables
                                  Joint distributions


Mean and variance of sums of r.v.’s
   The mean of the sum of two r.v.’s, X and Y , is the sum of their means:
                          E (X + Y ) = E (X ) + E (Y ) = µX + µY
   The variance of the sum of X and Y is the sum of their variance plus
   twice their covariance:

                                                        2    2
      var(X + Y ) = var(X ) + var(Y ) + 2cov(X , Y ) = σX + σY + 2σXY
   If X and Y are independent, the covariance is zero and the variance of
   their sum is the sum of their variances:
                                                         2    2
                      var(X + Y ) = var(X ) + var(Y ) = σX + σY
   Why?
                        SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability
Outline
                                               Topics
         Probability, sample space, random variable
                             Probability distribution
                                     Expected value
                                            Variance
                                           Moments
         Linear transformations of random variables
                                 Joint distributions


Sums of r.v.’s
   Let X , Y , and V be r.v.’s and a, b, and c be constants. These
   facts follow from the definitions of mean, variance, covariance, and
   correlation:
                            E (a + bX + cY ) = a + bµX + cµY
                                       var(a + bY ) = b 2 σY
                                                           2

                       var(aX + bY ) = a2 σX + 2abσXY + b 2 σY
                                           2                 2

                                         E (Y 2 ) = σY + µ2
                                                     2
                                                          Y
                                       cov(a + bX + cV , Y )
                                     E (XY ) = σXY + µX µY

                                          |σXY | ≤       2 2
                                                        σX σY
   Can you prove them?
                       SFC - juliohuato@gmail.com       Applied Statistics for Economics 2. Introduction to Probability

Contenu connexe

Tendances

Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsAnirudha si
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
 
Poisson distribution
Poisson distributionPoisson distribution
Poisson distributionAnindya Jana
 
Multicolinearity
MulticolinearityMulticolinearity
MulticolinearityPawan Kawan
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionzcreichenbach
 
Logistic regression with SPSS examples
Logistic regression with SPSS examplesLogistic regression with SPSS examples
Logistic regression with SPSS examplesGaurav Kamboj
 
Multiple Regression and Logistic Regression
Multiple Regression and Logistic RegressionMultiple Regression and Logistic Regression
Multiple Regression and Logistic RegressionKaushik Rajan
 
Poisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionPoisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionDataminingTools Inc
 
A.1 properties of point estimators
A.1 properties of point estimatorsA.1 properties of point estimators
A.1 properties of point estimatorsUlster BOCES
 
Estimation theory 1
Estimation theory 1Estimation theory 1
Estimation theory 1Gopi Saiteja
 

Tendances (20)

Multinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationshipsMultinomial logisticregression basicrelationships
Multinomial logisticregression basicrelationships
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTION
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Poisson distribution
Poisson distributionPoisson distribution
Poisson distribution
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Poission distribution
Poission distributionPoission distribution
Poission distribution
 
Multicolinearity
MulticolinearityMulticolinearity
Multicolinearity
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Anova, ancova
Anova, ancovaAnova, ancova
Anova, ancova
 
Logistic regression with SPSS examples
Logistic regression with SPSS examplesLogistic regression with SPSS examples
Logistic regression with SPSS examples
 
Fishers test
Fishers testFishers test
Fishers test
 
Multiple Regression and Logistic Regression
Multiple Regression and Logistic RegressionMultiple Regression and Logistic Regression
Multiple Regression and Logistic Regression
 
Poisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionPoisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric Distribution
 
8.1
8.18.1
8.1
 
confounding 2*2
confounding 2*2confounding 2*2
confounding 2*2
 
F-Distribution
F-DistributionF-Distribution
F-Distribution
 
A.1 properties of point estimators
A.1 properties of point estimatorsA.1 properties of point estimators
A.1 properties of point estimators
 
Binary Logistic Regression
Binary Logistic RegressionBinary Logistic Regression
Binary Logistic Regression
 
Estimation theory 1
Estimation theory 1Estimation theory 1
Estimation theory 1
 

En vedette

Basic Concept Of Probability
Basic Concept Of ProbabilityBasic Concept Of Probability
Basic Concept Of Probabilityguest45a926
 
Probability and probability distributions ppt @ bec doms
Probability and probability distributions ppt @ bec domsProbability and probability distributions ppt @ bec doms
Probability and probability distributions ppt @ bec domsBabasab Patil
 
Lecture 3 Probability Theory
Lecture 3 Probability TheoryLecture 3 Probability Theory
Lecture 3 Probability TheoryMarina Santini
 
Moments in statistics
Moments in statisticsMoments in statistics
Moments in statistics515329748
 
PROBABILITY
PROBABILITYPROBABILITY
PROBABILITYVIV13
 
Probability Theory and Introduction to Statics
Probability Theory and Introduction to StaticsProbability Theory and Introduction to Statics
Probability Theory and Introduction to StaticsKübra Mumcu
 
Theoretical probability distributions
Theoretical probability distributionsTheoretical probability distributions
Theoretical probability distributionsHasnain Baber
 
Lecture: Joint, Conditional and Marginal Probabilities
Lecture: Joint, Conditional and Marginal Probabilities Lecture: Joint, Conditional and Marginal Probabilities
Lecture: Joint, Conditional and Marginal Probabilities Marina Santini
 
Introduction to probability
Introduction to probabilityIntroduction to probability
Introduction to probabilityDavid Radcliffe
 
Basic Probability
Basic Probability Basic Probability
Basic Probability kaurab
 
Probability powerpoint
Probability powerpointProbability powerpoint
Probability powerpointTiffany Deegan
 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpointspike2904
 
Probability distribution
Probability distributionProbability distribution
Probability distributionManoj Bhambu
 
Theoretical Probability
Theoretical ProbabilityTheoretical Probability
Theoretical ProbabilityLindsey Brown
 
Unit 11.2 theoretical probability
Unit 11.2 theoretical probabilityUnit 11.2 theoretical probability
Unit 11.2 theoretical probabilityMark Ryder
 
Normal distribution notes
Normal  distribution notesNormal  distribution notes
Normal distribution notesmonster2010
 

En vedette (20)

Basic Concept Of Probability
Basic Concept Of ProbabilityBasic Concept Of Probability
Basic Concept Of Probability
 
Probability and probability distributions ppt @ bec doms
Probability and probability distributions ppt @ bec domsProbability and probability distributions ppt @ bec doms
Probability and probability distributions ppt @ bec doms
 
Probability concept and Probability distribution
Probability concept and Probability distributionProbability concept and Probability distribution
Probability concept and Probability distribution
 
Lecture 3 Probability Theory
Lecture 3 Probability TheoryLecture 3 Probability Theory
Lecture 3 Probability Theory
 
Moments in statistics
Moments in statisticsMoments in statistics
Moments in statistics
 
PROBABILITY
PROBABILITYPROBABILITY
PROBABILITY
 
Probability Theory and Introduction to Statics
Probability Theory and Introduction to StaticsProbability Theory and Introduction to Statics
Probability Theory and Introduction to Statics
 
Theoretical probability distributions
Theoretical probability distributionsTheoretical probability distributions
Theoretical probability distributions
 
Lecture: Joint, Conditional and Marginal Probabilities
Lecture: Joint, Conditional and Marginal Probabilities Lecture: Joint, Conditional and Marginal Probabilities
Lecture: Joint, Conditional and Marginal Probabilities
 
Introduction to probability
Introduction to probabilityIntroduction to probability
Introduction to probability
 
Basic Probability
Basic Probability Basic Probability
Basic Probability
 
Probability powerpoint
Probability powerpointProbability powerpoint
Probability powerpoint
 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpoint
 
Statistical theory.3.18.15
Statistical theory.3.18.15Statistical theory.3.18.15
Statistical theory.3.18.15
 
stat_03
stat_03stat_03
stat_03
 
What is statistics
What is statisticsWhat is statistics
What is statistics
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 
Theoretical Probability
Theoretical ProbabilityTheoretical Probability
Theoretical Probability
 
Unit 11.2 theoretical probability
Unit 11.2 theoretical probabilityUnit 11.2 theoretical probability
Unit 11.2 theoretical probability
 
Normal distribution notes
Normal  distribution notesNormal  distribution notes
Normal distribution notes
 

Similaire à Statistics - Probability theory 1

Probability theory 2
Probability theory 2Probability theory 2
Probability theory 2Julio Huato
 
Applied Statistics - Parametric Distributions
Applied Statistics - Parametric DistributionsApplied Statistics - Parametric Distributions
Applied Statistics - Parametric DistributionsJulio Huato
 
Theory of probability and probability distribution
Theory of probability and probability distributionTheory of probability and probability distribution
Theory of probability and probability distributionpolscjp
 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptRama Krishna
 
Optimistic decision making using an
Optimistic decision making using anOptimistic decision making using an
Optimistic decision making using anijaia
 
Ece analog-communications
Ece analog-communicationsEce analog-communications
Ece analog-communicationsTeju Kotti
 
Probability theory
Probability theory Probability theory
Probability theory Gaditek
 
Basic probability theory and statistics
Basic probability theory and statisticsBasic probability theory and statistics
Basic probability theory and statisticsLearnbay Datascience
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsGianluca Bontempi
 
Basic Elements of Probability Theory
Basic Elements of Probability TheoryBasic Elements of Probability Theory
Basic Elements of Probability TheoryMaira Carvalho
 
Brain based teaching_learning_erickson
Brain based teaching_learning_ericksonBrain based teaching_learning_erickson
Brain based teaching_learning_ericksontjj12377
 
Otto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto Elmgart
 
Binomial Probability Distribution statistics.pdf
Binomial Probability Distribution statistics.pdfBinomial Probability Distribution statistics.pdf
Binomial Probability Distribution statistics.pdfUVAS
 
Statistics and probability pp
Statistics and  probability ppStatistics and  probability pp
Statistics and probability ppRuby Vidal
 
Society for Risk Analysis On Transnational Risk & Terrorism
Society for Risk Analysis  On Transnational Risk & TerrorismSociety for Risk Analysis  On Transnational Risk & Terrorism
Society for Risk Analysis On Transnational Risk & TerrorismJohn Marke
 
Conditional Correlation 2009
Conditional Correlation 2009Conditional Correlation 2009
Conditional Correlation 2009yamanote
 

Similaire à Statistics - Probability theory 1 (20)

Probability theory 2
Probability theory 2Probability theory 2
Probability theory 2
 
Applied Statistics - Parametric Distributions
Applied Statistics - Parametric DistributionsApplied Statistics - Parametric Distributions
Applied Statistics - Parametric Distributions
 
Dissertation
DissertationDissertation
Dissertation
 
Theory of probability and probability distribution
Theory of probability and probability distributionTheory of probability and probability distribution
Theory of probability and probability distribution
 
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.pptDescriptive statistics-Skewness-Kurtosis-Correlation.ppt
Descriptive statistics-Skewness-Kurtosis-Correlation.ppt
 
Optimistic decision making using an
Optimistic decision making using anOptimistic decision making using an
Optimistic decision making using an
 
Ece analog-communications
Ece analog-communicationsEce analog-communications
Ece analog-communications
 
Probability theory
Probability theory Probability theory
Probability theory
 
Basic probability theory and statistics
Basic probability theory and statisticsBasic probability theory and statistics
Basic probability theory and statistics
 
Perspective of feature selection in bioinformatics
Perspective of feature selection in bioinformaticsPerspective of feature selection in bioinformatics
Perspective of feature selection in bioinformatics
 
Basic Elements of Probability Theory
Basic Elements of Probability TheoryBasic Elements of Probability Theory
Basic Elements of Probability Theory
 
Brain based teaching_learning_erickson
Brain based teaching_learning_ericksonBrain based teaching_learning_erickson
Brain based teaching_learning_erickson
 
Otto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_StructOtto_Elmgart_Noise_Vol_Struct
Otto_Elmgart_Noise_Vol_Struct
 
Unit – III Spatial data Ajustment.pdf
Unit – III Spatial data Ajustment.pdfUnit – III Spatial data Ajustment.pdf
Unit – III Spatial data Ajustment.pdf
 
Binomial Probability Distribution statistics.pdf
Binomial Probability Distribution statistics.pdfBinomial Probability Distribution statistics.pdf
Binomial Probability Distribution statistics.pdf
 
Dubrovnik Pres
Dubrovnik PresDubrovnik Pres
Dubrovnik Pres
 
Statistics and probability pp
Statistics and  probability ppStatistics and  probability pp
Statistics and probability pp
 
DMV (1) (1).docx
DMV (1) (1).docxDMV (1) (1).docx
DMV (1) (1).docx
 
Society for Risk Analysis On Transnational Risk & Terrorism
Society for Risk Analysis  On Transnational Risk & TerrorismSociety for Risk Analysis  On Transnational Risk & Terrorism
Society for Risk Analysis On Transnational Risk & Terrorism
 
Conditional Correlation 2009
Conditional Correlation 2009Conditional Correlation 2009
Conditional Correlation 2009
 

Plus de Julio Huato

Uncertainty, Risk, and Risk Management
Uncertainty, Risk, and Risk ManagementUncertainty, Risk, and Risk Management
Uncertainty, Risk, and Risk ManagementJulio Huato
 
Hecksher-Ohlin model
Hecksher-Ohlin modelHecksher-Ohlin model
Hecksher-Ohlin modelJulio Huato
 
Int econ bases_trade
Int econ bases_tradeInt econ bases_trade
Int econ bases_tradeJulio Huato
 
A two-good economy
A two-good economyA two-good economy
A two-good economyJulio Huato
 
Two-good output choice
Two-good output choiceTwo-good output choice
Two-good output choiceJulio Huato
 
Intecon micro review 1
Intecon micro review 1Intecon micro review 1
Intecon micro review 1Julio Huato
 
Applied Statistics - Introduction
Applied Statistics - IntroductionApplied Statistics - Introduction
Applied Statistics - IntroductionJulio Huato
 
Summation Operator
Summation OperatorSummation Operator
Summation OperatorJulio Huato
 
Inputs output costs
Inputs output costsInputs output costs
Inputs output costsJulio Huato
 
Slides money banking risk reward capm
Slides money banking risk reward capmSlides money banking risk reward capm
Slides money banking risk reward capmJulio Huato
 
Slides money banking time value
Slides money banking time valueSlides money banking time value
Slides money banking time valueJulio Huato
 

Plus de Julio Huato (20)

Uncertainty, Risk, and Risk Management
Uncertainty, Risk, and Risk ManagementUncertainty, Risk, and Risk Management
Uncertainty, Risk, and Risk Management
 
Int Econ BoP
Int Econ BoPInt Econ BoP
Int Econ BoP
 
Hecksher-Ohlin model
Hecksher-Ohlin modelHecksher-Ohlin model
Hecksher-Ohlin model
 
Ricardian model
Ricardian modelRicardian model
Ricardian model
 
Int econ bases_trade
Int econ bases_tradeInt econ bases_trade
Int econ bases_trade
 
Fin sys
Fin sysFin sys
Fin sys
 
A two-good economy
A two-good economyA two-good economy
A two-good economy
 
Two-good output choice
Two-good output choiceTwo-good output choice
Two-good output choice
 
Intecon micro review 1
Intecon micro review 1Intecon micro review 1
Intecon micro review 1
 
Applied Statistics - Introduction
Applied Statistics - IntroductionApplied Statistics - Introduction
Applied Statistics - Introduction
 
Summation Operator
Summation OperatorSummation Operator
Summation Operator
 
Inputs output costs
Inputs output costsInputs output costs
Inputs output costs
 
Risk
RiskRisk
Risk
 
Elasticity
ElasticityElasticity
Elasticity
 
Slides money banking risk reward capm
Slides money banking risk reward capmSlides money banking risk reward capm
Slides money banking risk reward capm
 
Slides money banking time value
Slides money banking time valueSlides money banking time value
Slides money banking time value
 
Market model
Market modelMarket model
Market model
 
Macro review
Macro reviewMacro review
Macro review
 
Micro review 2
Micro review 2Micro review 2
Micro review 2
 
Micro review
Micro reviewMicro review
Micro review
 

Dernier

psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 

Dernier (20)

psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 

Statistics - Probability theory 1

  • 1. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Applied Statistics for Economics 2. Introduction to Probability Theory SFC - juliohuato@gmail.com Spring 2012 SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 2. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 3. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Topics The main topics in this chapter are: random variables and probability distributions, expected values: mean and variance, and Two random variables jointly considered. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 4. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability The world in motion is viewed as a set of random processes or random experiments. Randomness means that, no matter how much our understanding of the world may advance, there is always an element of ignorance or uncertainty in such understanding. In other words: given specific causes, we don’t know fully which states of the world will result. Or, given specific states of the world, we don’t know fully what specifically caused such states of the world. In other words, we are uncertain or – more plainly said – ignorant about the specific causes or, alternatively, effects involved in these processes. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 5. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability Examples of random processes: Your meeting the next person, SFC students commuting to school, residents of the U.S. producing new goods in a given year, etc. Why are they random? Because we are uncertain about the gender or the age of the next person you’ll meet, the commuting time of SFC students or the means of transportation they will use, the annual gross domestic product in the U.S. or its composition, etc. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 6. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability The mutually exclusive possible results of these experiments are called the outcomes. E.g. the next person you’ll meet could be female or male, young or old; SFC students may take a few or many minutes to commute to school; U.S. annual GDP may go up or down by some amount compared to the previous year, etc. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 7. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability Probability: the degree of belief that the outcome of an experiment will be a particular one. How to decide which probability to assign to a particular outcome of an experiment (e.g. that if you meet another person, the gender of such person will be female)? How to make this decision in a well-informed, disciplined, scientific way?1 One can only use experience – individual or collective – i.e. history. We may keep record of the gender of the people we meet over time and use the data compiled to inform our belief or look at records on the gender composition of the local population, etc. 1 In a sense, the whole purpose of statistics is to determine probabilities or, alternatively, expectations based on probabilities. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 8. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Sample space, event Sample space or population: the set of all the possible outcomes of the experiment. E.g. the sample space of the experiment of flipping a coin once is: S = {H, T }.2 Event: a subset of the sample space, i.e. a set of one or more outcomes. E.g. the event (M ≤ 1) that your car will “need one repair at most” includes “no repairs” (M = 0) and “one repair” (M = 1). 2 We rule out ‘freak’ possibilities, like the coin landing on its edge. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 9. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Random variables Random variable (r.v.): a numerical summary of a random outcome. For example, G = g , where (e.g.) g is 0 if ‘male’ and 1 if ‘female’. The number of times your car needs repair during a given month: M = m, where m = 0, 1, 2, 3, . . .. The time it takes for SFC students to commute to school: T = t, where t is time in minutes. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 10. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Random variables There are discrete and continuous random variables. Gender, summarized as a 0 if ‘male’ and 1 if ‘female’, and the number of car repairs in a month are discrete random variables. The commuting time, if recorded in fractions of an hour – or even fractions of minutes and seconds, etc. can be regarded as a continuous r.v. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 11. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a discrete r.v. The probability distribution of a discrete r.v. is a list of all the values of the r.v. and the probabilities associated to each value of the r.v. By convention, the probabilities are a number between 0 and 1, where 0 means impossibility and 1 means full certainty; the probabilities over the sample space must add up to 1. E.g. let G = 0, 1 be the r.v. ‘gender of the next person you’ll meet’. Then: G Pr(G = g ) 0 0.45 1 0.55 SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 12. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a discrete r.v. Using the information in the probability distribution, you can compute the probability of a given event. E.g. the probability that you’ll meet ‘a male or a female’: Pr(G = 0 or G = 1) = Pr(G = 0)+Pr(G = 1) = 0.45+0.55 = 1 = 100%. In words, we are completely certain that you’ll meet either a male or a female the next time you meet a person. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 13. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a discrete r.v. Admittedly, the previous example is trivial. But consider the probability distribution of your car needing repair(s) in a given month. The r.v. ‘number of repairs’ needed is denoted as M: M Pr(M = m) 0 0.80 1 0.10 2 0.06 3 0.03 4 0.01 What’s the probability that the the car will need one or two repairs in a month? Answer: Pr(M = 1 or M = 2) = Pr(M = 1)+Pr(M = 2) = 0.10+0.06 = 0.16 = 16%. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 14. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a discrete r.v. The cumulative probability distribution (also known as a ‘cumulative distribution function’ or c.d.f.) is the probability that the random variable is less than or equal to a particular value. The first two columns of the following table are the same as in the previous table. The last column gives the c.d.f.: M Pr(M = m) Pr(M ≤ m) 0 0.80 0.80 1 0.10 0.90 2 0.06 0.96 3 0.03 0.99 4 0.01 1.00 SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 15. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a discrete r.v. A binary discrete r.v. (e.g. G = 0, 1) is called a Bernoulli r.v. The Bernoulli distribution is: 1 with probability p G= 0 with probability 1 − p where p is the probability of the next person being ‘female’. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 16. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Probability distribution of a continuous r.v. The cumulative probability distribution of a continuous r.v. is also the probability that the random variable is less than or equal to a particular value. The probability density function (p.d.f.) of a continuous random variable summarizes the probabilities for each value of the random variable. The mathematical description of the p.d.f. of a continuous variable requires that you’re familiar with calculus. So, we’ll skip it for now. NB: Strictly speaking, the probability that a continuous random variable has a particular value is zero. We can only speak of the probability of the random variable falling in a range (between two given values). NB2: The p.d.f. and the c.d.f. show the same information in different formats. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 17. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Characteristics of a r.v. distribution In the practice of statistics, two basic measures are used extensively to characterize the distribution of a r.v.: the expected value or mean (or average) and the variance. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 18. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Expected value The expected value of a r.v. X or E (X ) is the average value of the variable over many repeated trials. It is computed as a weighted average of the possible outcomes, where the weights are the probabilities of the outcomes. It is also called the mean of X and denoted by µX . For a discrete r.v.: k E (X ) = x1 p1 + x2 p2 + · · · + xk pk = x i pi i=1 E.g.: You loan $100 to your friend for a year at 10% interest. There’s a 99% chance he’ll repay the loan and 1% he won’t. What’s the expected value of your loan at maturity? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 19. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Expected value E.g.: You loan $100 to your friend for a year at 10% interest. There’s a 99% chance he’ll repay the loan and 1% he won’t. What’s the expected value of your loan at maturity? Answer: ($110 × 0.99) + ($0 × .01) = $108.90 E.g.: What’s the expected value or average of the number of car repairs per month? See the table above. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 20. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Expected value E.g.: What’s the expected value or average of the number of car repairs per month (M)? See the table above. Answer: E (M) = (0 × 0.80) + (1 × 0.10) + (2 × 0.06)+ (3 × 0.03) + (4 × 0.01) = 0.35 What does that mean? E.g.: In general, what’s the expected value of a Bernoulli r.v. with Pr(G = 1) = p and Pr(G = 0) = 1 − p? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 21. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Expected value E.g.: In general, what’s the expected value of a Bernoulli r.v. with Pr(G = 1) = p and Pr(G = 0) = 1 − p? Answer: E (G ) = (1 × p) + (0 × (1 − p)) = p Note 1: Think of the operator E (.) as a function that transforms data on a variable by multiplying each value of the variable by its probability and then adding up all the products. Note 2: The formula for the expected value of a continuous r.v. requires calculus. So we’ll skip it for now. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 22. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Variance and standard deviation The variance of a r.v. Y is: var(Y ) = σY = E [(Y − µY )2 ] 2 The standard deviation is the positive square root of the variance σY : 2 s.d.(Y ) = σY = + σY Basically, the s.d. gives the same information as the variance, but in units that are easier to understand. The units of the standard deviation are the same units as Y and µY . What is the intuition behind the variance and/or the standard deviation? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 23. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Variance and standard deviation For a discrete r.v.: k var(Y ) = σY = E [(Y − µY )2 ] = 2 (yi − µY )2 pi i=1 k s.d.(Y ) = σY = (yi − µY )2 pi i=1 E.g.: What are the var. and s.d. of the number of car repairs per month (M)? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 24. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Variance and standard deviation E.g.: What are the var. and s.d. of the number of car repairs per month (M)? Answer: var(M) = [(0−0.35)2 ×0.80]+[(1−0.35)2 ×0.10]+[(2−0.35)2 ×0.06] +[(3 − 0.35)2 × 0.03] + [(4 − 0.35)2 × 0.01] = 0.6475 √ s.d.(M) = 0.6475 ∼ 0.80 = E.g.: What are the var. and s.d. of a Bernoulli r.v.? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 25. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Variance E.g.: What are the var. and s.d. of a Bernoulli r.v.? Answer: var(G ) = [(0 − p)2 × (1 − p)] + [(1 − p)2 × p] = p(1 − p) s.d.(G ) = p(1 − p) Note 1: Think of the operator var(.) as a function that transforms data on a variable by taking the distance or difference between each value of the variable and its mean, squaring that difference, multiplying it by the respective probability, and then adding up all the products. Note 1: Think of the operator s.d.(.) as a function that transforms data on a variable by taking the distance or difference between each value of the -variable and its mean, Statistics for Economicsdifference, to Probability SFC juliohuato@gmail.com Applied squaring that 2. Introduction
  • 26. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Moments More formally, in statistics, the characteristics of the distribution of a r.v. are called moments. E (Y ) is the first moment, E (Y 2 ) is the second moment, and E (Y r ) is the r th moment of Y . The first moment is the mean and it is a measure of the center of the distribution, the second moment is a measure of its dispersion or spread, and r -th moments for r > 2 measure other aspects of the distribution’s shape. Clearly, the second moment of the distribution is intimately related to the variance. How? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 27. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Moments Two other measures of the shape (using higher moments) of a distribution are: Skewness: E [(Y − µY )3 ] Skewness = 3 σY For a symmetric distribution, the skewness is zero. If the distribution has a long left tail, the skewness is negative. If the distribution has a long right tail, the skewness is positive. Kurtosis: E [(Y − µY )4 ] Kurtosis = 4 σY For a distribution with heavy tails (outliers are likely), the kurtosis is to Probability SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction
  • 28. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Mean of a linear function of a r.v. Consider the income tax schedule: Y = 2, 000 + 0.8X where X is pre-tax earnings and Y is after-tax earnings. What is the marginal tax rate? Suppose an individual’s next year pre-tax earnings are a r.v. with mean 2 µX and variance σX . Since her pre-tax earnings are random, her after-tax earnings are random as well. With the following mean: E (Y ) = µY = 2, 000 + 0.8µX Why? Remember that the operator E (Y ) means “multiply each value of Y by its probability and add up the results.” SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 29. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Variance of a linear function of a r.v. In turn, the variance of Y is: var(Y ) = σY = E [(Y − µY )2 ]. 2 Since Y = 2, 000 + 0.8X , then (Y − µY ) = (2, 000 + 0.8X ) − (2, 000 + 0.8µX ) = 0.8(X − µX ). Therefore: E (Y − µY )2 = E {[0.8(X − µX )2 ]} = 0.64E [(X − µX )2 ]. 2 2 That is: σY = 0.64σX . And taking the positive square root of that number: σY = 0.8σX SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 30. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Mean and var. of a linear function of a r.v. More generally, if X and Y are r.v.’s related by Y = a + bX , then: µY = a + bµX 2 σY = b 2 σY 2 σY = bσY SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 31. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Two random variables We now deal with the distribution of two random variables considered together. The joint probability distribution of two random variables X and Y is the probability that the random variables take certain values at once or Pr (X = x, Y = y ). The marginal probability distribution of a random variable Y is its probability distribution in the context of its relationship with (an)other variable(s). SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 32. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Multi-variate distributions The following table shows relative frequencies (probabilities): Joint distribution of weather conditions and commuting times Rain (X = 0) No rain (X = 1) Total Long commute (Y = 0) 0.15 0.07 0.22 Short commute (Y = 1) 0.15 0.63 0.78 Total 0.30 0.70 1.00 The cells show the joint probabilities. The marginal probabilities (the marginal distribution) of Y can be calculated from the joint distribution of X and Y . If X can take l different values x1 , . . . , xl , then: l Pr (Y = y ) = Pr (X = xi , Y = y ) i=1 SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 33. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Conditional distribution The conditional probability that Y takes the value y when X is known to take the value x is written Pr (Y = y |X = x). The conditional distribution of Y given X = x is: Pr (X = x, Y = y ) Pr (Y = y |X = x) = Pr (X = x) SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 34. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Conditional mean Consider the following table: Joint and conditional distribution of M and A M =0 M =1 M =2 M =3 M =4 Total Joint distribution Old car (A = 0) 0.35 0.065 0.05 0.025 0.01 0.50 New car (A = 1) 0.45 0.035 0.01 0.005 0.00 0.50 Total 0.8 0.1 0.06 0.03 0.01 1.00 Conditional distribution Pr(M | A = 0) 0.70 0.13 0.10 0.05 0.02 1.00 Pr(M | A = 1) 0.90 0.07 0.02 0.01 0.00 1.00 The conditional expectation of Y given X (or conditional mean of Y given X ) is the mean of the conditional distribution of Y given X. k E (Y |X = x) = yi Pr (Y = yi |X = x). i=1 SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 35. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Law of iterated expectations The mean height of adults is the weighted average of the mean height of men and the mean height of women, weighted by the proportions of men and women. More generally: l E (Y ) = E (Y |X = xi )Pr (X = xi ). i=1 In other terms: E (Y ) = E [E (Y |X )]. This is called the law of iterated expectations. If E (Y |X ) = 0 then E (Y ) = 0. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 36. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Conditional variance The variance of Y conditional on X is the variance of the conditional distribution of Y given X : k var(Y |X = x) = [yi − E (Y |X = x)]2 Pr (Y = yi |X = x). i=1 Example. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 37. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Independence Two r.v.’s X and Y are independently distributed (i.e. independent) if knowing the value of one of them gives no information about the other, that is, if the conditional distribution of Y given X equals the marginal distribution of Y . Formally, X and Y are independent if, for all values x and y , Pr(Y = y |X = x) = Pr(Y = y ) or Pr(X = x, Y = y ) = Pr(X = x) Pr(Y = y ) In other words, the joint distribution of X and Y is the product of their marginal distributions. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 38. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Covariance The covariance between two r.v.’s X and Y measures the extent to which they move together. The covariance is the expected value of the product of the deviations of the variables from their expected values. The first equation below is the general formula of the covariance. The second equation is specific to discrete r.v.’s and it assumes that X can take on l values and Y can take on k values: cov(X , Y ) = σXY = E [(X − µX )(Y − µY )] k l cov(X , Y ) = (xj − µX )(yi − µY ) Pr(X = xj , Y = yi ). i=1 j=1 Note that −∞ < σXY < +∞. How do you interpret the covariance formula? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 39. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Correlation The problem with the covariance is that it is not bounded. Its size depends on the units of X and Y and is, thus, hard to interpret. The correlation between X and Y is another measure of their covariation. But, unlike the covariance, the correlation eliminates the ‘units’ problem. Its formula is: cov(X , Y ) σXY corr(X , Y ) = = var(X ) var(Y ) σX σY Note that −1 ≤ corr(X , Y ) ≤ 1. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 40. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Correlation and conditional mean If E (Y |X = x) = E (Y ) = µY , then X and Y are uncorrelated. That is, cov(X , Y ) = 0 and cov(X , Y ) = 0. This follows from the law of iterated expectations. SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 41. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Mean and variance of sums of r.v.’s The mean of the sum of two r.v.’s, X and Y , is the sum of their means: E (X + Y ) = E (X ) + E (Y ) = µX + µY The variance of the sum of X and Y is the sum of their variance plus twice their covariance: 2 2 var(X + Y ) = var(X ) + var(Y ) + 2cov(X , Y ) = σX + σY + 2σXY If X and Y are independent, the covariance is zero and the variance of their sum is the sum of their variances: 2 2 var(X + Y ) = var(X ) + var(Y ) = σX + σY Why? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability
  • 42. Outline Topics Probability, sample space, random variable Probability distribution Expected value Variance Moments Linear transformations of random variables Joint distributions Sums of r.v.’s Let X , Y , and V be r.v.’s and a, b, and c be constants. These facts follow from the definitions of mean, variance, covariance, and correlation: E (a + bX + cY ) = a + bµX + cµY var(a + bY ) = b 2 σY 2 var(aX + bY ) = a2 σX + 2abσXY + b 2 σY 2 2 E (Y 2 ) = σY + µ2 2 Y cov(a + bX + cV , Y ) E (XY ) = σXY + µX µY |σXY | ≤ 2 2 σX σY Can you prove them? SFC - juliohuato@gmail.com Applied Statistics for Economics 2. Introduction to Probability