SlideShare une entreprise Scribd logo
1  sur  25
3.1 Sampling Distributions
Contents:

                 Sampling Distributions
1.   Populations and Samples
2.   The Sampling Distribution of the mean ( known)
3.   The Sampling Distribution of the mean ( unknown)
4.   The Sampling Distribution of the Variance
Populations and Samples


                Finite
              Population
Population
               Infinite
              Population
Populations and Samples
Population: A set or collection of all the objects, actual
  or conceptual and mainly the set of numbers,
  measurements or observations which are under
  investigation.
Finite Population : All students in a College
Infinite Population : Total water in the sea or all the
  sand particle in sea shore.
Populations are often described by the distributions of
  their values, and it is common practice to refer to a
  population in terms of its distribution.
Finite Populations
• Finite populations are described by the actual
  distribution of its values and infinite populations are
  described by corresponding probability distribution or
  probability density.
• “Population f(x)” means a population is described
  by a frequency distribution, a probability distribution
  or a density f(x).
Infinite Population
If a population is infinite it is impossible to observe all
   its values, and even if it is finite it may be impractical
   or uneconomical to observe it in its entirety. Thus it is
   necessary to use a sample.
Sample: A part of population collected for investigation
   which needed to be representative of population and
   to be large enough to contain all information about
   population.
Random Sample (finite population):
• A set of observations X1, X2, …,Xn constitutes a
  random sample of size n from a finite
  population of size N, if its values are chosen so
  that each subset of n of the N elements of the
  population has the same probability of being
  selected.
Random Sample (infinite Population):
A set of observations X1, X2, …, Xn constitutes a random sample
     of size n from the infinite population ƒ(x) if:
      1. Each Xi is a random variable whose distribution is given
     by ƒ(x)
     2. These n random variables are independent.

We consider two types of random sample: those drawn with
   replacement and those drawn without replacement.
Sampling with replacement:
• In sampling with replacement, each object
  chosen is returned to the population before the
  next object is drawn. We define a random
  sample of size n drawn with replacement, as
  an ordered n-tuple of objects from the
  population, repetitions allowed.
The Space of random samples drawn with
              replacement:
 • If samples of size n are drawn with replacement from a
   population of size N, then there are Nn such samples. In any
   survey involving sample of size n, each of these should have
   same probability of being chosen. This is equivalent to making
   a collection of all Nn samples a probability space in which each
   sample has probability of being chosen 1/Nn.
 • Hence in above example There are 32 = 9 random samples of
   size 2 and each of the 9 random sample has probability 1/9 of
   being chosen.
Sampling without replacement:
In sampling without replacement, an object chosen is not returned
   to the population before the next object is drawn. We define
   random sample of size n, drawn without replacement as an
   unordered subset of n objects from the population.
The Space of random samples drawn
        without replacement:
• If sample of size n are drawn without
  replacement from a population of size N, then
            N    
  there are 
             n
               such samples. The collection of
                  
                  
                  

  all random samples drawn without
  replacement can be made into a probability
  space in which each sample has same chance
  of being selected.
Mean and Variance
If X1, X2, …, Xn constitute a random sample, then
                                        n

                                             X   i
                                       i 1
                        X 
                                              n
               is called the sample mean and
                                 n

                                      (X i  X )
                                                      2

                                i 1
                            
                        2
                    S
                                            n 1

                is called the sample variance.
Sampling distribution:
• The probability distribution of a random variable
  defined on a space of random samples is called a
  sampling distribution.
The Sampling Distribution of the Mean ( Known)

 Suppose that a random sample of n observations has been taken from
    some population and x has been computed, say, to estimate the mean
    of the population. If we take a second sample of size n from this
    population we get some different value for x . Similarly if we take
    several more samples and calculate x , probably no two of the x ' s     .
    would be alike. The difference among such x ' s are generally attributed
    to chance and this raises important question concerning their
    distribution, specially concerning the extent of their chance of
    fluctuations.
  Let  X and 
                  2
                  X
                      be mean and variance for sampling    distributi on of the
  mean X .
The Sampling Distribution of the Mean ( Known)

                           2
  Formula for μ X and σ X :
  Theorem 1: If a random sample of size n is taken from a population
  having the mean  and the variance 2, then X is a random variable
  whose distribution has the mean  .
                                                                             
                                                                               2

  For samples from infinite populations the variance of this distribution is     .
                                                                                           n
                                                                               N n
                                                                        2

  For samples from a finite population of size N the variance is n                    .
                                                                                N 1

                                      2
                                                  (for infinite   Population              )
                                      n
   That is  X   and             
                               2
                               X
                                        N n
                                         2
                                                   (for finite Population             )
                                      n N 1
                                     
The Sampling Distribution of the Mean ( Known)

 Proof of    for infinite population for the
                  X


   continuous
 case: From the definition we have
                      

     X      .....  x          f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n
                    
                            n
                                    xi
               .....            n
                                         f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n
                      i 1

                 n                 
            1
        
            n
                 .....  x            i
                                             f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n
                i 1            
The Sampling Distribution of the Mean ( Known


Where f(x1, x2, … , xn) is the joint density function of the random variables
which constitute the random sample. From the assumption of random sample
(for infinite population) each Xi is a random variable whose density function
is given by f(x) and these n random variables are independent, we can write
                   f(x1, x2, … , xn) = f(x1) f(x2) …… f(xn)
and we have
                    n               
               1
       X 
               n
                    .....  x          i
                                              f ( x1 ) f ( x 2 ).... f ( x n )dx 1 dx 2 .... dx n
                   i 1          

                    n                                                  
               1
           
               n
                            f ( x1 ) dx 1 ...  x i f ( x i ) dx i ....  f ( x n ) dx n
                   i 1                                             
The Sampling Distribution of the Mean ( Known)

Since each integral except the one with the integrand xi f(xi) equals 1 and the
one with the integrand xi f(xi) equals to , so we will have
                             n
                        1              1
                 X 
                        n
                                 
                                       n
                                           n   .
                            i 1

Note: for the discrete case the proof follows the same steps, with integral sign
replaced by  ' s.
For the proof of  X   2 / n we require the following result
                    2


Result: If X is a continuous random variable and Y = X - X , then Y = 0
   and hence  Y2   X .
                        2


Proof: Y = E(Y) = E(X - X) = E(X) - X = 0 and 
       Y2 = E[(Y - Y)2] = E [((X - X ) - 0)2] = E[(X - X)2] = X2.
The Sampling Distribution of the Mean ( Known)

• Regardless of the form of the population distribution, the
  distribution of     is approximately normal with mean  and
  variance 2/n whenever n is large.
• In practice, the normal distribution provides an excellent
                    X
  approximation to the sampling distribution of the mean
  for n as small as 25 or 30, with hardly any restrictions on the
  shape of the population.
• If the random samples come from a normal population, the
               X
  sampling distribution of the mean is normal regardless of
  the size of the sample.
The Sampling Distribution of the mean
               ( unknown)
• Application of the theory of previous section requires knowledge of the
  population standard deviation  .
• If n is large, this does not pose any problems even when  is unknown, as
  it is reasonable in that case to use for it the sample standard deviation s.
• However, when it comes to random variable whose values are given by
           very little is known about its exact sampling distribution for

   small values of n unless we make the assumption that the sample comes
   from a normal population.

                                 X  
                                           ,
                                 S /   n
The Sampling Distribution of the mean
            ( unknown)
 Theorem : If     is the mean of a random sample of size n taken from a
   normal population having the mean  and the variance 2, and
             X



                      (Xi  X )
                n                 2

                                    , then
       2
   S
               i 1     n 1
                                       X 
                               t
                       S/ n
is a random variable having the t distribution with the parameter  = n – 1.
This theorem is more general than Theorem 6.2 in the sense that it does not
require knowledge of  ; on the other hand, it is less general than Theorem 6.2
in the sense that it requires the assumption of a normal population.
The Sampling Distribution of the mean
           ( unknown)
• The t distribution was introduced by William S.Gosset in
  1908, who published his scientific paper under the pen name
  “Student,” since his company did not permit publication by
  employees. That’s why t distribution is also known as the
  Student-t distribution, or Student’s           t distribution.
• The shape of t distribution is similar to that of a normal
  distribution i.e. both are bell-shaped and symmetric about the
  mean.
• Like the standard normal distribution, the t distribution has the
  mean 0, but its variance depends on the parameter , called the
  number of degrees of freedom.
The Sampling Distribution of the mean
           ( unknown)


                                      t ( =10)
                  Normal




                           t ( =1)


 Figure: t distribution and standard normal distributions
The Sampling Distribution of the mean
              ( unknown)
• When   , the t distribution approaches the standard
  normal distribution i.e. when   , t  z.
• The standard normal distribution provides a good
  approximation to the t distribution for samples of size 30 or
  more.

Contenu connexe

Tendances

The sampling distribution
The sampling distributionThe sampling distribution
The sampling distribution
Harve Abella
 

Tendances (20)

Sampling Distributions
Sampling DistributionsSampling Distributions
Sampling Distributions
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
The sampling distribution
The sampling distributionThe sampling distribution
The sampling distribution
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Probability
ProbabilityProbability
Probability
 
z-test
z-testz-test
z-test
 
Hypergeometric probability distribution
Hypergeometric probability distributionHypergeometric probability distribution
Hypergeometric probability distribution
 
T test and types of t-test
T test and types of t-testT test and types of t-test
T test and types of t-test
 
Sampling theory
Sampling theorySampling theory
Sampling theory
 
Correlation
CorrelationCorrelation
Correlation
 
Basic concepts of probability
Basic concepts of probabilityBasic concepts of probability
Basic concepts of probability
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTION
 
Testing Hypothesis
Testing HypothesisTesting Hypothesis
Testing Hypothesis
 
Probability Distributions
Probability DistributionsProbability Distributions
Probability Distributions
 
Discrete probability distributions
Discrete probability distributionsDiscrete probability distributions
Discrete probability distributions
 
Categorical data analysis
Categorical data analysisCategorical data analysis
Categorical data analysis
 
Lecture2 hypothesis testing
Lecture2 hypothesis testingLecture2 hypothesis testing
Lecture2 hypothesis testing
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: Estimation
 
Basic Probability
Basic Probability Basic Probability
Basic Probability
 

En vedette

13-2 Frequency Tables and Counting Principle.pdf
13-2 Frequency Tables and Counting Principle.pdf13-2 Frequency Tables and Counting Principle.pdf
13-2 Frequency Tables and Counting Principle.pdf
bwlomas
 
10 8 geometric probability
10 8 geometric probability10 8 geometric probability
10 8 geometric probability
bwlomas
 
Geometric Distribution
Geometric DistributionGeometric Distribution
Geometric Distribution
guestbc6c0e
 
13-6 Conditional Probability2.pdf
13-6 Conditional Probability2.pdf13-6 Conditional Probability2.pdf
13-6 Conditional Probability2.pdf
bwlomas
 
13-4 Probability of Multiple Events.pdf
13-4 Probability of Multiple Events.pdf13-4 Probability of Multiple Events.pdf
13-4 Probability of Multiple Events.pdf
bwlomas
 
13-3 Combination Permutation.pdf
13-3 Combination Permutation.pdf13-3 Combination Permutation.pdf
13-3 Combination Permutation.pdf
bwlomas
 
Binomial probability distributions ppt
Binomial probability distributions pptBinomial probability distributions ppt
Binomial probability distributions ppt
Tayab Ali
 
Normal distribution and sampling distribution
Normal distribution and sampling distributionNormal distribution and sampling distribution
Normal distribution and sampling distribution
Mridul Arora
 

En vedette (10)

13-2 Frequency Tables and Counting Principle.pdf
13-2 Frequency Tables and Counting Principle.pdf13-2 Frequency Tables and Counting Principle.pdf
13-2 Frequency Tables and Counting Principle.pdf
 
Simulation
SimulationSimulation
Simulation
 
10 8 geometric probability
10 8 geometric probability10 8 geometric probability
10 8 geometric probability
 
Geometric Distribution
Geometric DistributionGeometric Distribution
Geometric Distribution
 
Poisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionPoisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric Distribution
 
13-6 Conditional Probability2.pdf
13-6 Conditional Probability2.pdf13-6 Conditional Probability2.pdf
13-6 Conditional Probability2.pdf
 
13-4 Probability of Multiple Events.pdf
13-4 Probability of Multiple Events.pdf13-4 Probability of Multiple Events.pdf
13-4 Probability of Multiple Events.pdf
 
13-3 Combination Permutation.pdf
13-3 Combination Permutation.pdf13-3 Combination Permutation.pdf
13-3 Combination Permutation.pdf
 
Binomial probability distributions ppt
Binomial probability distributions pptBinomial probability distributions ppt
Binomial probability distributions ppt
 
Normal distribution and sampling distribution
Normal distribution and sampling distributionNormal distribution and sampling distribution
Normal distribution and sampling distribution
 

Similaire à Sampling Distributions

Intro probability 4
Intro probability 4Intro probability 4
Intro probability 4
Phong Vo
 

Similaire à Sampling Distributions (20)

Hypergeometric Distribution
Hypergeometric DistributionHypergeometric Distribution
Hypergeometric Distribution
 
Hypergeometric Distribution
Hypergeometric DistributionHypergeometric Distribution
Hypergeometric Distribution
 
Basics of probability in statistical simulation and stochastic programming
Basics of probability in statistical simulation and stochastic programmingBasics of probability in statistical simulation and stochastic programming
Basics of probability in statistical simulation and stochastic programming
 
sampling distribution
sampling distributionsampling distribution
sampling distribution
 
lecture4.pdf
lecture4.pdflecture4.pdf
lecture4.pdf
 
Doe02 statistics
Doe02 statisticsDoe02 statistics
Doe02 statistics
 
Ssp notes
Ssp notesSsp notes
Ssp notes
 
Intro probability 4
Intro probability 4Intro probability 4
Intro probability 4
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Statistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: ModelsStatistics (1): estimation, Chapter 1: Models
Statistics (1): estimation, Chapter 1: Models
 
ISM_Session_5 _ 23rd and 24th December.pptx
ISM_Session_5 _ 23rd and 24th December.pptxISM_Session_5 _ 23rd and 24th December.pptx
ISM_Session_5 _ 23rd and 24th December.pptx
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of variance
 
ttest_intro.pdf
ttest_intro.pdfttest_intro.pdf
ttest_intro.pdf
 
Talk 3
Talk 3Talk 3
Talk 3
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
random variation 9473 by jaideep.ppt
random variation 9473 by jaideep.pptrandom variation 9473 by jaideep.ppt
random variation 9473 by jaideep.ppt
 
Robustness under Independent Contamination Model
Robustness under Independent Contamination ModelRobustness under Independent Contamination Model
Robustness under Independent Contamination Model
 
Stochastic Differentiation
Stochastic DifferentiationStochastic Differentiation
Stochastic Differentiation
 
Probability and Statistics : Binomial Distribution notes ppt.pdf
Probability and Statistics : Binomial Distribution notes ppt.pdfProbability and Statistics : Binomial Distribution notes ppt.pdf
Probability and Statistics : Binomial Distribution notes ppt.pdf
 

Plus de mathscontent

Plus de mathscontent (13)

Interval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of ProportionInterval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of Proportion
 
Point Estimation
Point EstimationPoint Estimation
Point Estimation
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Bernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial DistributionBernoullis Random Variables And Binomial Distribution
Bernoullis Random Variables And Binomial Distribution
 
Gamma, Expoential, Poisson And Chi Squared Distributions
Gamma, Expoential, Poisson And Chi Squared DistributionsGamma, Expoential, Poisson And Chi Squared Distributions
Gamma, Expoential, Poisson And Chi Squared Distributions
 
Uniform Distribution
Uniform DistributionUniform Distribution
Uniform Distribution
 
Continuous Random Variables
Continuous Random VariablesContinuous Random Variables
Continuous Random Variables
 
Moment Generating Functions
Moment Generating FunctionsMoment Generating Functions
Moment Generating Functions
 
Mathematical Expectation And Variance
Mathematical Expectation And VarianceMathematical Expectation And Variance
Mathematical Expectation And Variance
 
Discrete Random Variables And Probability Distributions
Discrete Random Variables And Probability DistributionsDiscrete Random Variables And Probability Distributions
Discrete Random Variables And Probability Distributions
 
Theorems And Conditional Probability
Theorems And Conditional ProbabilityTheorems And Conditional Probability
Theorems And Conditional Probability
 
Probability And Its Axioms
Probability And Its AxiomsProbability And Its Axioms
Probability And Its Axioms
 
Sample Space And Events
Sample Space And EventsSample Space And Events
Sample Space And Events
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Sampling Distributions

  • 2. Contents: Sampling Distributions 1. Populations and Samples 2. The Sampling Distribution of the mean ( known) 3. The Sampling Distribution of the mean ( unknown) 4. The Sampling Distribution of the Variance
  • 3. Populations and Samples Finite Population Population Infinite Population
  • 4. Populations and Samples Population: A set or collection of all the objects, actual or conceptual and mainly the set of numbers, measurements or observations which are under investigation. Finite Population : All students in a College Infinite Population : Total water in the sea or all the sand particle in sea shore. Populations are often described by the distributions of their values, and it is common practice to refer to a population in terms of its distribution.
  • 5. Finite Populations • Finite populations are described by the actual distribution of its values and infinite populations are described by corresponding probability distribution or probability density. • “Population f(x)” means a population is described by a frequency distribution, a probability distribution or a density f(x).
  • 6. Infinite Population If a population is infinite it is impossible to observe all its values, and even if it is finite it may be impractical or uneconomical to observe it in its entirety. Thus it is necessary to use a sample. Sample: A part of population collected for investigation which needed to be representative of population and to be large enough to contain all information about population.
  • 7. Random Sample (finite population): • A set of observations X1, X2, …,Xn constitutes a random sample of size n from a finite population of size N, if its values are chosen so that each subset of n of the N elements of the population has the same probability of being selected.
  • 8. Random Sample (infinite Population): A set of observations X1, X2, …, Xn constitutes a random sample of size n from the infinite population ƒ(x) if: 1. Each Xi is a random variable whose distribution is given by ƒ(x) 2. These n random variables are independent. We consider two types of random sample: those drawn with replacement and those drawn without replacement.
  • 9. Sampling with replacement: • In sampling with replacement, each object chosen is returned to the population before the next object is drawn. We define a random sample of size n drawn with replacement, as an ordered n-tuple of objects from the population, repetitions allowed.
  • 10. The Space of random samples drawn with replacement: • If samples of size n are drawn with replacement from a population of size N, then there are Nn such samples. In any survey involving sample of size n, each of these should have same probability of being chosen. This is equivalent to making a collection of all Nn samples a probability space in which each sample has probability of being chosen 1/Nn. • Hence in above example There are 32 = 9 random samples of size 2 and each of the 9 random sample has probability 1/9 of being chosen.
  • 11. Sampling without replacement: In sampling without replacement, an object chosen is not returned to the population before the next object is drawn. We define random sample of size n, drawn without replacement as an unordered subset of n objects from the population.
  • 12. The Space of random samples drawn without replacement: • If sample of size n are drawn without replacement from a population of size N, then N  there are   n  such samples. The collection of    all random samples drawn without replacement can be made into a probability space in which each sample has same chance of being selected.
  • 13. Mean and Variance If X1, X2, …, Xn constitute a random sample, then n  X i i 1 X  n is called the sample mean and n  (X i  X ) 2 i 1  2 S n 1 is called the sample variance.
  • 14. Sampling distribution: • The probability distribution of a random variable defined on a space of random samples is called a sampling distribution.
  • 15. The Sampling Distribution of the Mean ( Known) Suppose that a random sample of n observations has been taken from some population and x has been computed, say, to estimate the mean of the population. If we take a second sample of size n from this population we get some different value for x . Similarly if we take several more samples and calculate x , probably no two of the x ' s . would be alike. The difference among such x ' s are generally attributed to chance and this raises important question concerning their distribution, specially concerning the extent of their chance of fluctuations. Let  X and  2 X be mean and variance for sampling distributi on of the mean X .
  • 16. The Sampling Distribution of the Mean ( Known) 2 Formula for μ X and σ X : Theorem 1: If a random sample of size n is taken from a population having the mean  and the variance 2, then X is a random variable whose distribution has the mean  .  2 For samples from infinite populations the variance of this distribution is . n  N n 2 For samples from a finite population of size N the variance is n  . N 1  2  (for infinite Population )  n That is  X   and    2 X  N n 2  (for finite Population )  n N 1 
  • 17. The Sampling Distribution of the Mean ( Known) Proof of    for infinite population for the X continuous case: From the definition we have    X    .....  x f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n      n xi    .....   n f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n    i 1 n    1  n    .....  x i f ( x1 , x 2 ,...., x n )dx 1 dx 2 .... dx n i 1     
  • 18. The Sampling Distribution of the Mean ( Known Where f(x1, x2, … , xn) is the joint density function of the random variables which constitute the random sample. From the assumption of random sample (for infinite population) each Xi is a random variable whose density function is given by f(x) and these n random variables are independent, we can write f(x1, x2, … , xn) = f(x1) f(x2) …… f(xn) and we have n    1 X  n    .....  x i f ( x1 ) f ( x 2 ).... f ( x n )dx 1 dx 2 .... dx n i 1      n    1  n   f ( x1 ) dx 1 ...  x i f ( x i ) dx i ....  f ( x n ) dx n i 1    
  • 19. The Sampling Distribution of the Mean ( Known) Since each integral except the one with the integrand xi f(xi) equals 1 and the one with the integrand xi f(xi) equals to , so we will have n 1 1 X  n   n n   . i 1 Note: for the discrete case the proof follows the same steps, with integral sign replaced by  ' s. For the proof of  X   2 / n we require the following result 2 Result: If X is a continuous random variable and Y = X - X , then Y = 0 and hence  Y2   X . 2 Proof: Y = E(Y) = E(X - X) = E(X) - X = 0 and  Y2 = E[(Y - Y)2] = E [((X - X ) - 0)2] = E[(X - X)2] = X2.
  • 20. The Sampling Distribution of the Mean ( Known) • Regardless of the form of the population distribution, the distribution of is approximately normal with mean  and variance 2/n whenever n is large. • In practice, the normal distribution provides an excellent X approximation to the sampling distribution of the mean for n as small as 25 or 30, with hardly any restrictions on the shape of the population. • If the random samples come from a normal population, the X sampling distribution of the mean is normal regardless of the size of the sample.
  • 21. The Sampling Distribution of the mean ( unknown) • Application of the theory of previous section requires knowledge of the population standard deviation  . • If n is large, this does not pose any problems even when  is unknown, as it is reasonable in that case to use for it the sample standard deviation s. • However, when it comes to random variable whose values are given by very little is known about its exact sampling distribution for small values of n unless we make the assumption that the sample comes from a normal population. X   , S / n
  • 22. The Sampling Distribution of the mean ( unknown) Theorem : If is the mean of a random sample of size n taken from a normal population having the mean  and the variance 2, and X (Xi  X ) n 2   , then 2 S i 1 n 1 X  t S/ n is a random variable having the t distribution with the parameter  = n – 1. This theorem is more general than Theorem 6.2 in the sense that it does not require knowledge of  ; on the other hand, it is less general than Theorem 6.2 in the sense that it requires the assumption of a normal population.
  • 23. The Sampling Distribution of the mean ( unknown) • The t distribution was introduced by William S.Gosset in 1908, who published his scientific paper under the pen name “Student,” since his company did not permit publication by employees. That’s why t distribution is also known as the Student-t distribution, or Student’s t distribution. • The shape of t distribution is similar to that of a normal distribution i.e. both are bell-shaped and symmetric about the mean. • Like the standard normal distribution, the t distribution has the mean 0, but its variance depends on the parameter , called the number of degrees of freedom.
  • 24. The Sampling Distribution of the mean ( unknown) t ( =10) Normal t ( =1) Figure: t distribution and standard normal distributions
  • 25. The Sampling Distribution of the mean ( unknown) • When   , the t distribution approaches the standard normal distribution i.e. when   , t  z. • The standard normal distribution provides a good approximation to the t distribution for samples of size 30 or more.