SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Stat310            Sequences of rvs


                            Hadley Wickham
Wednesday, 17 March 2010
Major’s day
                   2:30-4:30pm Today
                   Oshman Engineering Design Kitchen


                   Come along and talk to me (or Rudy
                   Guerra) if you’re interested in becoming a
                   stat major


Wednesday, 17 March 2010
Assessment

                   Test model answers online tonight
                   (hopefully)
                   Usual help session tonight 4-5pm.




Wednesday, 17 March 2010
1. Sequences
               2. Limits
               3. Chebyshev’s theorem
               4. The law of large numbers
               5. The central limit theorem



Wednesday, 17 March 2010
Sequences

                   1 variable: X
                   2 variables: X, Y
                   ...
                   n variables: X1, X2, X3, ..., Xn




Wednesday, 17 March 2010
Sequences
                   Xi ~ Normal(μi, σi)
                   Xi ~ Normal(μ, σi)
                   Xi ~ Normal(μi, σ)
                   Xi ~ Normal(μ, σ)
                   Almost always assume that the Xi’s are
                   independent. In the last case they are
                   also identically distributed.


Wednesday, 17 March 2010
iid = independent &
                identically distributed


Wednesday, 17 March 2010
Your turn

                   Xi are iid N(0, 2).
                   What is E(X30)? What is Var(X2001)?
                   What is Cor(X10, X11)? Cor(X1, X1000)?




Wednesday, 17 March 2010
n
                                               n
                                                
               E(                   Xi ) =           E(Xi )
                               i                 i
                           n
                                                n
                                                 
         V ar(                      ai Xi ) =         2
                                                     ai V   ar(Xi )
                               i                 i
                                                     If what is true?
                       n
                                           n
                                            
          E(                       Xi ) =        E(Xi )
                           i                 i        If what is true?
Wednesday, 17 March 2010
Limits
                   Typically will define some function of n
                                           ¯
                   random variables, e.g. Xn
                                   ¯
                   What happens to Xn when n → ∞?
                   Why? Because often it will converge, and
                   we can use this to approximate results for
                   any large n.



Wednesday, 17 March 2010
New notation

                   If xn → 0, and n is big, we can say xn ≈ 0.
                   If Xn → Z, Z ~ N(0, 1), and n is big,
                   we can say Xn ~ . N(0,1).

                   Read as approximately distributed.
                   Other ways to write it



Wednesday, 17 March 2010
N
                                       go

                                          o
                                        od
                                             lim art
                           Chebyshev




                                                it ing
                                            st

                                                  -b p
                                                     ut oin
                                                       a t
                                1
         P (|X − µ|  Kσ) ≥ 1 − 2
                               K
                             1
         P (|X − µ|  Kσ) ≤ 2
                            K
                                        For K  0
Wednesday, 17 March 2010
Your turn

                   How can you put this in words?
                                      1
                   P (|X − µ|  Kσ) ≤ 2
                                     K


Wednesday, 17 March 2010
The probability of being more
                               than K standard deviations
        80                     away from the mean is less
                               than one over K squared.
        60
                               (For K  0)
 1 K2




        40




        20




                 0         2     4           6     8       10
                                     K
Wednesday, 17 March 2010
(For K  1)
        1.0




        0.8




        0.6
 1 K2




        0.4




        0.2




        0.0

                           2   4       6   8           10
                                   K
Wednesday, 17 March 2010
Your turn

                   How does this compare to the normal
                   distribution? Compare the probability of
                   being less than 1, 2 and 3 standard
                   deviations away from the mean given by
                   Chebychev and what we know about the
                   normal.



Wednesday, 17 March 2010
1.0




         0.8




         0.6

                                                    variable
 value




                                                        cheby
                                                        norm
         0.4




         0.2




         0.0

                           2   4       6   8   10
                                   x
Wednesday, 17 March 2010
LLN
                   Law of large numbers
                   X1, X2, ..., Xn iid.

                           n
                           
                  ¯
                  Xn =          Xi
                            i


                   There are five ways to write the result.


Wednesday, 17 March 2010
What does it mean?
                   As we collect more and more data, the
                   sample mean gets closer and closer to
                   the true mean.
                   Not that surprising!
                   But note that we didn’t make any
                   assumptions about the distributions



Wednesday, 17 March 2010
CLT

                   Central limit theorem.
                   The distribution of a mean is normal when
                   gets big.




Wednesday, 17 March 2010
Approximation


                   This implies that if n is big then ...




Wednesday, 17 March 2010
Reading


                Section 4.1
                Focus on the general ideas and the
                defintions




Wednesday, 17 March 2010

Contenu connexe

En vedette

Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Chris Rucker
 
Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Hadley Wickham
 
R workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesR workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesVivian S. Zhang
 
Machine learning in R
Machine learning in RMachine learning in R
Machine learning in Rapolol92
 
4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply FunctionSakthi Dasans
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyrRomain Francois
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Ram Narasimhan
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rVivian S. Zhang
 

En vedette (20)

20 date-times
20 date-times20 date-times
20 date-times
 
04 Wrapup
04 Wrapup04 Wrapup
04 Wrapup
 
Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2
 
24 modelling
24 modelling24 modelling
24 modelling
 
21 spam
21 spam21 spam
21 spam
 
03 Conditional
03 Conditional03 Conditional
03 Conditional
 
Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)
 
Graphical inference
Graphical inferenceGraphical inference
Graphical inference
 
03 Modelling
03 Modelling03 Modelling
03 Modelling
 
R workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesR workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 series
 
23 data-structures
23 data-structures23 data-structures
23 data-structures
 
R packages
R packagesR packages
R packages
 
02 Ddply
02 Ddply02 Ddply
02 Ddply
 
01 Intro
01 Intro01 Intro
01 Intro
 
Reshaping Data in R
Reshaping Data in RReshaping Data in R
Reshaping Data in R
 
Machine learning in R
Machine learning in RMachine learning in R
Machine learning in R
 
4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyr
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
 

Plus de Hadley Wickham (20)

27 development
27 development27 development
27 development
 
27 development
27 development27 development
27 development
 
22 spam
22 spam22 spam
22 spam
 
19 tables
19 tables19 tables
19 tables
 
18 cleaning
18 cleaning18 cleaning
18 cleaning
 
17 polishing
17 polishing17 polishing
17 polishing
 
16 critique
16 critique16 critique
16 critique
 
15 time-space
15 time-space15 time-space
15 time-space
 
14 case-study
14 case-study14 case-study
14 case-study
 
13 case-study
13 case-study13 case-study
13 case-study
 
12 adv-manip
12 adv-manip12 adv-manip
12 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
10 simulation
10 simulation10 simulation
10 simulation
 
10 simulation
10 simulation10 simulation
10 simulation
 
09 bootstrapping
09 bootstrapping09 bootstrapping
09 bootstrapping
 
08 functions
08 functions08 functions
08 functions
 
07 problem-solving
07 problem-solving07 problem-solving
07 problem-solving
 
06 data
06 data06 data
06 data
 
05 subsetting
05 subsetting05 subsetting
05 subsetting
 

Dernier

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 

Dernier (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 

16 Sequences

  • 1. Stat310 Sequences of rvs Hadley Wickham Wednesday, 17 March 2010
  • 2. Major’s day 2:30-4:30pm Today Oshman Engineering Design Kitchen Come along and talk to me (or Rudy Guerra) if you’re interested in becoming a stat major Wednesday, 17 March 2010
  • 3. Assessment Test model answers online tonight (hopefully) Usual help session tonight 4-5pm. Wednesday, 17 March 2010
  • 4. 1. Sequences 2. Limits 3. Chebyshev’s theorem 4. The law of large numbers 5. The central limit theorem Wednesday, 17 March 2010
  • 5. Sequences 1 variable: X 2 variables: X, Y ... n variables: X1, X2, X3, ..., Xn Wednesday, 17 March 2010
  • 6. Sequences Xi ~ Normal(μi, σi) Xi ~ Normal(μ, σi) Xi ~ Normal(μi, σ) Xi ~ Normal(μ, σ) Almost always assume that the Xi’s are independent. In the last case they are also identically distributed. Wednesday, 17 March 2010
  • 7. iid = independent & identically distributed Wednesday, 17 March 2010
  • 8. Your turn Xi are iid N(0, 2). What is E(X30)? What is Var(X2001)? What is Cor(X10, X11)? Cor(X1, X1000)? Wednesday, 17 March 2010
  • 9. n n E( Xi ) = E(Xi ) i i n n V ar( ai Xi ) = 2 ai V ar(Xi ) i i If what is true? n n E( Xi ) = E(Xi ) i i If what is true? Wednesday, 17 March 2010
  • 10. Limits Typically will define some function of n ¯ random variables, e.g. Xn ¯ What happens to Xn when n → ∞? Why? Because often it will converge, and we can use this to approximate results for any large n. Wednesday, 17 March 2010
  • 11. New notation If xn → 0, and n is big, we can say xn ≈ 0. If Xn → Z, Z ~ N(0, 1), and n is big, we can say Xn ~ . N(0,1). Read as approximately distributed. Other ways to write it Wednesday, 17 March 2010
  • 12. N go o od lim art Chebyshev it ing st -b p ut oin a t 1 P (|X − µ| Kσ) ≥ 1 − 2 K 1 P (|X − µ| Kσ) ≤ 2 K For K 0 Wednesday, 17 March 2010
  • 13. Your turn How can you put this in words? 1 P (|X − µ| Kσ) ≤ 2 K Wednesday, 17 March 2010
  • 14. The probability of being more than K standard deviations 80 away from the mean is less than one over K squared. 60 (For K 0) 1 K2 40 20 0 2 4 6 8 10 K Wednesday, 17 March 2010
  • 15. (For K 1) 1.0 0.8 0.6 1 K2 0.4 0.2 0.0 2 4 6 8 10 K Wednesday, 17 March 2010
  • 16. Your turn How does this compare to the normal distribution? Compare the probability of being less than 1, 2 and 3 standard deviations away from the mean given by Chebychev and what we know about the normal. Wednesday, 17 March 2010
  • 17. 1.0 0.8 0.6 variable value cheby norm 0.4 0.2 0.0 2 4 6 8 10 x Wednesday, 17 March 2010
  • 18. LLN Law of large numbers X1, X2, ..., Xn iid. n ¯ Xn = Xi i There are five ways to write the result. Wednesday, 17 March 2010
  • 19. What does it mean? As we collect more and more data, the sample mean gets closer and closer to the true mean. Not that surprising! But note that we didn’t make any assumptions about the distributions Wednesday, 17 March 2010
  • 20. CLT Central limit theorem. The distribution of a mean is normal when gets big. Wednesday, 17 March 2010
  • 21. Approximation This implies that if n is big then ... Wednesday, 17 March 2010
  • 22. Reading Section 4.1 Focus on the general ideas and the defintions Wednesday, 17 March 2010