SlideShare une entreprise Scribd logo
1  sur  48
Statistics: First Steps Andrew Martin PS 372 University of Kentucky
Variance Variance  is a measure of dispersion of data points about the mean for interval- and ratio-level data. Variance is a fundamental concept that social scientists seek to explain in the dependent variable.
 
Standard Deviation Standard deviation  is a measure of dispersion of data points about the mean for interval- and ratio-level data.  Like the mean, standard deviation is sensitive to extreme values.  Standard deviation is calculated as the square root of the variance.
 
 
Normal Distribution ,[object Object],[object Object],[object Object],[object Object],[object Object]
Normal Distribution ,[object Object],[object Object],[object Object],[object Object]
Frequency Distribution
What about categorical variables?
 
Example Calculate the ID and IQV for this PS 372 class grades using the following frequencies or proportions: Grade Freq. Prop. A 4 (.12)‏ B 7 (.21)‏ C 4 (.12)‏ D 7 (.21)‏ E 12 (.34)
Index of Diversity ID = 1 – ( p 2 a  +  p 2 b  +  p 2 c  + p 2 d  + p 2 e )‏ ID = 1 - (.12 2  + .21 2  + .12 2  + .21 2  + .34 2 )‏ ID = 1 - (.0144 + .0441 + .0144 + .0441 + .1156)‏ ID = 1 - (.2326)‏ ID = .7674
Index of Qualitative Variation 1 – ( p 2 a  +  p 2 b  +  p 2 c  + p 2 d  + p 2 e )‏ 1 - (1/K)‏
Index of Qualitative Variation .7674 (1 – 1/5)‏ .9592
 
Data Matrix A  data matrix  is an array of rows and columns that stores the values of a set of variables for all the cases in a data set. This is frequently referred to as a dataset.
 
 
Data Matrix from JRM
Properties of Good Graphs Should answer several of the following questions: (JRM 384)‏ 1. Where does the center of the distribution lie? 2. How spread out or bunched up are the observations? 3. Does it have a single peak or more than one?  4. Approximately what proportion of observations in in the ends of the distributions?
Properties of Good Graphs 5. Do observations tend to pile up at one end of the measurement scale, with relatively few observations at the other end? 6. Are there values that, compared with most, seem very large or very small? 7. How does one distribution compare to another in terms of shape, spread, and central tendency? 8. Do values of one variable seem related to another variable?
 
 
 
 
 
Statistical Concepts Let's quickly review some concepts.
Population A  population  refers to any well-defined set of objects such as people, countries, states, organizations, and so on. The term doesn't simply mean the population of the United States or some other geographical area.
Population ,[object Object],[object Object],[object Object]
Populations ,[object Object],[object Object],[object Object],[object Object]
Two Kinds of Inference ,[object Object],[object Object]
Hypothesis Testing Many claims can be translated into specific statements about a population that can be confirmed or disconfirmed with the aid of probability theory. Ex: There is no ideological difference evident in the voting patterns of Republican and Democrat justices on the U.S. Supreme Court.
Point and Interval Estimation The goal here is to estimate unknown population parameters from samples and to surround those estimates with confidence intervals. Confidence intervals suggest the estimate's reliability or precision.
Hypothesis Testing Start with a specific verbal claim or proposition. Ex: The chances of getting heads or tails when flipping the coin is are roughly the same. Ex: The chances of the United States electing a Republican or Democrat president are roughly the same.
Hypothesis Testing Next, the researcher constructs a null hypothesis. A  null hypothesis  is a statement that a population parameter equals a specific value.
Hypothesis Testing Following up on the coin example, the null hypothesis would equal .5.  Stated more formally: H 0 :  P  = .5 Where  P  stands for the probability that the coin will be heads when tossed.  H 0  is  typically used to denote a null hypothesis.
Hypothesis Testing ,[object Object],[object Object],[object Object]
Hypothesis Testing ,[object Object],[object Object],[object Object]
Hypothesis Testing Perhaps you believe the coin is more likely to come up heads than tails. You would formulate the following alternative hypothesis: H A  :  P  > .5 Conversely, if you believe the coin is less likely to come up heads than tails, you would formulate the alternative hypothesis in the opposite direction: H A :  P  < .5
Hypothesis Testing ,[object Object],[object Object]
Hypothesis Testing ,[object Object],[object Object]
 
Hypothesis Testing ,[object Object],[object Object],[object Object],[object Object]
Hypothesis Testing ,[object Object],[object Object]
 
Hypothesis Testing ,[object Object],[object Object],[object Object],[object Object]
 

Contenu connexe

Tendances

Malimu descriptive statistics.
Malimu descriptive statistics.Malimu descriptive statistics.
Malimu descriptive statistics.
Miharbi Ignasm
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
bmcfad01
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aileen Balbido
 
Statistics 091208004734-phpapp01 (1)
Statistics 091208004734-phpapp01 (1)Statistics 091208004734-phpapp01 (1)
Statistics 091208004734-phpapp01 (1)
mandrewmartin
 
Chapter 05
Chapter 05Chapter 05
Chapter 05
bmcfad01
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
cartlidge
 

Tendances (20)

Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Frequency Tables & Univariate Charts
Frequency Tables & Univariate ChartsFrequency Tables & Univariate Charts
Frequency Tables & Univariate Charts
 
Statistics for Librarians, Session 2: Descriptive statistics
Statistics for Librarians, Session 2: Descriptive statisticsStatistics for Librarians, Session 2: Descriptive statistics
Statistics for Librarians, Session 2: Descriptive statistics
 
Psych stats Probability and Probability Distribution
Psych stats Probability and Probability DistributionPsych stats Probability and Probability Distribution
Psych stats Probability and Probability Distribution
 
Descriptive statistics ii
Descriptive statistics iiDescriptive statistics ii
Descriptive statistics ii
 
RSS probability theory
RSS probability theoryRSS probability theory
RSS probability theory
 
Malimu descriptive statistics.
Malimu descriptive statistics.Malimu descriptive statistics.
Malimu descriptive statistics.
 
Inferential statistics
Inferential  statisticsInferential  statistics
Inferential statistics
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Descriptive Statistics, Numerical Description
Descriptive Statistics, Numerical DescriptionDescriptive Statistics, Numerical Description
Descriptive Statistics, Numerical Description
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Statistics 091208004734-phpapp01 (1)
Statistics 091208004734-phpapp01 (1)Statistics 091208004734-phpapp01 (1)
Statistics 091208004734-phpapp01 (1)
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Chapter 05
Chapter 05Chapter 05
Chapter 05
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
Basics of Educational Statistics (Inferential statistics)
Basics of Educational Statistics (Inferential statistics)Basics of Educational Statistics (Inferential statistics)
Basics of Educational Statistics (Inferential statistics)
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
Probability
ProbabilityProbability
Probability
 

En vedette

En vedette (7)

Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01Morestatistics22 091208004743-phpapp01
Morestatistics22 091208004743-phpapp01
 
Week 7 - sampling
Week 7  - samplingWeek 7  - sampling
Week 7 - sampling
 
Berry et al
Berry et alBerry et al
Berry et al
 
Presidency
PresidencyPresidency
Presidency
 
Week 7 Sampling
Week 7   SamplingWeek 7   Sampling
Week 7 Sampling
 
Civil Rights
Civil RightsCivil Rights
Civil Rights
 
Am Federalism
Am FederalismAm Federalism
Am Federalism
 

Similaire à Statistics

Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1
shoffma5
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
AASTHA76
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
karlhennesey
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
aswhite
 

Similaire à Statistics (20)

Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
More Statistics
More StatisticsMore Statistics
More Statistics
 
Chi-square IMP.ppt
Chi-square IMP.pptChi-square IMP.ppt
Chi-square IMP.ppt
 
0hypothesis testing.pdf
0hypothesis testing.pdf0hypothesis testing.pdf
0hypothesis testing.pdf
 
Review Z Test Ci 1
Review Z Test Ci 1Review Z Test Ci 1
Review Z Test Ci 1
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data science
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
 
Important terminologies
Important terminologiesImportant terminologies
Important terminologies
 
Chapter_9.pptx
Chapter_9.pptxChapter_9.pptx
Chapter_9.pptx
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
 
Data Science interview questions of Statistics
Data Science interview questions of Statistics Data Science interview questions of Statistics
Data Science interview questions of Statistics
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Statistical Significance Tests.pptx
Statistical Significance Tests.pptxStatistical Significance Tests.pptx
Statistical Significance Tests.pptx
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
 
Data analysis
Data analysis Data analysis
Data analysis
 
Data science
Data scienceData science
Data science
 
Steps in hypothesis.pptx
Steps in hypothesis.pptxSteps in hypothesis.pptx
Steps in hypothesis.pptx
 
Machine learning session2
Machine learning   session2Machine learning   session2
Machine learning session2
 
How to read a paper
How to read a paperHow to read a paper
How to read a paper
 
Review & Hypothesis Testing
Review & Hypothesis TestingReview & Hypothesis Testing
Review & Hypothesis Testing
 

Plus de mandrewmartin (20)

Regression
RegressionRegression
Regression
 
Diffmeans
DiffmeansDiffmeans
Diffmeans
 
More tabs
More tabsMore tabs
More tabs
 
Crosstabs
CrosstabsCrosstabs
Crosstabs
 
Statisticalrelationships
StatisticalrelationshipsStatisticalrelationships
Statisticalrelationships
 
Research design pt. 2
Research design pt. 2Research design pt. 2
Research design pt. 2
 
Research design
Research designResearch design
Research design
 
Measurement pt. 2
Measurement pt. 2Measurement pt. 2
Measurement pt. 2
 
Measurement
MeasurementMeasurement
Measurement
 
Introduction
IntroductionIntroduction
Introduction
 
Building blocks of scientific research
Building blocks of scientific researchBuilding blocks of scientific research
Building blocks of scientific research
 
Studying politics scientifically
Studying politics scientificallyStudying politics scientifically
Studying politics scientifically
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
 
Stats Intro Ps 372
Stats Intro Ps 372Stats Intro Ps 372
Stats Intro Ps 372
 
Media
MediaMedia
Media
 
Media
MediaMedia
Media
 
Political Parties
Political PartiesPolitical Parties
Political Parties
 
Elections
ElectionsElections
Elections
 
Bureaucracy
BureaucracyBureaucracy
Bureaucracy
 
Judiciary
JudiciaryJudiciary
Judiciary
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Dernier (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Statistics

  • 1. Statistics: First Steps Andrew Martin PS 372 University of Kentucky
  • 2. Variance Variance is a measure of dispersion of data points about the mean for interval- and ratio-level data. Variance is a fundamental concept that social scientists seek to explain in the dependent variable.
  • 3.  
  • 4. Standard Deviation Standard deviation is a measure of dispersion of data points about the mean for interval- and ratio-level data. Like the mean, standard deviation is sensitive to extreme values. Standard deviation is calculated as the square root of the variance.
  • 5.  
  • 6.  
  • 7.
  • 8.
  • 11.  
  • 12. Example Calculate the ID and IQV for this PS 372 class grades using the following frequencies or proportions: Grade Freq. Prop. A 4 (.12)‏ B 7 (.21)‏ C 4 (.12)‏ D 7 (.21)‏ E 12 (.34)
  • 13. Index of Diversity ID = 1 – ( p 2 a + p 2 b + p 2 c + p 2 d + p 2 e )‏ ID = 1 - (.12 2 + .21 2 + .12 2 + .21 2 + .34 2 )‏ ID = 1 - (.0144 + .0441 + .0144 + .0441 + .1156)‏ ID = 1 - (.2326)‏ ID = .7674
  • 14. Index of Qualitative Variation 1 – ( p 2 a + p 2 b + p 2 c + p 2 d + p 2 e )‏ 1 - (1/K)‏
  • 15. Index of Qualitative Variation .7674 (1 – 1/5)‏ .9592
  • 16.  
  • 17. Data Matrix A data matrix is an array of rows and columns that stores the values of a set of variables for all the cases in a data set. This is frequently referred to as a dataset.
  • 18.  
  • 19.  
  • 21. Properties of Good Graphs Should answer several of the following questions: (JRM 384)‏ 1. Where does the center of the distribution lie? 2. How spread out or bunched up are the observations? 3. Does it have a single peak or more than one? 4. Approximately what proportion of observations in in the ends of the distributions?
  • 22. Properties of Good Graphs 5. Do observations tend to pile up at one end of the measurement scale, with relatively few observations at the other end? 6. Are there values that, compared with most, seem very large or very small? 7. How does one distribution compare to another in terms of shape, spread, and central tendency? 8. Do values of one variable seem related to another variable?
  • 23.  
  • 24.  
  • 25.  
  • 26.  
  • 27.  
  • 28. Statistical Concepts Let's quickly review some concepts.
  • 29. Population A population refers to any well-defined set of objects such as people, countries, states, organizations, and so on. The term doesn't simply mean the population of the United States or some other geographical area.
  • 30.
  • 31.
  • 32.
  • 33. Hypothesis Testing Many claims can be translated into specific statements about a population that can be confirmed or disconfirmed with the aid of probability theory. Ex: There is no ideological difference evident in the voting patterns of Republican and Democrat justices on the U.S. Supreme Court.
  • 34. Point and Interval Estimation The goal here is to estimate unknown population parameters from samples and to surround those estimates with confidence intervals. Confidence intervals suggest the estimate's reliability or precision.
  • 35. Hypothesis Testing Start with a specific verbal claim or proposition. Ex: The chances of getting heads or tails when flipping the coin is are roughly the same. Ex: The chances of the United States electing a Republican or Democrat president are roughly the same.
  • 36. Hypothesis Testing Next, the researcher constructs a null hypothesis. A null hypothesis is a statement that a population parameter equals a specific value.
  • 37. Hypothesis Testing Following up on the coin example, the null hypothesis would equal .5. Stated more formally: H 0 : P = .5 Where P stands for the probability that the coin will be heads when tossed. H 0 is typically used to denote a null hypothesis.
  • 38.
  • 39.
  • 40. Hypothesis Testing Perhaps you believe the coin is more likely to come up heads than tails. You would formulate the following alternative hypothesis: H A : P > .5 Conversely, if you believe the coin is less likely to come up heads than tails, you would formulate the alternative hypothesis in the opposite direction: H A : P < .5
  • 41.
  • 42.
  • 43.  
  • 44.
  • 45.
  • 46.  
  • 47.
  • 48.