SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
INTRODUCTION TO STATISTICS &
PROBABILITY
Chapter 2:
Looking at Data–Relationships (Part 1)
1
Dr. Nahid Sultana
Chapter 2:
Looking at Data–Relationships
2
2.1: Scatterplots
2.2: Correlation
2.3: Least-Squares Regression
2.5: Data Analysis for Two-Way Tables
3
Objectives
 Bivariate data
 Explanatory and response variables
 Scatterplots
 Interpreting scatterplots
 Outliers
 Categorical variables in scatterplots
2.1: Scatterplots
Bivariate data
4
 For each individual studied, we record
data on two variables.
 We then examine whether there is a
relationship between these two
variables: Do changes in one variable
tend to be associated with specific
changes in the other variables?
Student
ID
Number
of Beers
Blood Alcohol
Content
1 5 0.1
2 2 0.03
3 9 0.19
6 7 0.095
7 3 0.07
9 3 0.02
11 4 0.07
13 5 0.085
4 8 0.12
5 3 0.04
8 5 0.06
10 5 0.05
12 6 0.1
14 7 0.09
15 1 0.01
16 4 0.05
Here we have two quantitative variables
recorded for each of 16 students:
1. how many beers they drank
2. their resulting blood alcohol content
(BAC)
5
 Many interesting examples of the use of statistics involve
relationships between pairs of variables.
Two variables measured on the same cases are associated if
knowing the value of one of the variables tells you something about
the values of the other variable that you would not know without this
information.
5
Associations Between Variables
 A response (dependent) variable measures an outcome of a study.
 An explanatory (independent) variable explains changes in the
response variable.
6
Scatterplot
6
 The most useful graph for displaying the relationship between two
quantitative variables on the same individuals is a scatterplot.
1. Decide which variable should go on which axis.
2. Typically, the explanatory or independent variable is plotted
on the x-axis, and the response or dependent variable is plotted
on the y-axis.
3. Label and scale your axes.
4. Plot individual data values.
How to Make a Scatterplot
7
Scatterplot (Cont…)
Example: Make a scatterplot of the relationship between body
weight and backpack weight for a group of hikers.
7
Body weight (lb) 120 187 109 103 131 165 158 116
Backpack weight (lb) 26 30 26 24 29 35 31 28
8
Interpreting Scatterplots
8
 After plotting two variables on a scatterplot, we describe the
overall pattern of the relationship. Specifically, we look for form,
direction, and strength .
Form: linear, curved, clusters, no pattern
Direction: positive, negative, no direction
Strength: how closely the points fit the “form”
… and clear deviations from that pattern
Outliers of the relationship, , an individual value that falls
outside the overall pattern of the relationship
How to Examine a Scatterplot
9
Linear
Nonlinear
No relationship
Interpreting Scatterplots (Cont…)
(Form)
10
Interpreting Scatterplots (Cont…)
(Direction)
Positive association: High values of one variable tend to occur
together with high values of the other variable.
Negative association: High values of one variable tend to occur
together with low values of the other variable
11
Interpreting Scatterplots (Cont…)
No relationship: X and Y vary independently. Knowing X tells you
nothing about Y.
12
Interpreting Scatterplots (Cont…)
(Strength)
The strength of the relationship between the two variables can be
seen by how much variation, or scatter, there is around the main
form.
13
Interpreting Scatterplots (Cont…)
(Outliers)
In a scatterplot, outliers are points that fall outside of the overall
pattern of the relationship.
14
Interpreting Scatterplots (Cont…)
Direction FormStrength
 There is one possible
outlier―the hiker with
the body weight of 187
pounds seems to be
carrying relatively less
weight than are the
other group members.
 There is a moderately strong, positive, linear relationship between body
weight and backpack weight.
 It appears that lighter hikers are carrying lighter backpacks.
How to scale a scatterplot
15
Using an inappropriate
scale for a scatterplot can
give an incorrect
impression.
Both variables should be
given a similar amount of
space:
• Plot roughly square
• Points should occupy all
the plot space (no blank
space)
Same data in all four plots
Categorical variables in scatterplots
16
What may look like a positive
linear relationship is in fact a
series of negative linear
associations.
Plotting different habitats in
different colors allows us to
make that important distinction.
To add a categorical variable, use a different plot color or symbol for
each category.
17
Categorical variables in scatterplots
(Cont…)
Comparison of men and women
racing records over time.
Each group shows a very strong
negative linear relationship that
would not be apparent without the
gender categorization.
Relationship between lean body
mass and metabolic rate in men
and women.
Both men and women follow the
same positive linear trend, but
women show a stronger association.
Categorical explanatory variables
When the explanatory variable is categorical, you cannot make a
scatterplot, but you can compare the different categories side by side on
the same graph (boxplots, or mean +/− standard deviation).
Comparison of income (quantitative
response variable) for different
education levels (five categories).
But be careful in your
interpretation: This is NOT a
positive association, because
education is not quantitative.

Contenu connexe

Tendances

Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
drasifk
 
Spss cross tab n chi sq bivariate analysis
Spss  cross tab n chi sq bivariate analysisSpss  cross tab n chi sq bivariate analysis
Spss cross tab n chi sq bivariate analysis
Raja Azrul Raja Ahmad
 
Chapter 16: Correlation (enhanced by VisualBee)
Chapter 16: Correlation  
(enhanced by VisualBee)Chapter 16: Correlation  
(enhanced by VisualBee)
Chapter 16: Correlation (enhanced by VisualBee)
nunngera
 

Tendances (20)

Visualization-1
Visualization-1Visualization-1
Visualization-1
 
Assumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine LearningAssumptions of Linear Regression - Machine Learning
Assumptions of Linear Regression - Machine Learning
 
Regression
RegressionRegression
Regression
 
Simple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-StepSimple Linear Regression: Step-By-Step
Simple Linear Regression: Step-By-Step
 
More tabs
More tabsMore tabs
More tabs
 
Regression
RegressionRegression
Regression
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
 
8 correlation regression
8 correlation regression 8 correlation regression
8 correlation regression
 
Simple regression and correlation
Simple regression and correlationSimple regression and correlation
Simple regression and correlation
 
Graphs that Enlighten and Graphs that Deceive
Graphs that Enlighten and Graphs that DeceiveGraphs that Enlighten and Graphs that Deceive
Graphs that Enlighten and Graphs that Deceive
 
Correlation and regression analysis
Correlation and regression analysisCorrelation and regression analysis
Correlation and regression analysis
 
Statistics in nursing research
Statistics in nursing researchStatistics in nursing research
Statistics in nursing research
 
Spss cross tab n chi sq bivariate analysis
Spss  cross tab n chi sq bivariate analysisSpss  cross tab n chi sq bivariate analysis
Spss cross tab n chi sq bivariate analysis
 
Presentation on Regression Analysis
Presentation on Regression AnalysisPresentation on Regression Analysis
Presentation on Regression Analysis
 
Chapter 16: Correlation (enhanced by VisualBee)
Chapter 16: Correlation  
(enhanced by VisualBee)Chapter 16: Correlation  
(enhanced by VisualBee)
Chapter 16: Correlation (enhanced by VisualBee)
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Crosstabs
CrosstabsCrosstabs
Crosstabs
 
How to Make a Bar Graph
How to Make a Bar GraphHow to Make a Bar Graph
How to Make a Bar Graph
 
Crosstabs
CrosstabsCrosstabs
Crosstabs
 
03.data presentation(2015) 2
03.data presentation(2015) 203.data presentation(2015) 2
03.data presentation(2015) 2
 

En vedette

Chapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical InferenceChapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical Inference
nszakir
 
Портрет слова группа 2
Портрет слова группа 2Портрет слова группа 2
Портрет слова группа 2
Harokol
 
Проект Павленко "Безопасные каникулы".
Проект Павленко "Безопасные каникулы".Проект Павленко "Безопасные каникулы".
Проект Павленко "Безопасные каникулы".
Harokol
 
Портфолио Чекусовой
Портфолио Чекусовой Портфолио Чекусовой
Портфолио Чекусовой
Harokol
 
Портрет слова группа 1
Портрет слова группа 1Портрет слова группа 1
Портрет слова группа 1
Harokol
 
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
Andrew Agbenin
 
Презентация памятники Волгодонска. Петрова Алла
Презентация памятники Волгодонска. Петрова АллаПрезентация памятники Волгодонска. Петрова Алла
Презентация памятники Волгодонска. Петрова Алла
Harokol
 
Expecting Parents Guide to Birth Defects ebook
Expecting Parents Guide to Birth Defects ebookExpecting Parents Guide to Birth Defects ebook
Expecting Parents Guide to Birth Defects ebook
Perey Law
 
Space Hustlers Comic
Space Hustlers ComicSpace Hustlers Comic
Space Hustlers Comic
Steve Owen
 
Impact of the greece downturn
Impact of the greece downturnImpact of the greece downturn
Impact of the greece downturn
Zain Shaikh
 

En vedette (20)

Chapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by ContrapositiveChapter-4: More on Direct Proof and Proof by Contrapositive
Chapter-4: More on Direct Proof and Proof by Contrapositive
 
Chapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical InferenceChapter 3 part3-Toward Statistical Inference
Chapter 3 part3-Toward Statistical Inference
 
Chapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample MeanChapter 5 part1- The Sampling Distribution of a Sample Mean
Chapter 5 part1- The Sampling Distribution of a Sample Mean
 
Портрет слова группа 2
Портрет слова группа 2Портрет слова группа 2
Портрет слова группа 2
 
DMDL EditorXとToad Editorの紹介
DMDL EditorXとToad Editorの紹介DMDL EditorXとToad Editorの紹介
DMDL EditorXとToad Editorの紹介
 
Receiving your State Pension abroad
Receiving your State Pension abroadReceiving your State Pension abroad
Receiving your State Pension abroad
 
Проект Павленко "Безопасные каникулы".
Проект Павленко "Безопасные каникулы".Проект Павленко "Безопасные каникулы".
Проект Павленко "Безопасные каникулы".
 
proekti
proektiproekti
proekti
 
Cheney Court - Linguarama
Cheney Court - LinguaramaCheney Court - Linguarama
Cheney Court - Linguarama
 
Laboratory and physical assessment data (1)
Laboratory and physical assessment data (1)Laboratory and physical assessment data (1)
Laboratory and physical assessment data (1)
 
Портфолио Чекусовой
Портфолио Чекусовой Портфолио Чекусовой
Портфолио Чекусовой
 
Портрет слова группа 1
Портрет слова группа 1Портрет слова группа 1
Портрет слова группа 1
 
samoupravlenye
samoupravlenyesamoupravlenye
samoupravlenye
 
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
LABORATORY AND PHYSICAL ASSESSMENT DATA (1)
 
Презентация памятники Волгодонска. Петрова Алла
Презентация памятники Волгодонска. Петрова АллаПрезентация памятники Волгодонска. Петрова Алла
Презентация памятники Волгодонска. Петрова Алла
 
Expecting Parents Guide to Birth Defects ebook
Expecting Parents Guide to Birth Defects ebookExpecting Parents Guide to Birth Defects ebook
Expecting Parents Guide to Birth Defects ebook
 
Space Hustlers Comic
Space Hustlers ComicSpace Hustlers Comic
Space Hustlers Comic
 
(社)アンチエイジング学会 年会費特典
(社)アンチエイジング学会 年会費特典(社)アンチエイジング学会 年会費特典
(社)アンチエイジング学会 年会費特典
 
2016: A good year to invest in Spanish property?
2016: A good year to invest in Spanish property?2016: A good year to invest in Spanish property?
2016: A good year to invest in Spanish property?
 
Impact of the greece downturn
Impact of the greece downturnImpact of the greece downturn
Impact of the greece downturn
 

Similaire à Chapter 2 part1-Scatterplots

Maths A - Chapter 11
Maths A - Chapter 11Maths A - Chapter 11
Maths A - Chapter 11
westy67968
 
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docxDoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
kanepbyrne80830
 
Correlation: Bivariate Data and Scatter Plot
Correlation: Bivariate Data and Scatter PlotCorrelation: Bivariate Data and Scatter Plot
Correlation: Bivariate Data and Scatter Plot
DenzelMontuya1
 
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docxiStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
vrickens
 
Linear regression
Linear regressionLinear regression
Linear regression
DepEd
 
Stats For Life Module7 Oc
Stats For Life Module7 OcStats For Life Module7 Oc
Stats For Life Module7 Oc
N Rabe
 
Frequency Tables - Statistics
Frequency Tables - StatisticsFrequency Tables - Statistics
Frequency Tables - Statistics
mscartersmaths
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
Ulster BOCES
 

Similaire à Chapter 2 part1-Scatterplots (20)

Chapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlationChapter 03 scatterplots and correlation
Chapter 03 scatterplots and correlation
 
Maths A - Chapter 11
Maths A - Chapter 11Maths A - Chapter 11
Maths A - Chapter 11
 
Scatter diagram
Scatter diagramScatter diagram
Scatter diagram
 
Scatterplots - LSRLs - RESIDs
Scatterplots - LSRLs - RESIDsScatterplots - LSRLs - RESIDs
Scatterplots - LSRLs - RESIDs
 
Scatter plot- Complete
Scatter plot- CompleteScatter plot- Complete
Scatter plot- Complete
 
Scattergrams
ScattergramsScattergrams
Scattergrams
 
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docxDoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
DoW #6 TVs and Life ExpectanciesFor this weeks DoW, you wi.docx
 
Examining relationships m2
Examining relationships m2Examining relationships m2
Examining relationships m2
 
Scatter Plot.pptx
Scatter Plot.pptxScatter Plot.pptx
Scatter Plot.pptx
 
Tps4e ch1 1.1
Tps4e ch1 1.1Tps4e ch1 1.1
Tps4e ch1 1.1
 
Logistic regression teaching
Logistic regression teachingLogistic regression teaching
Logistic regression teaching
 
Notes s8811 structuralequations2004
Notes s8811 structuralequations2004Notes s8811 structuralequations2004
Notes s8811 structuralequations2004
 
Correlation: Bivariate Data and Scatter Plot
Correlation: Bivariate Data and Scatter PlotCorrelation: Bivariate Data and Scatter Plot
Correlation: Bivariate Data and Scatter Plot
 
Correlation.pptx.pdf
Correlation.pptx.pdfCorrelation.pptx.pdf
Correlation.pptx.pdf
 
Chapter 5
Chapter 5Chapter 5
Chapter 5
 
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docxiStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
iStockphotoThinkstockchapter 8Factorial and Mixed-Fac.docx
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Stats For Life Module7 Oc
Stats For Life Module7 OcStats For Life Module7 Oc
Stats For Life Module7 Oc
 
Frequency Tables - Statistics
Frequency Tables - StatisticsFrequency Tables - Statistics
Frequency Tables - Statistics
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
 

Plus de nszakir

Chapter 3 part2- Sampling Design
Chapter 3 part2- Sampling DesignChapter 3 part2- Sampling Design
Chapter 3 part2- Sampling Design
nszakir
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
nszakir
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlation
nszakir
 
Density Curves and Normal Distributions
Density Curves and Normal DistributionsDensity Curves and Normal Distributions
Density Curves and Normal Distributions
nszakir
 
Describing Distributions with Numbers
Describing Distributions with NumbersDescribing Distributions with Numbers
Describing Distributions with Numbers
nszakir
 

Plus de nszakir (15)

Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVEChapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
Chapter-3: DIRECT PROOF AND PROOF BY CONTRAPOSITIVE
 
Chapter 2: Relations
Chapter 2: RelationsChapter 2: Relations
Chapter 2: Relations
 
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
Chapter 7 : Inference for Distributions(The t Distributions, One-Sample t Con...
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
 
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
Chapter 6 part1- Introduction to Inference-Estimating with Confidence (Introd...
 
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
Chapter 5 part2- Sampling Distributions for Counts and Proportions (Binomial ...
 
Chapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability RulesChapter 4 part4- General Probability Rules
Chapter 4 part4- General Probability Rules
 
Chapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random VariablesChapter 4 part3- Means and Variances of Random Variables
Chapter 4 part3- Means and Variances of Random Variables
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Model
 
Chapter 3 part2- Sampling Design
Chapter 3 part2- Sampling DesignChapter 3 part2- Sampling Design
Chapter 3 part2- Sampling Design
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
 
Chapter 2 part2-Correlation
Chapter 2 part2-CorrelationChapter 2 part2-Correlation
Chapter 2 part2-Correlation
 
Density Curves and Normal Distributions
Density Curves and Normal DistributionsDensity Curves and Normal Distributions
Density Curves and Normal Distributions
 
Describing Distributions with Numbers
Describing Distributions with NumbersDescribing Distributions with Numbers
Describing Distributions with Numbers
 

Chapter 2 part1-Scatterplots

  • 1. INTRODUCTION TO STATISTICS & PROBABILITY Chapter 2: Looking at Data–Relationships (Part 1) 1 Dr. Nahid Sultana
  • 2. Chapter 2: Looking at Data–Relationships 2 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way Tables
  • 3. 3 Objectives  Bivariate data  Explanatory and response variables  Scatterplots  Interpreting scatterplots  Outliers  Categorical variables in scatterplots 2.1: Scatterplots
  • 4. Bivariate data 4  For each individual studied, we record data on two variables.  We then examine whether there is a relationship between these two variables: Do changes in one variable tend to be associated with specific changes in the other variables? Student ID Number of Beers Blood Alcohol Content 1 5 0.1 2 2 0.03 3 9 0.19 6 7 0.095 7 3 0.07 9 3 0.02 11 4 0.07 13 5 0.085 4 8 0.12 5 3 0.04 8 5 0.06 10 5 0.05 12 6 0.1 14 7 0.09 15 1 0.01 16 4 0.05 Here we have two quantitative variables recorded for each of 16 students: 1. how many beers they drank 2. their resulting blood alcohol content (BAC)
  • 5. 5  Many interesting examples of the use of statistics involve relationships between pairs of variables. Two variables measured on the same cases are associated if knowing the value of one of the variables tells you something about the values of the other variable that you would not know without this information. 5 Associations Between Variables  A response (dependent) variable measures an outcome of a study.  An explanatory (independent) variable explains changes in the response variable.
  • 6. 6 Scatterplot 6  The most useful graph for displaying the relationship between two quantitative variables on the same individuals is a scatterplot. 1. Decide which variable should go on which axis. 2. Typically, the explanatory or independent variable is plotted on the x-axis, and the response or dependent variable is plotted on the y-axis. 3. Label and scale your axes. 4. Plot individual data values. How to Make a Scatterplot
  • 7. 7 Scatterplot (Cont…) Example: Make a scatterplot of the relationship between body weight and backpack weight for a group of hikers. 7 Body weight (lb) 120 187 109 103 131 165 158 116 Backpack weight (lb) 26 30 26 24 29 35 31 28
  • 8. 8 Interpreting Scatterplots 8  After plotting two variables on a scatterplot, we describe the overall pattern of the relationship. Specifically, we look for form, direction, and strength . Form: linear, curved, clusters, no pattern Direction: positive, negative, no direction Strength: how closely the points fit the “form” … and clear deviations from that pattern Outliers of the relationship, , an individual value that falls outside the overall pattern of the relationship How to Examine a Scatterplot
  • 10. 10 Interpreting Scatterplots (Cont…) (Direction) Positive association: High values of one variable tend to occur together with high values of the other variable. Negative association: High values of one variable tend to occur together with low values of the other variable
  • 11. 11 Interpreting Scatterplots (Cont…) No relationship: X and Y vary independently. Knowing X tells you nothing about Y.
  • 12. 12 Interpreting Scatterplots (Cont…) (Strength) The strength of the relationship between the two variables can be seen by how much variation, or scatter, there is around the main form.
  • 13. 13 Interpreting Scatterplots (Cont…) (Outliers) In a scatterplot, outliers are points that fall outside of the overall pattern of the relationship.
  • 14. 14 Interpreting Scatterplots (Cont…) Direction FormStrength  There is one possible outlier―the hiker with the body weight of 187 pounds seems to be carrying relatively less weight than are the other group members.  There is a moderately strong, positive, linear relationship between body weight and backpack weight.  It appears that lighter hikers are carrying lighter backpacks.
  • 15. How to scale a scatterplot 15 Using an inappropriate scale for a scatterplot can give an incorrect impression. Both variables should be given a similar amount of space: • Plot roughly square • Points should occupy all the plot space (no blank space) Same data in all four plots
  • 16. Categorical variables in scatterplots 16 What may look like a positive linear relationship is in fact a series of negative linear associations. Plotting different habitats in different colors allows us to make that important distinction. To add a categorical variable, use a different plot color or symbol for each category.
  • 17. 17 Categorical variables in scatterplots (Cont…) Comparison of men and women racing records over time. Each group shows a very strong negative linear relationship that would not be apparent without the gender categorization. Relationship between lean body mass and metabolic rate in men and women. Both men and women follow the same positive linear trend, but women show a stronger association.
  • 18. Categorical explanatory variables When the explanatory variable is categorical, you cannot make a scatterplot, but you can compare the different categories side by side on the same graph (boxplots, or mean +/− standard deviation). Comparison of income (quantitative response variable) for different education levels (five categories). But be careful in your interpretation: This is NOT a positive association, because education is not quantitative.