SlideShare une entreprise Scribd logo
1  sur  38
Télécharger pour lire hors ligne
Chapter 4
Describing the Relation
Between Two Variables
4.1
Scatter Diagrams; Correlation
Bivariate data is data in which two
variables are measured on an individual.
The response variable is the variable
whose value can be explained or
determined based upon the value of the
predictor variable.
A lurking variable is one that is related to
the response and/or predictor variable, but
is excluded from the analysis
A scatter diagram shows the relationship
between two quantitative variables
measured on the same individual. Each
individual in the data set is represented by a
point in the scatter diagram. The predictor
variable is plotted on the horizontal axis and
the response variable is plotted on the
vertical axis. Do not connect the points
when drawing a scatter diagram.
EXAMPLE Drawing a Scatter Diagram
The following data are based on a study for
drilling rock. The researchers wanted to
determine whether the time it takes to dry drill
a distance of 5 feet in rock increases with the
depth at which the drilling begins. So, depth
at which drilling begins is the predictor
variable, x, and time (in minutes) to drill five
feet is the response variable, y. Draw a
scatter diagram of the data.
Source: Penner, R., and Watts, D.G. “Mining Information.” The American Statistician, Vol.
45, No. 1, Feb. 1991, p. 6.
Two variables that are linearly related are said to
be positively associated when above average
values of one variable are associated with above
average values of the corresponding variable.
That is, two variables are positively associated
when the values of the predictor variable increase,
the values of the response variable also increase.
Two variables that are linearly related are said to
be negatively associated when above average
values of one variable are associated with below
average values of the corresponding variable.
That is, two variables are negatively associated
when the values of the predictor variable increase,
the values of the response variable decrease
The linear correlation coefficient or Pearson
product moment correlation coefficient is a
measure of the strength of linear relation between
two quantitative variables. We use the Greek letter
(rho) to represent the population correlation
coefficient and r to represent the sample correlation
coefficient. We shall only present the formula for
the sample correlation coefficient.
1. The linear correlation coefficient is always
between -1 and 1, inclusive. That is, -1 < r < 1.
2. If r = +1, there is a perfect positive linear relation
between the two variables.
3. If r = -1, there is a perfect negative linear relation
between the two variables.
4. The closer r is to +1, the stronger the evidence of
positive association between the two variables.
5. The closer r is to -1, the stronger the evidence of
negative association between the two variables.
Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
6. If r is close to 0, there is evidence of no linear
relation between the two variables. Because the
linear correlation coefficient is a measure of
strength of linear relation, r close to 0 does not
imply no relation, just no linear relation.
7. It is a unitless measure of association. So, the
unit of measure for x and y plays no role in the
interpretation of r.
Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
EXAMPLE Drawing a Scatter Diagram and
Computing the Correlation Coefficient
For the following data
(a)Draw a scatter diagram and comment on the
type of relation that appears to exist between x
and y.
(b) By hand, compute the linear correlation
coefficient.
EXAMPLE Determining the Linear
Correlation Coefficient
Determine the linear correlation coefficient
of the drilling data.
i
x
x x
s
− i
y
y y
s
−
i i
x y
x x y y
s s
  − −
   
  
x =
y =
A linear correlation coefficient that implies
a strong positive or negative association
that is computed using observational data
does not imply causation among the
variables.
Chapter 4
Describing the Relation
Between Two Variables
4.2
Least-squares Regression
EXAMPLE Finding an Equation that Describes
a Linear Relation
(a) Find a linear equation that relates x (the
predictor variable) and y (the response variable)
by selecting two points and finding the equation
of the line containing the points.
(b) Graph the equation on the scatter diagram.
(c) Use the equation to predict y if x = 5.
Using the following sample data:
The difference between the observed value
of y and the predicted value of y is the error
or residual. That is
residual = observed - predicted
Compute the residual for the prediction
corresponding to x = 5.
EXAMPLE Finding the Least-squares
Regression Line
Using the sample data:
(a) Find the least-squares regression line.
(b) Interpret the slope and intercept.
(c) Predict y if x = 5.
(d) Compute the residual for x = 5.
(e) Draw the least-squares regression line on the
scatter diagram of the data.
EXAMPLE Computing the Sum of Squared
Residuals
Compute the sum of squared residuals for
the line describing the relation between x
and y that was obtained using two points.
Compute the sum of squared residuals for
the least-squares regression line. Which is
smaller?
EXAMPLE Finding the Least-squares
Regression Line
(a) Find the least-squares regression line
for the drilling data.
(b) Use the line to predict the drilling time
at x = 130 feet.
(c) Should the line be used to predict the
drilling time at x = 400 feet? Why?
(d) Interpret the slope and y-intercept.
Math n Statistic
Math n Statistic

Contenu connexe

Tendances

Regression analysis
Regression analysisRegression analysis
Regression analysissaba khan
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Lauren Crosby
 
Sumit presentation
Sumit presentationSumit presentation
Sumit presentationSumit Bharti
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
Regression & correlation
Regression & correlationRegression & correlation
Regression & correlationAtiq Rehman
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regressionalok tiwari
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression pptSantosh Bhaskar
 
9.2 lin reg coeff of det
9.2 lin reg coeff of det9.2 lin reg coeff of det
9.2 lin reg coeff of detleblance
 
Regression analysis
Regression analysisRegression analysis
Regression analysisSrikant001p
 
Least Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaLeast Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaEdureka!
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inferenceKemal İnciroğlu
 
Kendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotKendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotBharath kumar Karanam
 
4. regression analysis1
4. regression analysis14. regression analysis1
4. regression analysis1Karan Kukreja
 

Tendances (19)

Linear regression
Linear regressionLinear regression
Linear regression
 
Coefficient of correlation
Coefficient of correlationCoefficient of correlation
Coefficient of correlation
 
Spearman Rank
Spearman RankSpearman Rank
Spearman Rank
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011
 
Sumit presentation
Sumit presentationSumit presentation
Sumit presentation
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
 
Scatter diagram
Scatter diagramScatter diagram
Scatter diagram
 
Regression & correlation
Regression & correlationRegression & correlation
Regression & correlation
 
Regression
RegressionRegression
Regression
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
 
correlation
correlation correlation
correlation
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression ppt
 
9.2 lin reg coeff of det
9.2 lin reg coeff of det9.2 lin reg coeff of det
9.2 lin reg coeff of det
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Least Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaLeast Squares Regression Method | Edureka
Least Squares Regression Method | Edureka
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
Kendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotKendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plot
 
4. regression analysis1
4. regression analysis14. regression analysis1
4. regression analysis1
 

En vedette

Scatter diagrams and correlation
Scatter diagrams and correlationScatter diagrams and correlation
Scatter diagrams and correlationkeithpeter
 
scatter diagram
 scatter diagram scatter diagram
scatter diagramshrey8916
 
Coefficient of correlation...ppt
Coefficient of correlation...pptCoefficient of correlation...ppt
Coefficient of correlation...pptRahul Dhaker
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionSumit Prajapati
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Namrata Saxena
 
7 c's of marketing.
7 c's of marketing.7 c's of marketing.
7 c's of marketing.ssagar88
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis pptAnil Mishra
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationVasant Kothari
 

En vedette (16)

Scatter Diagrams
Scatter DiagramsScatter Diagrams
Scatter Diagrams
 
9.1
9.19.1
9.1
 
7 qc tools
7 qc tools7 qc tools
7 qc tools
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
 
Scatter diagrams and correlation
Scatter diagrams and correlationScatter diagrams and correlation
Scatter diagrams and correlation
 
scatter diagram
 scatter diagram scatter diagram
scatter diagram
 
Scatter diagram in tqm
Scatter diagram in tqmScatter diagram in tqm
Scatter diagram in tqm
 
Coefficient of correlation...ppt
Coefficient of correlation...pptCoefficient of correlation...ppt
Coefficient of correlation...ppt
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And Regression
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)
 
7 c's of marketing.
7 c's of marketing.7 c's of marketing.
7 c's of marketing.
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Correlation ppt...
Correlation ppt...Correlation ppt...
Correlation ppt...
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlation
 

Similaire à Math n Statistic (20)

9. parametric regression
9. parametric regression9. parametric regression
9. parametric regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation and regression impt
Correlation and regression imptCorrelation and regression impt
Correlation and regression impt
 
Correlation
CorrelationCorrelation
Correlation
 
REGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HEREREGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HERE
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
2-20-04.ppt
2-20-04.ppt2-20-04.ppt
2-20-04.ppt
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Stat
StatStat
Stat
 
Regression -Linear.pptx
Regression -Linear.pptxRegression -Linear.pptx
Regression -Linear.pptx
 
Scatterplots, Correlation, and Regression
Scatterplots, Correlation, and RegressionScatterplots, Correlation, and Regression
Scatterplots, Correlation, and Regression
 
Study of Correlation
Study of Correlation Study of Correlation
Study of Correlation
 
Measure of Association
Measure of AssociationMeasure of Association
Measure of Association
 
RMBS - CORRELATION.pptx
RMBS - CORRELATION.pptxRMBS - CORRELATION.pptx
RMBS - CORRELATION.pptx
 

Dernier

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxAna-Maria Mihalceanu
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - AvrilIvanti
 
QMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfQMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfROWELL MARQUINA
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...amber724300
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Karmanjay Verma
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfAarwolf Industries LLC
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Nikki Chapple
 

Dernier (20)

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
A Glance At The Java Performance Toolbox
A Glance At The Java Performance ToolboxA Glance At The Java Performance Toolbox
A Glance At The Java Performance Toolbox
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - Avril
 
QMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfQMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdf
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdf
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
 

Math n Statistic

  • 1. Chapter 4 Describing the Relation Between Two Variables 4.1 Scatter Diagrams; Correlation
  • 2. Bivariate data is data in which two variables are measured on an individual. The response variable is the variable whose value can be explained or determined based upon the value of the predictor variable. A lurking variable is one that is related to the response and/or predictor variable, but is excluded from the analysis
  • 3. A scatter diagram shows the relationship between two quantitative variables measured on the same individual. Each individual in the data set is represented by a point in the scatter diagram. The predictor variable is plotted on the horizontal axis and the response variable is plotted on the vertical axis. Do not connect the points when drawing a scatter diagram.
  • 4. EXAMPLE Drawing a Scatter Diagram The following data are based on a study for drilling rock. The researchers wanted to determine whether the time it takes to dry drill a distance of 5 feet in rock increases with the depth at which the drilling begins. So, depth at which drilling begins is the predictor variable, x, and time (in minutes) to drill five feet is the response variable, y. Draw a scatter diagram of the data. Source: Penner, R., and Watts, D.G. “Mining Information.” The American Statistician, Vol. 45, No. 1, Feb. 1991, p. 6.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. Two variables that are linearly related are said to be positively associated when above average values of one variable are associated with above average values of the corresponding variable. That is, two variables are positively associated when the values of the predictor variable increase, the values of the response variable also increase.
  • 11. Two variables that are linearly related are said to be negatively associated when above average values of one variable are associated with below average values of the corresponding variable. That is, two variables are negatively associated when the values of the predictor variable increase, the values of the response variable decrease
  • 12. The linear correlation coefficient or Pearson product moment correlation coefficient is a measure of the strength of linear relation between two quantitative variables. We use the Greek letter (rho) to represent the population correlation coefficient and r to represent the sample correlation coefficient. We shall only present the formula for the sample correlation coefficient.
  • 13. 1. The linear correlation coefficient is always between -1 and 1, inclusive. That is, -1 < r < 1. 2. If r = +1, there is a perfect positive linear relation between the two variables. 3. If r = -1, there is a perfect negative linear relation between the two variables. 4. The closer r is to +1, the stronger the evidence of positive association between the two variables. 5. The closer r is to -1, the stronger the evidence of negative association between the two variables. Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
  • 14. 6. If r is close to 0, there is evidence of no linear relation between the two variables. Because the linear correlation coefficient is a measure of strength of linear relation, r close to 0 does not imply no relation, just no linear relation. 7. It is a unitless measure of association. So, the unit of measure for x and y plays no role in the interpretation of r. Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. EXAMPLE Drawing a Scatter Diagram and Computing the Correlation Coefficient For the following data (a)Draw a scatter diagram and comment on the type of relation that appears to exist between x and y. (b) By hand, compute the linear correlation coefficient.
  • 25. EXAMPLE Determining the Linear Correlation Coefficient Determine the linear correlation coefficient of the drilling data.
  • 26.
  • 27. i x x x s − i y y y s − i i x y x x y y s s   − −        x = y =
  • 28. A linear correlation coefficient that implies a strong positive or negative association that is computed using observational data does not imply causation among the variables.
  • 29. Chapter 4 Describing the Relation Between Two Variables 4.2 Least-squares Regression
  • 30. EXAMPLE Finding an Equation that Describes a Linear Relation (a) Find a linear equation that relates x (the predictor variable) and y (the response variable) by selecting two points and finding the equation of the line containing the points. (b) Graph the equation on the scatter diagram. (c) Use the equation to predict y if x = 5. Using the following sample data:
  • 31. The difference between the observed value of y and the predicted value of y is the error or residual. That is residual = observed - predicted Compute the residual for the prediction corresponding to x = 5.
  • 32.
  • 33.
  • 34. EXAMPLE Finding the Least-squares Regression Line Using the sample data: (a) Find the least-squares regression line. (b) Interpret the slope and intercept. (c) Predict y if x = 5. (d) Compute the residual for x = 5. (e) Draw the least-squares regression line on the scatter diagram of the data.
  • 35. EXAMPLE Computing the Sum of Squared Residuals Compute the sum of squared residuals for the line describing the relation between x and y that was obtained using two points. Compute the sum of squared residuals for the least-squares regression line. Which is smaller?
  • 36. EXAMPLE Finding the Least-squares Regression Line (a) Find the least-squares regression line for the drilling data. (b) Use the line to predict the drilling time at x = 130 feet. (c) Should the line be used to predict the drilling time at x = 400 feet? Why? (d) Interpret the slope and y-intercept.