INTRODUCTION TO STATISTICS &
PROBABILITY
Chapter 1:

Looking at Data—Distributions (Part 2)
1.2 Describing Distributions with Numbers

Dr. Nahid Sultana
Objectives

 Measures of center: mean, median
 Measures of spread: quartiles, standard deviation
 Five-number summary and boxplot

 IQR and outliers
 Choosing among summary statistics

 Changing the unit of measurement
Measures of center: The Mean

 The most common measure of center is the arithmetic
average, or mean, or sample mean.
 To calculate the average, or mean, add all values, then
divide by the number of individuals.
 It is the “center of mass.”
 If the n observations are x_1, x_2, x_3, …, x_n, their mean is
\[ \bar{x} = \frac{\text{sum of observations}}{n} = \frac{x_1 + x_2 + \cdots + x_n}{n}, \]
or, in more compact notation,
\[ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i. \]
Measures of center: The Mean (cont…)

Find the mean:
Here are the scores on the first exam in an introductory statistics course for 10 students:

80 73 92 85 75 98 93 55 80 90

Find the mean first-exam score for these students.
Solution:
\[ \bar{x} = \frac{80 + 73 + 92 + 85 + 75 + 98 + 93 + 55 + 80 + 90}{10} = \frac{821}{10} = 82.1 \]
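The arithmetic for this example can be checked with a few lines of Python. This is only a minimal sketch: the variable names `scores` and `mean_score` are illustrative and do not come from the slides.

```python
# First-exam scores for the 10 students in the example.
scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]

# Mean = sum of observations divided by the number of observations.
mean_score = sum(scores) / len(scores)

print(sum(scores), len(scores), mean_score)
```

The sum of the ten scores is 821, so the mean is 821/10 = 82.1.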
Measuring Center: The Median

 Another common measure of center is the median.

 The median M is the midpoint of a distribution, the
number such that half of the observations are smaller
and the other half are larger.
To find the median of a distribution:
1. Arrange all observations from smallest to largest.
2. If the number of observations n is odd, the median M is the
center observation in the ordered list.
3. If the number of observations n is even, the median M is the
average of the two center observations in the ordered list.
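Measuring Center: The Median (cont...)

Find the median:
Here are the scores on the first exam in an introductory statistics course for 10 students:

80 73 92 85 75 98 93 55 80 90

Find the median first-exam score for these students.
Solution: In order, the scores are 55 73 75 80 80 85 90 92 93 98. Because n = 10 is even, the median is the average of the two center observations: M = (80 + 85)/2 = 82.5.

Note: The location of the median is (n + 1)/2 in the sorted list. Here (10 + 1)/2 = 5.5, halfway between the 5th and 6th observations.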
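The three-step rule for the median can be sketched in Python as follows. The function name `median_of` is an illustrative choice, not something from the slides.

```python
def median_of(values):
    """Median by the slide's rule: sort, then take the center observation
    (odd n) or the average of the two center observations (even n)."""
    ordered = sorted(values)           # step 1: smallest to largest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:                     # step 2: odd n, single center value
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2   # step 3: even n

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
print(median_of(scores))
```

For the ten exam scores the two center observations are 80 and 85, so the function returns 82.5.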
Comparing Mean and Median

 If the distribution is exactly symmetric, the mean and the median are the same; in a roughly symmetric distribution they are close together.

 In a skewed distribution, the mean is usually farther out in the long tail than is the median.

 The median is a measure of center that is resistant to skew and
outliers. The mean is not.
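A small experiment illustrates resistance. The sketch below uses the ten exam scores from the earlier examples and a hypothetical data-entry error (98 recorded as 980), which is an assumption made only for illustration.

```python
from statistics import mean, median

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]

# Hypothetical data-entry error: the score 98 is recorded as 980.
with_error = [980 if s == 98 else s for s in scores]

print("original:  ", mean(scores), median(scores))
print("with error:", mean(with_error), median(with_error))
```

The single extreme value pulls the mean from 82.1 up to 170.3, while the median stays at 82.5.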
Measuring Spread: The Quartiles

A measure of center alone can be misleading. A useful numerical
description of a distribution requires both a measure of center and a
measure of spread.
 We describe the spread or variability of a distribution by giving
several percentiles.
 The median divides the data in two parts; half of the observations
are above the median and half are below the median. We could
call the median the 50th percentile.
 The lower quartile (first quartile, Q1) is the median of the lower
half of the data; the upper quartile (third quartile, Q3) is the
median of the upper half of the data.
 With the median, the quartiles divide the data into four equal
parts; 25% of the data are in each part.
Measuring Spread: The Quartiles (Cont.)
Calculate the quartiles and the interquartile range:

1. Arrange the observations in increasing order and locate the median M.
2. The first quartile Q1 is the median of the lower half of the data, excluding M.
3. The third quartile Q3 is the median of the upper half of the data, excluding M.
Measuring Spread: The Quartiles (Cont.)

Example: Here are the scores on the first exam in an introductory statistics course for 10 students:
80 73 92 85 75 98 93 55 80 90
Find the quartiles for these first-exam scores.
Solution: In order, the scores are:
55 73 75 80 80 85 90 92 93 98
The median is M = (80 + 85)/2 = 82.5.
Q1 = 75, the median of the first five numbers: 55, 73, 75, 80, 80.
Q3 = 92, the median of the last five numbers: 85, 90, 92, 93, 98.
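The median-of-halves rule used in this example can be sketched as below. The function names `median_of` and `quartiles` are illustrative. Statistical software often uses other quartile conventions, so library routines may return slightly different values for the same data.

```python
def median_of(ordered):
    """Median of an already sorted list."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

def quartiles(values):
    """Return (Q1, M, Q3) using the median of each half, excluding M
    itself from both halves when the number of observations is odd."""
    ordered = sorted(values)
    n = len(ordered)
    lower = ordered[: n // 2]          # observations below the median position
    upper = ordered[(n + 1) // 2 :]    # observations above the median position
    return median_of(lower), median_of(ordered), median_of(upper)

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
q1, m, q3 = quartiles(scores)
print(q1, m, q3)
```

For the exam scores this reproduces Q1 = 75, M = 82.5, and Q3 = 92.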
The Five-Number Summary

The five-number summary of a distribution consists of
 The smallest observation (Min)
 The first quartile (Q1)
 The median (M)
 The third quartile (Q3)
 The largest observation (Max)
written in order from smallest to largest.

Minimum, Q1, M, Q3, Maximum
Boxplots

A boxplot is a graph of the five-number summary.
 Draw a central box from Q1 to Q3.
 Draw a line inside the box to mark the median M.
 Extend lines from the box out to the minimum and maximum
values that are not outliers.
Boxplots (Cont…)

Example: Here are the scores on the first exam in an introductory statistics course for 10 students:
80 73 92 85 75 98 93 55 80 90
Make a boxplot for these first-exam scores.
Solution: In order, the scores are:
55, 73, 75, 80, 80, 85, 90, 92, 93, 98
Min = 55
Q1 = 75
M = 82.5
Q3 = 92
Max = 98
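The five numbers that a boxplot draws can be computed directly. The sketch below assumes the same median-of-halves rule for the quartiles; `five_number_summary` is an illustrative name.

```python
from statistics import median

def five_number_summary(values):
    """Return (Min, Q1, M, Q3, Max), with quartiles taken as the medians
    of the lower and upper halves of the sorted data."""
    ordered = sorted(values)
    n = len(ordered)
    lower = ordered[: n // 2]          # lower half, excluding M when n is odd
    upper = ordered[(n + 1) // 2 :]    # upper half, excluding M when n is odd
    return (ordered[0], median(lower), median(ordered),
            median(upper), ordered[-1])

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
summary = five_number_summary(scores)
print(summary)
```

For the exam scores the result is (55, 75, 82.5, 92, 98): the box runs from 75 to 92 with a line at 82.5.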
Comparing Boxplots to Histograms
Boxplots and skewed data

[Figure: Boxplots for a symmetric and a right-skewed distribution. Vertical axis: years until death (0 to 15). Groups: Disease X and Multiple Myeloma.]

Boxplots show symmetry or skew.
Suspected Outliers: 1.5 × IQR Rule

 Outliers are troublesome data points, and it is important to be able to identify them.
 The interquartile range IQR is the distance between the first and third quartiles,
IQR = Q3 − Q1

 IQR is used as part of a rule of thumb for identifying outliers.
The 1.5 × IQR Rule for Outliers
Call an observation a suspected outlier if it falls more than 1.5 × IQR above the third quartile or below the first quartile.

 Suspected low outlier: any value < Q1 − 1.5 × IQR
 Suspected high outlier: any value > Q3 + 1.5 × IQR
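As a rough sketch, the rule can be applied to the ten exam scores from the earlier examples (Q1 = 75, Q3 = 92). The function name `suspected_outliers` and the extra score of 20 are hypothetical, introduced only for illustration.

```python
def suspected_outliers(values, q1, q3):
    """Return the fences and the values flagged by the 1.5 x IQR rule."""
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    flagged = [v for v in values if v < low_fence or v > high_fence]
    return low_fence, high_fence, flagged

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
low, high, flagged = suspected_outliers(scores, q1=75, q3=92)
print(low, high, flagged)

# Hypothetical extra score of 20, checked against the same fences
# (the quartiles are held fixed here purely for illustration).
_, _, flagged_extra = suspected_outliers(scores + [20], q1=75, q3=92)
print(flagged_extra)
```

Here IQR = 17, so the fences are 49.5 and 117.5. None of the ten scores is flagged, not even the lowest score of 55, while the hypothetical score of 20 would be.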
Suspected Outliers: 1.5 × IQR Rule (Cont..)

Individual #25 has a value of 7.9 years, which is 3.55 years above the third quartile. This is more than 1.5 × IQR = 3.225 years. Thus, individual #25 is a suspected outlier.
Suspected Outliers: 1.5 × IQR Rule (Cont..)

 Modified boxplots plot suspected outliers individually.

 The 8 largest call lengths are
438, 465, 479, 700, 700, 951, 1148, 2631
 They are plotted as individual points, though 2 of them are
identical and so do not appear separately.
Measuring Spread: The Standard Deviation

The most common measure of spread looks at how far each observation is from the mean. This measure is called the standard deviation.

 The standard deviation s measures roughly the average distance of the observations from their mean.
 It is calculated by
\[ s = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2} \]
 The average squared distance under the square root, s^2, is called the variance.
Calculating The Standard Deviation
1. Calculate the mean.
2. Calculate each deviation: deviation = observation − mean.
3. Square each deviation.
4. Calculate the sum of the squared deviations.
5. Divide by the degrees of freedom, df = n − 1; the result is the variance.
6. Calculate the square root of the variance; this is the standard deviation.

Variance = 52/(9 − 1) = 6.5
Standard deviation = √6.5 ≈ 2.55

xi        xi − mean        (xi − mean)^2
1         1 − 5 = −4       (−4)^2 = 16
3         3 − 5 = −2       (−2)^2 = 4
4         4 − 5 = −1       (−1)^2 = 1
4         4 − 5 = −1       (−1)^2 = 1
4         4 − 5 = −1       (−1)^2 = 1
5         5 − 5 = 0        (0)^2 = 0
7         7 − 5 = 2        (2)^2 = 4
8         8 − 5 = 3        (3)^2 = 9
9         9 − 5 = 4        (4)^2 = 16
Mean = 5  Sum = 0          Sum = 52
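The six steps can be followed literally in Python for the nine observations in the table. This is a minimal sketch with illustrative variable names; the standard library's `statistics.stdev` is included only as a cross-check.

```python
import math
from statistics import stdev

data = [1, 3, 4, 4, 4, 5, 7, 8, 9]

mean = sum(data) / len(data)                  # step 1: mean
deviations = [x - mean for x in data]         # step 2: observation - mean
squared = [d ** 2 for d in deviations]        # step 3: square each deviation
sum_squared = sum(squared)                    # step 4: sum of squared deviations
variance = sum_squared / (len(data) - 1)      # step 5: divide by df = n - 1
std_dev = math.sqrt(variance)                 # step 6: square root

print(mean, sum(deviations), sum_squared, variance, round(std_dev, 2))
print(round(stdev(data), 2))                  # library cross-check
```

The deviations sum to 0, the squared deviations sum to 52, the variance is 6.5, and the standard deviation rounds to 2.55, matching the table.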
Properties of The Standard Deviation
 s measures spread about the mean and should be used only
when the mean is the measure of center.

 s = 0 only when all observations have the same value and there
is no spread. Otherwise, s > 0.
 s is not resistant to outliers.
 s has the same units of measurement as the original
observations.
Choosing Measures of Center and Spread

We now have a choice between two descriptions for center and spread:
 Mean and Standard Deviation
 Median and Interquartile Range
 The median and IQR are usually better than the mean and
standard deviation for describing a skewed distribution or a
distribution with outliers.

 Use mean and standard deviation only for reasonably symmetric
distributions that don’t have outliers.
NOTE: Numerical summaries do not fully describe the shape of a
distribution. ALWAYS PLOT YOUR DATA FIRST!
Changing the Unit of Measurement
 Variables can be recorded in different units of measurement.
 Most often, one measurement unit is a linear transformation of another measurement unit: xnew = a + bx.
Example 1: If a distance x is measured in kilometers, the same distance in miles is xnew = 0.62x.
This transformation changes the units without changing the origin: a distance of 0 kilometers is the same as a distance of 0 miles.
Example 2: Temperatures can be expressed in degrees Fahrenheit or degrees Celsius. If x is a temperature in degrees Celsius, the same temperature in degrees Fahrenheit is xnew = 32 + 1.8x.
This transformation changes both the unit size and the origin of the measurements: the origin of the Celsius scale (0°C, the temperature at which water freezes) is 32° on the Fahrenheit scale.
Changing the Unit of Measurement (Cont…)

 Linear transformations do not change the basic shape of a distribution (skew, symmetry).
 But they do change the measures of center and spread:
 Multiplying each observation by a positive number b multiplies both measures of center (mean, median) and measures of spread (IQR, s) by b.
 Adding the same number a (positive or negative) to each observation adds a to measures of center and to quartiles, but it does not change measures of spread (IQR, s).
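These two rules can be checked numerically. The sketch below converts a small set of hypothetical Celsius readings (made up for illustration, not from the slides) to Fahrenheit with xnew = 32 + 1.8x and compares the summaries.

```python
import math
from statistics import mean, median, stdev

# Hypothetical temperature readings in degrees Celsius.
celsius = [10, 12, 15, 18, 25]

a, b = 32, 1.8                          # xnew = a + b * x
fahrenheit = [a + b * x for x in celsius]

# Measures of center are shifted by a and scaled by b.
print(mean(fahrenheit), a + b * mean(celsius))
print(median(fahrenheit), a + b * median(celsius))

# The standard deviation is scaled by b only; adding a has no effect.
print(stdev(fahrenheit), b * stdev(celsius))
```

Each pair of printed values agrees up to floating-point rounding: the center moves to a + b × (old center), while the spread is only multiplied by b.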

 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 

Describing Distributions with Numbers

  • 1. Introduction to Statistics & Probability. Chapter 1: Looking at Data—Distributions (Part 2). 1.2 Describing Distributions with Numbers. Dr. Nahid Sultana
  • 2. 1.2 Describing Distributions with Numbers. Objectives:
      - Measures of center: mean, median
      - Measures of spread: quartiles, standard deviation
      - Five-number summary and boxplot
      - IQR and outliers
      - Choosing among summary statistics
      - Changing the unit of measurement
  • 3. Measures of center: The Mean
      - The most common measure of center is the arithmetic average, also called the mean or sample mean.
      - To calculate the mean, add all values, then divide by the number of individuals.
      - It is the "center of mass."
      - If the n observations are $x_1, x_2, x_3, \ldots, x_n$, their mean is
        $\bar{x} = \dfrac{\text{sum of observations}}{n} = \dfrac{x_1 + x_2 + \cdots + x_n}{n}$,
        or in more compact notation, $\bar{x} = \dfrac{1}{n} \sum_{i=1}^{n} x_i$.
  • 4. Measures of center: The Mean (cont.)
      Find the mean: Here are the scores on the first exam in an introductory statistics course for 10 students:
        80  73  92  85  75  98  93  55  80  90
      Find the mean first-exam score for these students.
      Solution: $\bar{x} = \dfrac{80 + 73 + 92 + 85 + 75 + 98 + 93 + 55 + 80 + 90}{10} = \dfrac{821}{10} = 82.1$
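The arithmetic on this slide can be checked with a few lines of Python. This is a minimal sketch; the function name `mean` and the variable `scores` are chosen here for illustration and are not part of the slides.

```python
# First-exam scores for the 10 students, as listed on the slide.
scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]


def mean(values):
    """Arithmetic mean: the sum of the observations divided by their count."""
    return sum(values) / len(values)


print(sum(scores))   # 821
print(mean(scores))  # 82.1
```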
  • 5. Measuring Center: The Median
      - Another common measure of center is the median.
      - The median M is the midpoint of a distribution, the number such that half of the observations are smaller and the other half are larger.
      To find the median of a distribution:
        1. Arrange all observations from smallest to largest.
        2. If the number of observations n is odd, the median M is the center observation in the ordered list.
        3. If the number of observations n is even, the median M is the average of the two center observations in the ordered list.
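  • 6. Measuring Center: The Median (cont.)
      Find the median: Here are the scores on the first exam in an introductory statistics course for 10 students:
        80  73  92  85  75  98  93  55  80  90
      Find the median first-exam score for these students.
      Solution: In order, the scores are 55, 73, 75, 80, 80, 85, 90, 92, 93, 98. Since n = 10 is even, the median is the average of the two center observations: M = (80 + 85)/2 = 82.5.
      Note: The location of the median is (n + 1)/2 in the sorted list. Here (10 + 1)/2 = 5.5, halfway between the 5th and 6th observations.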
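The three-step recipe for the median translates directly into code. The sketch below is one possible implementation; the function name `median` is an illustrative choice, and Python's standard `statistics.median` follows the same rule.

```python
def median(values):
    """Median by the slide's recipe: sort, then take the center
    observation (n odd) or the average of the two center ones (n even)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
print(sorted(scores))  # [55, 73, 75, 80, 80, 85, 90, 92, 93, 98]
print(median(scores))  # 82.5
```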
  • 8. Comparing Mean and Median (cont.)
      - The mean and the median are equal when the distribution is exactly symmetric.
      - In a skewed distribution, the mean is usually farther out in the long tail than the median.
      - The median is a measure of center that is resistant to skew and outliers. The mean is not.
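A small experiment illustrates resistance. The sketch below uses the exam scores from the earlier slides and a hypothetical recording error (the lowest score entered as 5 instead of 55); that altered value is an assumption made only for illustration.

```python
import statistics

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]

# Hypothetical data-entry error: the lowest score 55 is recorded as 5.
with_error = [5 if x == 55 else x for x in scores]

mean_before = statistics.mean(scores)
mean_after = statistics.mean(with_error)
median_before = statistics.median(scores)
median_after = statistics.median(with_error)

# The mean moves by 50/10 = 5 points; the median does not move at all.
print(mean_before, mean_after)
print(median_before, median_after)
```

One extreme value drags the mean toward it, while the median depends only on the center of the ordered list.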
  • 9. Measuring Spread: The Quartiles
      A measure of center alone can be misleading. A useful numerical description of a distribution requires both a measure of center and a measure of spread.
      - We describe the spread or variability of a distribution by giving several percentiles.
      - The median divides the data into two parts; half of the observations are above the median and half are below it. We could call the median the 50th percentile.
      - The lower quartile (first quartile, Q1) is the median of the lower half of the data; the upper quartile (third quartile, Q3) is the median of the upper half of the data.
      - Together with the median, the quartiles divide the data into four equal parts; 25% of the data are in each part.
  • 10. Measuring Spread: The Quartiles (cont.)
      To calculate the quartiles:
        1. Arrange the observations in increasing order and locate the median M.
        2. The first quartile Q1 is the median of the lower half of the data, excluding M.
        3. The third quartile Q3 is the median of the upper half of the data, excluding M.
  • 11. Measuring Spread: The Quartiles (cont.)
      Example: Here are the scores on the first exam in an introductory statistics course for 10 students:
        80  73  92  85  75  98  93  55  80  90
      Find the quartiles for these first-exam scores.
      Solution: In order, the scores are:
        55  73  75  80  80  85  90  92  93  98
      The median is M = (80 + 85)/2 = 82.5.
      Q1 = 75, the median of the first five numbers: 55, 73, 75, 80, 80.
      Q3 = 92, the median of the last five numbers: 85, 90, 92, 93, 98.
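The quartile rule used in this example (median of each half, excluding M when n is odd) can be sketched as follows. The function names are illustrative. Library routines such as `statistics.quantiles` or NumPy's percentile functions use interpolation conventions that may give slightly different values, so this sketch implements the slide's rule directly rather than calling them.

```python
def median(values):
    """Median of a list: center value, or average of the two center values."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def quartiles(values):
    """Return (Q1, M, Q3) using the 'median of each half' rule.

    When n is odd, the overall median M is left out of both halves.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]
    return median(lower), median(ordered), median(upper)


scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]
q1, m, q3 = quartiles(scores)
print(q1, m, q3)  # 75 82.5 92
```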
  • 12. The Five-Number Summary
      The five-number summary of a distribution consists of
      - the smallest observation (Min)
      - the first quartile (Q1)
      - the median (M)
      - the third quartile (Q3)
      - the largest observation (Max)
      written in order from smallest to largest: Minimum, Q1, M, Q3, Maximum.
  • 13. Boxplots
      A boxplot is a graph of the five-number summary.
      - Draw a central box from Q1 to Q3.
      - Draw a line inside the box to mark the median M.
      - Extend lines from the box out to the minimum and maximum values that are not outliers.
  • 14. Boxplots (cont.)
      Example: Here are the scores on the first exam in an introductory statistics course for 10 students:
        80  73  92  85  75  98  93  55  80  90
      Make a boxplot for these first-exam scores.
      Solution: In order, the scores are: 55, 73, 75, 80, 80, 85, 90, 92, 93, 98
        Min = 55   Q1 = 75   M = 82.5   Q3 = 92   Max = 98
  • 15. Comparing Boxplots to Histograms [figure]
  • 16. Boxplots and skewed data
      [Figure: boxplots for a symmetric and a right-skewed distribution, labeled Disease X and Multiple Myeloma; vertical axis: years until death, 0 to 15.]
      Boxplots show symmetry or skew.
  • 17. Suspected Outliers: The 1.5 × IQR Rule
      - Outliers are troublesome data points, and it is important to be able to identify them.
      - The interquartile range IQR is the distance between the first and third quartiles: IQR = Q3 − Q1.
      - IQR is used as part of a rule of thumb for identifying outliers.
      The 1.5 × IQR Rule for Outliers: Call an observation an outlier if it falls more than 1.5 × IQR above the third quartile or below the first quartile.
      - Suspected low outlier: any value < Q1 − 1.5 × IQR
      - Suspected high outlier: any value > Q3 + 1.5 × IQR
  • 18. Suspected Outliers: The 1.5 × IQR Rule (cont.)
      Individual #25 has a value of 7.9 years, which is 3.55 years above the third quartile. This is more than 1.5 × IQR = 3.225 years. Thus, individual #25 is a suspected outlier.
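The rule can be sketched in code as two cutoffs ("fences") computed from the quartiles. The function names `iqr_fences` and `suspected_outliers` are illustrative choices, and the example applies the rule to the exam scores from the earlier slides (Q1 = 75, Q3 = 92) rather than to the survival data shown here, which is not listed in full.

```python
def iqr_fences(q1, q3, k=1.5):
    """Return the (low, high) cutoffs of the k x IQR rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def suspected_outliers(values, q1, q3):
    """Values falling below the low fence or above the high fence."""
    low, high = iqr_fences(q1, q3)
    return [x for x in values if x < low or x > high]


scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]

# Quartiles taken from the worked example: Q1 = 75, Q3 = 92, so IQR = 17.
low, high = iqr_fences(75, 92)
print(low, high)                           # 49.5 117.5
print(suspected_outliers(scores, 75, 92))  # []
```

The lowest score, 55, lies above the low fence of 49.5, so none of the exam scores is flagged.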
  • 19. Suspected Outliers: The 1.5 × IQR Rule (cont.)
      - Modified boxplots plot suspected outliers individually.
      - The 8 largest call lengths are 438, 465, 479, 700, 700, 951, 1148, 2631.
      - They are plotted as individual points, though 2 of them are identical and so do not appear separately.
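  • 20. Measuring Spread: The Standard Deviation
      The most common measure of spread looks at how far each observation is from the mean. This measure is called the standard deviation.
      - The standard deviation s measures, roughly, the average distance of the observations from their mean.
      - It is calculated by finding an average of the squared distances and then taking the square root:
        $s = \sqrt{\dfrac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2}$
      - This average squared distance is called the variance: $s^2 = \dfrac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2$.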
  • 21. Calculating the Standard Deviation
        1. Calculate the mean.
        2. Calculate each deviation: deviation = observation − mean.
        3. Square each deviation.
        4. Calculate the sum of the squared deviations.
        5. Divide by the degrees of freedom, df = n − 1. The result is the variance.
        6. Take the square root of the variance. The result is the standard deviation.

        x_i    x_i − mean      (x_i − mean)^2
        1      1 − 5 = −4      (−4)^2 = 16
        3      3 − 5 = −2      (−2)^2 = 4
        4      4 − 5 = −1      (−1)^2 = 1
        4      4 − 5 = −1      (−1)^2 = 1
        4      4 − 5 = −1      (−1)^2 = 1
        5      5 − 5 = 0       (0)^2 = 0
        7      7 − 5 = 2       (2)^2 = 4
        8      8 − 5 = 3       (3)^2 = 9
        9      9 − 5 = 4       (4)^2 = 16
        Mean = 5   Sum = 0     Sum = 52

      Variance = 52/(9 − 1) = 6.5
      Standard deviation = √6.5 ≈ 2.55
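The six steps can be followed line by line in Python. This is a minimal sketch using the nine values from the table; the function names are illustrative, and the standard library's `statistics.variance` and `statistics.stdev` compute the same n − 1 versions.

```python
import math

data = [1, 3, 4, 4, 4, 5, 7, 8, 9]


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    m = sum(values) / n                      # step 1: mean
    deviations = [x - m for x in values]     # step 2: deviations
    squared = [d ** 2 for d in deviations]   # step 3: squares
    return sum(squared) / (n - 1)            # steps 4 and 5


def sample_std(values):
    """Square root of the sample variance (step 6)."""
    return math.sqrt(sample_variance(values))


print(sample_variance(data))       # 6.5
print(round(sample_std(data), 2))  # 2.55
```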
  • 22. Properties of the Standard Deviation
      - s measures spread about the mean and should be used only when the mean is the measure of center.
      - s = 0 only when all observations have the same value and there is no spread. Otherwise, s > 0.
      - s is not resistant to outliers.
      - s has the same units of measurement as the original observations.
  • 23. Choosing Measures of Center and Spread
      We now have a choice between two descriptions of center and spread:
      - Mean and standard deviation
      - Median and interquartile range
      The median and IQR are usually better than the mean and standard deviation for describing a skewed distribution or a distribution with outliers.
      Use the mean and standard deviation only for reasonably symmetric distributions that don't have outliers.
      NOTE: Numerical summaries do not fully describe the shape of a distribution. ALWAYS PLOT YOUR DATA FIRST!
  • 24. Changing the Unit of Measurement
      - Variables can be recorded in different units of measurement.
      - Most often, one measurement unit is a linear transformation of another measurement unit: x_new = a + bx.
      Example 1: If a distance x is measured in kilometers, the same distance in miles is x_new = 0.62x. This transformation changes the units without changing the origin: a distance of 0 kilometers is the same as a distance of 0 miles.
      Example 2: Temperatures can be expressed in degrees Fahrenheit or degrees Celsius. A temperature of x degrees Celsius is x_new = 32 + 1.8x degrees Fahrenheit. This transformation changes both the unit size and the origin of the measurements: the origin of the Celsius scale (0°C, the temperature at which water freezes) is 32° on the Fahrenheit scale.
  • 25. Changing the Unit of Measurement (cont.)
      - Linear transformations do not change the basic shape of a distribution (skew, symmetry).
      - But they do change the measures of center and spread:
        - Multiplying each observation by a positive number b multiplies both measures of center (mean, median) and measures of spread (IQR, s) by b.
        - Adding the same number a (positive or negative) to each observation adds a to measures of center and to quartiles, but it does not change measures of spread (IQR, s).
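These two rules can be checked numerically. The sketch below applies a linear transformation x_new = a + bx to the exam scores from the earlier slides; the values a = 5 and b = 2 are hypothetical, chosen only to make the effect easy to see.

```python
import math
import statistics

scores = [80, 73, 92, 85, 75, 98, 93, 55, 80, 90]

# Hypothetical linear transformation x_new = a + b * x.
a, b = 5, 2
transformed = [a + b * x for x in scores]

# Center: both the shift a and the scale b carry through.
mean_ok = math.isclose(statistics.mean(transformed),
                       a + b * statistics.mean(scores))
median_ok = math.isclose(statistics.median(transformed),
                         a + b * statistics.median(scores))

# Spread: only the scale b carries through; the shift a drops out.
stdev_ok = math.isclose(statistics.stdev(transformed),
                        b * statistics.stdev(scores))

print(mean_ok, median_ok, stdev_ok)  # True True True
```

The comparisons use `math.isclose` because floating-point results can differ in the last digit even when the underlying identity is exact.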