SlideShare une entreprise Scribd logo
1  sur  37
Introduction to Statistics for Built
Environment
Course Code: AED 1222
Compiled by
DEPARTMENT OF ARCHITECTURE AND ENVIRONMENTAL DESIGN (AED)
CENTRE FOR FOUNDATION STUDIES (CFS)
INTERNATIONAL ISLAMIC UNIVERSITY MALAYSIA
Lecture 7
Measures of central tendency
Today’s Lecture:
Measures of central tendency for grouped and
ungrouped data:
 The arithmetic mean/trimmed mean
 The median
 The mode
 Summary of comparative characteristics
What is/are Measures of Central Tendency?
●Usually called the average with the purpose to
summarize in a single value: the typical size, middle
property, or central location of a set of values.
Measures of Central Tendency
●Measures of Central Tendency is a single value
situated at the centre of a data and can be taken
as a summary value for that data set.
●The three most common measures of central
tendency are the mean, median and mode.
Center and Location
Mean Mode
An overview
An overview of common measures of central tendency and location:
Median
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
N
x
n
x
x
N
i
i
n
i
i
∑
∑
=
=
=µ
=
1
1
The arithmetic mean
●When people use the word average, they are usually
referring to the arithmetic mean.
●The arithmetic mean is the most commonly used measure
of central tendency.
●The mean is the sum of all scores/data divided by the
number of scores/data.
●Which is the best single number to describe a group of
scores.
●Called meu for population and x bar for sample mean.
What is/are Mean?
µ x
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• The Mean is the arithmetic average of data values
• Mean = sum of values divided by the number of values
– Population mean
– Sample mean
n = Sample Size
N = Population Size
n
x
x
n
i
i∑=
= 1
N
x
N
i
i∑=
= 1
µ
Formula of :
Formula of :
(meu)
(x bar)
The arithmetic mean cont.
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• The Mean affected by extreme values (outliers)
0 1 2 3 4 5 6 7 8 9 10
Mean = 3
0 1 2 3 4 5 6 7 8 9 10
Mean = 4
3
5
15
5
54321
==
++++
= 4
5
20
5
104321
==
++++
=
Example 1: (no outliers) Example 2: (with outliers)
The arithmetic mean cont.
OUTLIERS
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
Example (Cont.):
DATA
ARRAY
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Sorted raw data from low to high:
Insulation manufacturer 20 days high temperature record.
• Computing the Mean for ungrouped data
The arithmetic mean cont.
20
5853464443413837353230272726242421171312 +++++++++++++++++++
=
20
648
=
=x
32.4
n
x
x
n
i
i∑=
= 1Formula of :
●The same process in principle.
●However, since the compression of data in a
frequency table results in the loss of actual values of
the observations in each class, it becomes necessary
to make an assumption about these values.
The assumption is that every observation in a class
has a value equal to the class mid-point.
• Computing the Mean for grouped data
The arithmetic mean cont.
No. of Liters sold No. of sales staff (f) Class mid-points (m) fm
80 and less than 90 2 85 170
90 and less than 100 6 95 570
100 and less than 110 10 105 1050
110 and less than 120 14 115 1610
120 and less than 130 9 125 1125
130 and less than 140 7 135 945
140 and less than 150 2 145 290
f 50 fm 5760
=x
= 5760/50
The arithmetic mean cont.
• Computing the Mean for grouped data
Example :
Formula of :
= 115.2 Liters sold
●The mean is a good measure for roughly symmetric
distributions.
●Can be misleading in skewed distributions since it can
be greatly influenced by extreme values (outliers), and
thus it is not the most appropriate measure of central
tendency for very skewed distributions.
●This problem associated with the calculation of the
arithmetic mean can be overcome by relying on a slightly
modified measure of central tendency: the trimmed
mean.
The arithmetic mean cont.
The trimmed mean
●The trimmed mean is calculated by “trimming” or dropping the
smallest and largest numbers from the data set and calculating
the mean of the remaining numbers.
There is no rule determining the number of values to be
trimmed. This rather depends on the data available.
For example, a 5% trimmed mean would be calculated by
dropping the smallest 5% and the largest 5% of the data set and
computing the mean for the remaining 90% of the original data.
●The trimmed mean is a compromise between the arithmetic
(ordinary) mean and the median. Why?
The median
●The median is a measure of central tendency that
occupies/lies the middle position in an array of values.
Half (50%) the data items fall below the median,
and another half (50%) are above that value.
●The median position (not the median value) can be
found using the formula: i=(n+1)/2 or i=(1/2)n
where ‘n’ is the number of observations or values in a
data set.
What is/are Median?
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• In an ordered array, the median is the
“middle” number, i.e., the number that splits the
distribution in half
• The median is not affected by extreme values
(outliers)
The median cont.
Example 1: (no outliers) Example 2: (with outliers)
0 1 2 3 4 5 6 7 8 9 10
Median = 3
0 1 2 3 4 5 6 7 8 9 10
Median = 3
OUTLIERS
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• To find the median, sort the n data values
from low to high (sorted data is called a data
array)
• Find the value in the i = (1/2)n position
• The ith
position is called the Median Index
Point.
– If i is not an integer, round up to next highest
integer
The median cont.
For Example:
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• Note that n = 13
• Find the i = (1/2)n position:
i = (1/2)(13) = 6.5
• Since 6.5 is not an integer, round up to 7
• The median is the value in the 7th position:
Md = 12
Data array:
4, 4, 5, 5, 9, 11, 12, 14, 16, 19, 22, 23, 24
The median cont.
• Computing the Median for ungrouped data
●If using the formula results in a non-integer
value, we take the average of the two nearest
numbers.
For example:
n=18,
based on the formula, the median position is:
i=(18+1)/2=9.5,
in this case we take the average of the 9th
and 10th
values as the median of the data set.
The median cont.
More Example:
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
Example (Cont.):
DATA
ARRAY
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Sorted raw data from low to high:
Insulation manufacturer 20 days high temperature record.
• Computing the Median for ungrouped data
The median cont.
2
120+
=
2
62
=
i =10.5
Formula of : i=(n+1)/2
Md = 31
Median = 31
2
3230+
=Find Average of :
• Computing the Median for grouped data
The median cont.
●Since the actual values of a data set are lost
when a distribution is constructed, it is only
possible to approximate the median value for
grouped data.
●The median for grouped data can be estimated
using the following formula:
Where:
Bl = lower boundary of class containing median
n = sample size
cfp= cumulative frequency of classes preceding
class containing the median
fm = number of observations in class containing
the median
i = width of the interval containing the median
Computing the median for grouped data cont.
i)
f m
cf p-
2
n
(+Bl=Med
Formula of :
No. of Liters sold No. of sales staff (f) Cumulative frequency
(cf)
80 and less than 90 2 2
90 and less than 100 6 8
100 and less than 110 10 18
110 and less than 120 14 32
120 and less than 130 9 41
130 and less than 140 7 48
140 and less than 150 2 50
Computing the median for grouped data cont.
Compute the median for the above data set.
i)
f m
cf p-
2
n
(+Bl=Med
10
14
18
50
110 )
-
2(+=
10
14
7
110 )(+=
105.0110 )(+=
= 115 Liters sold
5110+=
Answer:
The mode
●The mode is the most commonly occurring value
in a data set.
A distribution may have one mode, two modes (bi-
modal) or more modes (multi-modal). It is also
possible for a distribution to have no mode.
●The mode may be an important measure to a
clothing manufacturer who must decide how many
dresses of each size to make. What is most
commonly purchased size?
What is/are Mode?
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• A measure of location.
• The value that occurs most often.
• Not affected by extreme values (outliers)
• Used for either numerical or categorical data.
• There may be no mode
• There may be several modes
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Mode = 5
0 1 2 3 4 5 6
No Mode
The mode cont.
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
Example (Cont.):
DATA
ARRAY
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Insulation manufacturer 20 days high temperature record.
• Estimating the Mode for ungrouped data
The mode cont.
Mode = 24 Mode = 27
Estimating the mode for grouped data
●When actual data values are unknown, the class
in a distribution with the largest frequency is often
referred to as the modal class.
●The mode may then be defined to be the mid-
point of that class.
●If two or more classes share the distinction of
having the largest frequency, then there are two or
more mid-point values representing two or more
modes.
Where:
L = lower boundary of class containing the mode
f0 = frequency of class containing the mode
f1 = frequency of class preceding the class containing
the mode
f2= frequency of class after the class containing the
mode
c = size of the class containing the mode
Computing the mode for grouped data cont.
Formula of :
c
f-ff-f
f-f
+L=Mode








+ )20()10(
10
No. of Liters sold No. of sales staff (f) Cumulative frequency
(cf)
80 and less than 90 2 2
90 and less than 100 6 8
100 and less than 110 10 18
110 and less than 120 14 32
120 and less than 130 9 41
130 and less than 140 7 48
140 and less than 150 2 50
Estimating the mode for grouped data cont.
Estimate the mode for the above data
= 114.4 Liters sold
44.4110+=
Answer:
c
f-ff-f
f-f
+L=Mode








+ )20()10(
10
10
)914()1014(
1014
110 





+ --
-
+=
10
)5()4(
4
110 





+
+=
10
9
4
110 



+=
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
• Five houses on a hill by the beach
$2,000 K
$500 K
$300 K
$100 K
$100 K
House Prices:
RM 2,000,000
RM 500,000
RM 300,000
RM 100,000
RM 100,000
Review Example
RM 2m
RM 500k
RM 300k
RM 100k
RM 100k
Business Statistics: A Decision-
Making Approach, 7e © 2008
Prentice-Hall, Inc.
Summary Statistics
• Mean: (RM 3,000,000 / 5)
= RM 600,000
• Median: middle value of ranked data
= RM 300,000
• Mode: most frequent value
= RM 100,000
House Prices:
RM2,000,000
500,000
300,000
100,000
100,000
Sum 3,000,000
Review Example cont.
Which measure to use?
●Not all measures are appropriate for all kinds of variables.
●Nominal data (e.g. gender, race)>> mode is the only valid
measure.
●Ordinal data (e.g. salary categories)>> mode & median can be
used.
• When to use the arithmetic mean?
– The best measure for continuous data.
• When to use the median?
– When you know that a distribution is skewed.
– When you have a small number of subjects.
• When to use the mode?
– Only when describing discrete categorical data.
Which measure to use? cont.
Summary of comparative characteristics
The arithmetic mean:
1. It is the most familiar and most widely used measure.
2. It is a measure that is affected by the value of every
observation in the data set.
3. Its value may be distorted too much by a relatively few
extreme values (outliers). And thus can lose its
representative quality in badly skewed data. The
trimmed mean can help overcome such a problem.
4. It can not be computed from a frequency distribution
with an open ended class.
The median:
1. It is easy to define and easy to understand.
2. It is affected by the number of observations but not by
the values of these observations. Thus extremely high
or low values (outliers)do not distort the median.
3. It is frequently used in badly skewed distributions.
4. It may be computed in an open-ended distribution,
since the median value is located in the median class
interval which is highly unlikely to be an open-ended
interval.
Summary of comparative characteristics
The mode:
1. It is generally a less widely used measure than the
mean and median.
2. It may not exist in some sets of data, or there may be
more than one mode in other data sets.
3. It is not affected by extreme values (outliers) in a
distribution.
Summary of comparative characteristics
Next class…
The following topics will be discussed:
 Measures of variability / dispersion (Part I):
 The range
 Quartiles & the Interquartile range
 Percentiles
 The five number summary

Contenu connexe

Tendances

Measure OF Central Tendency
Measure OF Central TendencyMeasure OF Central Tendency
Measure OF Central Tendency
Iqrabutt038
 
Hypothesis testing an introduction
Hypothesis testing an introductionHypothesis testing an introduction
Hypothesis testing an introduction
Geetika Gulyani
 
Class lecture notes # 2 (statistics for research)
Class lecture notes # 2 (statistics for research)Class lecture notes # 2 (statistics for research)
Class lecture notes # 2 (statistics for research)
Harve Abella
 

Tendances (20)

Type of data
Type of dataType of data
Type of data
 
Variance & standard deviation
Variance & standard deviationVariance & standard deviation
Variance & standard deviation
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Central tendency
Central tendencyCentral tendency
Central tendency
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Introduction to Statistics and Probability
Introduction to Statistics and ProbabilityIntroduction to Statistics and Probability
Introduction to Statistics and Probability
 
Math 102- Statistics
Math 102- StatisticsMath 102- Statistics
Math 102- Statistics
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.ppt
 
Probability concept and Probability distribution
Probability concept and Probability distributionProbability concept and Probability distribution
Probability concept and Probability distribution
 
Quartile
QuartileQuartile
Quartile
 
Measures of central tendency and dispersion
Measures of central tendency and dispersionMeasures of central tendency and dispersion
Measures of central tendency and dispersion
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Measure OF Central Tendency
Measure OF Central TendencyMeasure OF Central Tendency
Measure OF Central Tendency
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Hypothesis testing an introduction
Hypothesis testing an introductionHypothesis testing an introduction
Hypothesis testing an introduction
 
Class lecture notes # 2 (statistics for research)
Class lecture notes # 2 (statistics for research)Class lecture notes # 2 (statistics for research)
Class lecture notes # 2 (statistics for research)
 
Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 

En vedette

Measures Of Central Tendency
Measures Of Central TendencyMeasures Of Central Tendency
Measures Of Central Tendency
jchancew
 
Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency
Jan Nah
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
Chie Pegollo
 
2 4 measures of center
2 4 measures of center2 4 measures of center
2 4 measures of center
Ken Kretsch
 
Chapter 03
Chapter 03Chapter 03
Chapter 03
bmcfad01
 
Mean Median Mode
Mean Median ModeMean Median Mode
Mean Median Mode
hiratufail
 
Descriptions of data statistics for research
Descriptions of data   statistics for researchDescriptions of data   statistics for research
Descriptions of data statistics for research
Harve Abella
 
Stat3 central tendency & dispersion
Stat3 central tendency & dispersionStat3 central tendency & dispersion
Stat3 central tendency & dispersion
Forensic Pathology
 

En vedette (20)

Measures Of Central Tendency
Measures Of Central TendencyMeasures Of Central Tendency
Measures Of Central Tendency
 
Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
P5 ungrouped data
P5 ungrouped dataP5 ungrouped data
P5 ungrouped data
 
Grouped Mean Median Mode
Grouped Mean Median ModeGrouped Mean Median Mode
Grouped Mean Median Mode
 
2 4 measures of center
2 4 measures of center2 4 measures of center
2 4 measures of center
 
Chapter 03
Chapter 03Chapter 03
Chapter 03
 
Mode
ModeMode
Mode
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Lesson Plan- Measures of Central tendency of Data
Lesson Plan- Measures of Central tendency of DataLesson Plan- Measures of Central tendency of Data
Lesson Plan- Measures of Central tendency of Data
 
3.3 Mean, Median, Mode, Formulas
3.3 Mean, Median, Mode, Formulas3.3 Mean, Median, Mode, Formulas
3.3 Mean, Median, Mode, Formulas
 
Mean Median Mode
Mean Median ModeMean Median Mode
Mean Median Mode
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Descriptions of data statistics for research
Descriptions of data   statistics for researchDescriptions of data   statistics for research
Descriptions of data statistics for research
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Stat3 central tendency & dispersion
Stat3 central tendency & dispersionStat3 central tendency & dispersion
Stat3 central tendency & dispersion
 
STATISTICS | Measures Of Central Tendency
STATISTICS | Measures Of Central TendencySTATISTICS | Measures Of Central Tendency
STATISTICS | Measures Of Central Tendency
 

Similaire à Lesson 6 measures of central tendency

Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
JuliusRomano3
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersion
Gilbert Joseph Abueg
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
kreshajay
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
Yee Bee Choo
 

Similaire à Lesson 6 measures of central tendency (20)

Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
Mean_Median_Mode.ppthhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh...
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersion
 
Krupa rm
Krupa rmKrupa rm
Krupa rm
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptx
 
Measures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and ShapesMeasures of Central Tendency, Variability and Shapes
Measures of Central Tendency, Variability and Shapes
 
Statistical Methods: Measures of Central Tendency.pptx
Statistical Methods: Measures of Central Tendency.pptxStatistical Methods: Measures of Central Tendency.pptx
Statistical Methods: Measures of Central Tendency.pptx
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notes
 
Topic 8a Basic Statistics
Topic 8a Basic StatisticsTopic 8a Basic Statistics
Topic 8a Basic Statistics
 
Measures in Statistics. kjc.pptx
Measures in Statistics. kjc.pptxMeasures in Statistics. kjc.pptx
Measures in Statistics. kjc.pptx
 
Chap03 numerical descriptive measures
Chap03 numerical descriptive measuresChap03 numerical descriptive measures
Chap03 numerical descriptive measures
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive Measures
 
1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf
 
Business Statistics Chapter 3
Business Statistics Chapter 3Business Statistics Chapter 3
Business Statistics Chapter 3
 
Empirics of standard deviation
Empirics of standard deviationEmpirics of standard deviation
Empirics of standard deviation
 
Measures of Central Tendency.ppt
Measures of Central Tendency.pptMeasures of Central Tendency.ppt
Measures of Central Tendency.ppt
 
Engineering Data Analysis-ProfCharlton
Engineering Data  Analysis-ProfCharltonEngineering Data  Analysis-ProfCharlton
Engineering Data Analysis-ProfCharlton
 
More about data science post.pdf
More about data science post.pdfMore about data science post.pdf
More about data science post.pdf
 
5.DATA SUMMERISATION.ppt
5.DATA SUMMERISATION.ppt5.DATA SUMMERISATION.ppt
5.DATA SUMMERISATION.ppt
 

Plus de nurun2010

Ibe quantity surveyor 2013
Ibe quantity surveyor 2013Ibe quantity surveyor 2013
Ibe quantity surveyor 2013
nurun2010
 
Conservation updated (1)
Conservation updated (1)Conservation updated (1)
Conservation updated (1)
nurun2010
 
Week 8 landscape architecture
Week 8 landscape architectureWeek 8 landscape architecture
Week 8 landscape architecture
nurun2010
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscore
nurun2010
 
Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2
nurun2010
 
Lesson 7 measures of dispersion part 1
Lesson 7 measures of dispersion part 1Lesson 7 measures of dispersion part 1
Lesson 7 measures of dispersion part 1
nurun2010
 
Ibe interiordesign
Ibe interiordesignIbe interiordesign
Ibe interiordesign
nurun2010
 
Week architecture updated
Week  architecture updatedWeek  architecture updated
Week architecture updated
nurun2010
 
Aed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd partAed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd part
nurun2010
 
Aed1222 lesson 6
Aed1222 lesson 6Aed1222 lesson 6
Aed1222 lesson 6
nurun2010
 
Aed1313 ibe week 5 urp
Aed1313 ibe week 5 urpAed1313 ibe week 5 urp
Aed1313 ibe week 5 urp
nurun2010
 
Env. management and issues
Env. management and issuesEnv. management and issues
Env. management and issues
nurun2010
 
Aed1222 lesson 5
Aed1222 lesson 5Aed1222 lesson 5
Aed1222 lesson 5
nurun2010
 
Aed 1313 ibe wk 2
Aed 1313 ibe wk 2Aed 1313 ibe wk 2
Aed 1313 ibe wk 2
nurun2010
 
Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1
nurun2010
 
Aed1222 lesson 4
Aed1222 lesson 4Aed1222 lesson 4
Aed1222 lesson 4
nurun2010
 
Aed1222 lesson 3 addition
Aed1222 lesson 3 additionAed1222 lesson 3 addition
Aed1222 lesson 3 addition
nurun2010
 
Aed1222 lesson 2
Aed1222 lesson 2Aed1222 lesson 2
Aed1222 lesson 2
nurun2010
 
Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3
nurun2010
 

Plus de nurun2010 (19)

Ibe quantity surveyor 2013
Ibe quantity surveyor 2013Ibe quantity surveyor 2013
Ibe quantity surveyor 2013
 
Conservation updated (1)
Conservation updated (1)Conservation updated (1)
Conservation updated (1)
 
Week 8 landscape architecture
Week 8 landscape architectureWeek 8 landscape architecture
Week 8 landscape architecture
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscore
 
Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2
 
Lesson 7 measures of dispersion part 1
Lesson 7 measures of dispersion part 1Lesson 7 measures of dispersion part 1
Lesson 7 measures of dispersion part 1
 
Ibe interiordesign
Ibe interiordesignIbe interiordesign
Ibe interiordesign
 
Week architecture updated
Week  architecture updatedWeek  architecture updated
Week architecture updated
 
Aed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd partAed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd part
 
Aed1222 lesson 6
Aed1222 lesson 6Aed1222 lesson 6
Aed1222 lesson 6
 
Aed1313 ibe week 5 urp
Aed1313 ibe week 5 urpAed1313 ibe week 5 urp
Aed1313 ibe week 5 urp
 
Env. management and issues
Env. management and issuesEnv. management and issues
Env. management and issues
 
Aed1222 lesson 5
Aed1222 lesson 5Aed1222 lesson 5
Aed1222 lesson 5
 
Aed 1313 ibe wk 2
Aed 1313 ibe wk 2Aed 1313 ibe wk 2
Aed 1313 ibe wk 2
 
Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1
 
Aed1222 lesson 4
Aed1222 lesson 4Aed1222 lesson 4
Aed1222 lesson 4
 
Aed1222 lesson 3 addition
Aed1222 lesson 3 additionAed1222 lesson 3 addition
Aed1222 lesson 3 addition
 
Aed1222 lesson 2
Aed1222 lesson 2Aed1222 lesson 2
Aed1222 lesson 2
 
Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Lesson 6 measures of central tendency

  • 1. Introduction to Statistics for Built Environment Course Code: AED 1222 Compiled by DEPARTMENT OF ARCHITECTURE AND ENVIRONMENTAL DESIGN (AED) CENTRE FOR FOUNDATION STUDIES (CFS) INTERNATIONAL ISLAMIC UNIVERSITY MALAYSIA
  • 2. Lecture 7 Measures of central tendency Today’s Lecture: Measures of central tendency for grouped and ungrouped data:  The arithmetic mean/trimmed mean  The median  The mode  Summary of comparative characteristics
  • 3. What is/are Measures of Central Tendency? ●Usually called the average with the purpose to summarize in a single value: the typical size, middle property, or central location of a set of values. Measures of Central Tendency ●Measures of Central Tendency is a single value situated at the centre of a data and can be taken as a summary value for that data set. ●The three most common measures of central tendency are the mean, median and mode.
  • 4. Center and Location Mean Mode An overview An overview of common measures of central tendency and location: Median Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. N x n x x N i i n i i ∑ ∑ = = =µ = 1 1
  • 5. The arithmetic mean ●When people use the word average, they are usually referring to the arithmetic mean. ●The arithmetic mean is the most commonly used measure of central tendency. ●The mean is the sum of all scores/data divided by the number of scores/data. ●Which is the best single number to describe a group of scores. ●Called meu for population and x bar for sample mean. What is/are Mean? µ x
  • 6. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • The Mean is the arithmetic average of data values • Mean = sum of values divided by the number of values – Population mean – Sample mean n = Sample Size N = Population Size n x x n i i∑= = 1 N x N i i∑= = 1 µ Formula of : Formula of : (meu) (x bar) The arithmetic mean cont.
  • 7. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • The Mean affected by extreme values (outliers) 0 1 2 3 4 5 6 7 8 9 10 Mean = 3 0 1 2 3 4 5 6 7 8 9 10 Mean = 4 3 5 15 5 54321 == ++++ = 4 5 20 5 104321 == ++++ = Example 1: (no outliers) Example 2: (with outliers) The arithmetic mean cont. OUTLIERS
  • 8. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. Example (Cont.): DATA ARRAY 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Sorted raw data from low to high: Insulation manufacturer 20 days high temperature record. • Computing the Mean for ungrouped data The arithmetic mean cont. 20 5853464443413837353230272726242421171312 +++++++++++++++++++ = 20 648 = =x 32.4 n x x n i i∑= = 1Formula of :
  • 9. ●The same process in principle. ●However, since the compression of data in a frequency table results in the loss of actual values of the observations in each class, it becomes necessary to make an assumption about these values. The assumption is that every observation in a class has a value equal to the class mid-point. • Computing the Mean for grouped data The arithmetic mean cont.
  • 10. No. of Liters sold No. of sales staff (f) Class mid-points (m) fm 80 and less than 90 2 85 170 90 and less than 100 6 95 570 100 and less than 110 10 105 1050 110 and less than 120 14 115 1610 120 and less than 130 9 125 1125 130 and less than 140 7 135 945 140 and less than 150 2 145 290 f 50 fm 5760 =x = 5760/50 The arithmetic mean cont. • Computing the Mean for grouped data Example : Formula of : = 115.2 Liters sold
  • 11. ●The mean is a good measure for roughly symmetric distributions. ●Can be misleading in skewed distributions since it can be greatly influenced by extreme values (outliers), and thus it is not the most appropriate measure of central tendency for very skewed distributions. ●This problem associated with the calculation of the arithmetic mean can be overcome by relying on a slightly modified measure of central tendency: the trimmed mean. The arithmetic mean cont.
  • 12. The trimmed mean ●The trimmed mean is calculated by “trimming” or dropping the smallest and largest numbers from the data set and calculating the mean of the remaining numbers. There is no rule determining the number of values to be trimmed. This rather depends on the data available. For example, a 5% trimmed mean would be calculated by dropping the smallest 5% and the largest 5% of the data set and computing the mean for the remaining 90% of the original data. ●The trimmed mean is a compromise between the arithmetic (ordinary) mean and the median. Why?
  • 13. The median ●The median is a measure of central tendency that occupies/lies the middle position in an array of values. Half (50%) the data items fall below the median, and another half (50%) are above that value. ●The median position (not the median value) can be found using the formula: i=(n+1)/2 or i=(1/2)n where ‘n’ is the number of observations or values in a data set. What is/are Median?
  • 14. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • In an ordered array, the median is the “middle” number, i.e., the number that splits the distribution in half • The median is not affected by extreme values (outliers) The median cont. Example 1: (no outliers) Example 2: (with outliers) 0 1 2 3 4 5 6 7 8 9 10 Median = 3 0 1 2 3 4 5 6 7 8 9 10 Median = 3 OUTLIERS
  • 15. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • To find the median, sort the n data values from low to high (sorted data is called a data array) • Find the value in the i = (1/2)n position • The ith position is called the Median Index Point. – If i is not an integer, round up to next highest integer The median cont. For Example:
  • 16. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • Note that n = 13 • Find the i = (1/2)n position: i = (1/2)(13) = 6.5 • Since 6.5 is not an integer, round up to 7 • The median is the value in the 7th position: Md = 12 Data array: 4, 4, 5, 5, 9, 11, 12, 14, 16, 19, 22, 23, 24 The median cont. • Computing the Median for ungrouped data
  • 17. ●If using the formula results in a non-integer value, we take the average of the two nearest numbers. For example: n=18, based on the formula, the median position is: i=(18+1)/2=9.5, in this case we take the average of the 9th and 10th values as the median of the data set. The median cont. More Example:
  • 18. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. Example (Cont.): DATA ARRAY 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Sorted raw data from low to high: Insulation manufacturer 20 days high temperature record. • Computing the Median for ungrouped data The median cont. 2 120+ = 2 62 = i =10.5 Formula of : i=(n+1)/2 Md = 31 Median = 31 2 3230+ =Find Average of :
  • 19. • Computing the Median for grouped data The median cont. ●Since the actual values of a data set are lost when a distribution is constructed, it is only possible to approximate the median value for grouped data. ●The median for grouped data can be estimated using the following formula:
  • 20. Where: Bl = lower boundary of class containing median n = sample size cfp= cumulative frequency of classes preceding class containing the median fm = number of observations in class containing the median i = width of the interval containing the median Computing the median for grouped data cont. i) f m cf p- 2 n (+Bl=Med Formula of :
  • 21. No. of Liters sold No. of sales staff (f) Cumulative frequency (cf) 80 and less than 90 2 2 90 and less than 100 6 8 100 and less than 110 10 18 110 and less than 120 14 32 120 and less than 130 9 41 130 and less than 140 7 48 140 and less than 150 2 50 Computing the median for grouped data cont. Compute the median for the above data set.
  • 22. i) f m cf p- 2 n (+Bl=Med 10 14 18 50 110 ) - 2(+= 10 14 7 110 )(+= 105.0110 )(+= = 115 Liters sold 5110+= Answer:
  • 23. The mode ●The mode is the most commonly occurring value in a data set. A distribution may have one mode, two modes (bi- modal) or more modes (multi-modal). It is also possible for a distribution to have no mode. ●The mode may be an important measure to a clothing manufacturer who must decide how many dresses of each size to make. What is most commonly purchased size? What is/are Mode?
  • 24. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • A measure of location. • The value that occurs most often. • Not affected by extreme values (outliers) • Used for either numerical or categorical data. • There may be no mode • There may be several modes 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Mode = 5 0 1 2 3 4 5 6 No Mode The mode cont.
  • 25. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. Example (Cont.): DATA ARRAY 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 Insulation manufacturer 20 days high temperature record. • Estimating the Mode for ungrouped data The mode cont. Mode = 24 Mode = 27
  • 26. Estimating the mode for grouped data ●When actual data values are unknown, the class in a distribution with the largest frequency is often referred to as the modal class. ●The mode may then be defined to be the mid- point of that class. ●If two or more classes share the distinction of having the largest frequency, then there are two or more mid-point values representing two or more modes.
  • 27. Where: L = lower boundary of class containing the mode f0 = frequency of class containing the mode f1 = frequency of class preceding the class containing the mode f2= frequency of class after the class containing the mode c = size of the class containing the mode Computing the mode for grouped data cont. Formula of : c f-ff-f f-f +L=Mode         + )20()10( 10
  • 28. No. of Liters sold No. of sales staff (f) Cumulative frequency (cf) 80 and less than 90 2 2 90 and less than 100 6 8 100 and less than 110 10 18 110 and less than 120 14 32 120 and less than 130 9 41 130 and less than 140 7 48 140 and less than 150 2 50 Estimating the mode for grouped data cont. Estimate the mode for the above data
  • 29. = 114.4 Liters sold 44.4110+= Answer: c f-ff-f f-f +L=Mode         + )20()10( 10 10 )914()1014( 1014 110       + -- - += 10 )5()4( 4 110       + += 10 9 4 110     +=
  • 30. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. • Five houses on a hill by the beach $2,000 K $500 K $300 K $100 K $100 K House Prices: RM 2,000,000 RM 500,000 RM 300,000 RM 100,000 RM 100,000 Review Example RM 2m RM 500k RM 300k RM 100k RM 100k
  • 31. Business Statistics: A Decision- Making Approach, 7e © 2008 Prentice-Hall, Inc. Summary Statistics • Mean: (RM 3,000,000 / 5) = RM 600,000 • Median: middle value of ranked data = RM 300,000 • Mode: most frequent value = RM 100,000 House Prices: RM2,000,000 500,000 300,000 100,000 100,000 Sum 3,000,000 Review Example cont.
  • 32. Which measure to use? ●Not all measures are appropriate for all kinds of variables. ●Nominal data (e.g. gender, race)>> mode is the only valid measure. ●Ordinal data (e.g. salary categories)>> mode & median can be used. • When to use the arithmetic mean? – The best measure for continuous data. • When to use the median? – When you know that a distribution is skewed. – When you have a small number of subjects. • When to use the mode? – Only when describing discrete categorical data.
  • 33. Which measure to use? cont.
  • 34. Summary of comparative characteristics The arithmetic mean: 1. It is the most familiar and most widely used measure. 2. It is a measure that is affected by the value of every observation in the data set. 3. Its value may be distorted too much by a relatively few extreme values (outliers). And thus can lose its representative quality in badly skewed data. The trimmed mean can help overcome such a problem. 4. It can not be computed from a frequency distribution with an open ended class.
  • 35. The median: 1. It is easy to define and easy to understand. 2. It is affected by the number of observations but not by the values of these observations. Thus extremely high or low values (outliers)do not distort the median. 3. It is frequently used in badly skewed distributions. 4. It may be computed in an open-ended distribution, since the median value is located in the median class interval which is highly unlikely to be an open-ended interval. Summary of comparative characteristics
  • 36. The mode: 1. It is generally a less widely used measure than the mean and median. 2. It may not exist in some sets of data, or there may be more than one mode in other data sets. 3. It is not affected by extreme values (outliers) in a distribution. Summary of comparative characteristics
  • 37. Next class… The following topics will be discussed:  Measures of variability / dispersion (Part I):  The range  Quartiles & the Interquartile range  Percentiles  The five number summary

Notes de l'éditeur

  1. Updated Version 11/12/2012
  2. It is a compromise in that if you drop none of the data you get an ordinary mean, while if you drop all the data but one value, you get the median.