Measures of Central tendency

Measures
of
Central tendency
Sir Parashurambhau College
Pune, India
anildusane@gmail.com
1

Measures of central tendency
• In statistics, collection of data is the first step, but this is not
sufficient to arrive at any conclusion.
• It is essential to represent data by a single value that shows
conclusion of data at that particular value. This is possible by the
measures of central tendency.
• Generally, it is seen that the values of variables have a tendency to
cluster around a particular value. This tendency is called as
Central tendency.
2

Measures of central tendency
• Agricultural scientist, genetics, plant breeder, manager, teacher,
production engineer, etc. often talk about averages in the context of
average weight gain, average milk production per day, average
income of worker, average marks of students etc.
• These averages simply summarize a set of data in a single value.
• These are representative of data and around which large portion of
observations gathered.
• The averages can be presented as one of the three common
measures of central tendency viz. mean, mode and median.
3

Properties of a good measure of central tendency
1. It should be rigidly defined.
2. It should be unique i.e. only one value.
3. It should be based on all observations (values).
4. It should have sampling stability i.e. it should not be
affected by sampling fluctuations.
5. It should be capable for the further mathematical
calculations
6. It should not be affected by extreme values.
4

Arithmetic Mean(AM)
Mean:
• It is a widely used measure of central tendency.
• There are three types of means, i) Arithmetic Mean (AM), Geometric Mean
(GM) and Harmonic Mean (HM).
• The selection of mean depends on a particular type of data.
Arithmetic Mean (A.M.):
• It is popularly known as average.
• It is the most commonly used measure.
• It is calculated by dividing sum of all the individual observations by the total
number of observations.
• A.M. = Sum of observations/total number of observations
5

Arithmetic Mean for individual observations (i.e. ungrouped data)
• It is computed by summing up of the observations and dividing the sum by total
number of observations.
• A.M. = X =X/n
• A.M. for a discrete series (grouped data):
• For computation of arithmetic mean in a discrete series the values of variables are
multiplied by their respective frequencies.
• However total number of observations is equal to the total number of frequencies.
• X =(fx)/n
• f= frequency of individuals in each class.
• x= class value of measurements
• n = number of observations.
6

Merits of Arithmetic mean
1. It is simple to understand and easy to calculate.
2. It is calculated by the perfect formulae so every one gets same results.
3. It is not necessary to arrange the data.
4. It provides good measure for comparison between two or more groups.
5. It takes into account all the observations in the series.
6. Subsequent calculations and algebraic treatments can be done in better way
than median and mode.
7. It is unique i.e. for a given set of data, there can be only one mean.
8. It does not change much even when repeated samples are taken for the same
population.
7

Demerits of Arithmetic Mean
1.It is largely affected by the extreme values of data.
2.It does not consider increasing and decreasing trend.
3.It is a good measure of central tendency only when the distribution
is normal (Bell shaped) and it is not useful in case of ‘U’ shaped
distribution.
4.In some cases it may lead to meaning less results.
5.It cannot be calculated in open-end classes without making
assumptions about the size of the class interval.
8

Important Applications of AM
1. It is mostly used in practical statistics.
2.It is used to calculate many other estimates in statistics.
3.It is the most used method by any measurement used by
common people to get the average of any data.
4.It is essential to calculate Mean deviation, variance, standard
deviation, etc.
9

Geometric mean (G.M)
• It usually a more suitable as a measure of central tendency when the
values changes exponentially.
• It is extensively used in microbiological and serological research. If
there are two observations G.M. is the square root of product of two
observations.
• If there are three observations it is a cube root of the product of
three observations, if there are ‘n’ observations it is the nth root of
the product of ‘n’ observations.
10

Merits and Demerits of GM
Merits of GM:
1. It is based on all observations
2. It is rigidly defined.
3. It is not much affected by fluctuations of sampling.
4. It is suitable for averaging rates, ratios and percentages.
5. It gives larger weightage to the small items.
Demerits of GM:
1. It is difficult to understand and calculate.
2. It cannot be calculated when there are both negative and positive values.
3. If one or more values are “ 0”, then it would also be “0”.
4. The geometric mean cannot be used if any of the original observations are
negative, since negative number has no logarithm.
11

Calculations of GM in a series of individual observations
• Steps involved:
1.Find logs of the given values (x1,x2,x3 etc)
2.Take the sum of the given values.
3.Sum of logs is divided by number of observations.
4.Find out antilog of the values
12

Harmonic Mean (H.M.)
• It is suitable measure of central tendency when the data pertained
to speed, rates and time.
• H.M. (for ungrouped data) = n / (1/n)
• H.M.(for grouped data) = f /  (f/x)
• Relation ship between A.M., GM and H.M.
• It has been observed that A.M is greater than G.M. while
G.M. is greater than H.M.
• Therefore A.M.  G.M.  H.M.
13

Merits and Demerits of GM
Merits:
1. It is rigidly defined.
2. It is based on all observations of a series,
3. It gives greater weightage to the smaller items.
4. It is useful to study the average price, speed, time, distance etc.
5. It is not much affected by sampling fluctuations.
Demerits:
1. It is not easy to calculate and understand.
2. It cannot be calculated if one of the value is ‘0’
14

Median
• It is a positional average.
• It is the "middle" value in the list of numbers when observations are arranged
in ascending or descending order.
• It is a value separating the higher half from the lower half of a data sample.
• If a series of observations is 23,25,27,29,31,33, then median will be 27.
23
25
27 Median
29
31
33
15

Merits of median
1.It is easy to define and understand.
2.It is unique i.e. it has single value.
3.It is simple to calculate.
4.It is not affected by extreme values of data.
5.It indicates exactly the middle item and hence it can be easily
explained.
6.It can be determined by mere inspection in some cases.
7.It is useful when the classes are open and when the distribution is
unequal (skewed distributions).
16

Demerits of median
1.For calculating the median, it is necessary to arrange the data in
ascending or descending order, however it is not for calculation of
mean and mode.
2.If the number of observations is large, then arranging the data is
tedious and time consuming.
3.In this relative importance cannot be judged properly.
4.It is a proportional average and its value is not determined by each and
every observation.
5.It largely affected by sampling fluctuations.
6.It can not be used for further mathematical processing.
17

Median for individual observations
This involves the following steps:
• 1. Arrange the data in the ascending order.
• 2. Median is located by finding the size of n+1 / 2 th item.
• Median = Size of n+1 / 2 th item
• Where n = number of observations.
18

Median for continuous series
It involves following steps.
1. The first step is to determine median class by using n/2 as the rank [not (n+1)/ 2 ].
2. To determine cumulative frequencies. This is done by adding the number of
frequencies that are in each interval to the frequencies in the next class.
3. The cumulative frequency tells us where n/2 falls.
Median = L + [ ( n/2 – cf ) / f ] x i
Where L = lower limit of median class
Cf = cumulative frequency of the class preceding the median clas
f = frequency of the median class,
i = class interval.
19

Application of median
• Median is used when distribution is markedly
skewed and when small number of measurements is
given.
• Usually, a median is computed when we are
interested, whether cases fall with in the upper or
lower half of distribution and not particularly in how
far they are from the central point.
20

Mode
• It is the value or values that occur most often in distribution.
• It is nothing but the most frequently occurring value.
• It is symbolized by ‘Mo’.
• E.g. Following observations has been recorded
10,15, 12, 13, 14, 15, 12,11,13,15
Value Frequency
10 1
11 1
12 2
13 2
14 1
15 3 15 occurs is having highest frequency so 15 is mode
21

Merits of Mode
1.It is simple and precise.
2.It is easy to calculate and can be determined by a mere
observation of data.
3.It is not affected by extreme values of variables.
4.It is applicable to quantitative as well as qualitative data.
5.It can be calculated from a grouped frequency distribution
having open-end classes and with class intervals of unequal
lengths.
6.The value of mode can be ascertained graphically.
22

Demerits of Mode
1.It is largely affected by sampling fluctuations.
2.It is not always possible to find out well-defined mode.
3.It can not be used for further mathematical calculations.
4.It is not based on all observations.
5.The value of mode can not be determined in bimodal
distribution.
23

Mode of individual observations
• The value which is occurring maximum number of times is the
modal value.
• Mode in discrete series:
• It can be determined by looking to that value of variable around
which the items are most heavily concentrated.
• In frequency distribution, modal class is that class interval which
contains the highest frequency of observations and the mode is the
mid- point of this class interval.
24

Mode of continuous series
• Mode = Lm + d1/(d1+d2) x c
• Lm = lower limit of modal class,
• d1 = frequency of modal class – frequency of the preceding class,
(ignore signs)
• d2 = frequency of modal class – frequency of the succeeding class
( ignore signs).
• c = class interval of modal class.
25

Significance of central tendency
• A measure of central tendency tells only about the general level of
magnitudes of the distribution, but it fails to give any idea of the variability
of the observations, as to the individual values scatter around to what extent
or spread about the average.
• To know the extent of the spread about these variations the measure called
Dispersion is used.
• In general, greater the spread from the mean, greater is the variability.
• The different measures of variability (dispersion) are range, mean deviation,
variance, standard deviation, etc.
26

Measures of Central tendency

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Measures of Central tendency

Similar to Measures of Central tendency (20)

More from Sir Parashurambhau College, Pune

More from Sir Parashurambhau College, Pune (20)

Recently uploaded

Recently uploaded (20)

Measures of Central tendency