A. measure of central tendency

Measure of Central
Tendency

A measure indicating the value to be
expected of a typical or middle data point.

The Arithmetic Mean

 A central tendency measure representing the
arithmetic average of a set of observations.
 The Population Arithmetic Mean, µ, = Σx
N
 The Sample Arithmetic Mean, x = Σx
n

The Arithmetic Mean
 Calculating the Mean from Grouped Data:
x = Σfx
n
 Calculating the Mean of Grouped Data Using
Codes:
x = x0 + w * Σ(u*f)
n

The Arithmetic Mean
 Advantages:
 Its concept is familiar to most of people
 Every data set has one & only one mean
 It is useful for comparison

 Disadvantages:
 It is affected by the extreme values in the data set
 It is tedious to calculate for large data
 It cannot be calculated for grouped data with open-
ended classes

The Weighted Mean
 Weighted Mean xw = Σ(w *x)
Σw
Where,
xw = symbol for the weighted mean
w = weight assigned to each observation
Σ(w *x) = sum of the weight of each element
times that element
Σw = sum of all the weights

Geometric Mean
 For quantities that change over a period of
time, if we need to know an average rate of
change, the arithmetic mean is inappropriate.
Geometric mean offers useful measure in
such a case.

GM = (product of all x values)1/n
Where n is number of x values

The Median
 The median is a single value that measures
the central item in the data set. Half the items
lie above the median, half below it. If the data
set contains an odd number of items, the
middle item of the array is the median. For an
even number of items, the median is the
average
 Median = (n + 1)th item in a data array
2
Where n = number of items in the array

The Median
 Calculating Median of grouped data =
m = [ (n + 1)/2 – (F + 1)]*w + Lm.
fm
Where,
m = median, n = total number of items,
F = the sum of all the class frequencies up to, but
not including, the median class,
fm = frequency of observations of the median class
w = the class-interval width
Lm = lower limit of the median class interval

The Median
 Advantages:
 Extreme values do not affect the median.
 It is easy to understand and can be calculated
from any kind of data, even for grouped data
with open-ended classes, unless the median
falls in an open-ended class.
 Can be calculated for qualitative data.

The Median

 Disadvantages:
 Certain statistical procedures that use the
median are more complex than those that use
the mean.

 To find median value, data first need to be
arranged in ascending order. For large set of
data this could be time consuming.

The Mode
The mode is that value most often repeated in the
data set. The mode of grouped data,
M o = LM + [ d1 ]*w
(d1 + d2)
LM= lower limit of mode class
d1 = frequency of the modal class minus the
frequency of the class directly below it
d2 = frequency of the modal class minus the
frequency of the class directly above it
w = width of the modal class interval

Dispersion
 The spread or variability in a set of data.
 Measures of Dispersion:
• Range
• Interfractile Range: Quartiles, Deciles,
percentiles
• Variance
• Standard Deviation

Range
 The range is the difference between the
highest and the lowest values in a frequency
distribution
 The interquartile range measures
approximately how far from the median we
must go on either side before we can include
one-half the values of the data set.
 Interquartile Range = Q3 – Q1.

Variance & Standard Deviation of
Population
 Variance is a measure of the average
squared distance between the mean and
each item in the data set.

 σ2 = Σ(x - µ)2 = Σ x2 - µ2
N N
 The standard deviation is the positive square
root of the variance. It is expressed in the
same units as the data.
 σ = √σ2

Population

 For calculating variance of grouped data
 σ2 = Σf*(x - µ)2 = Σf x2 - µ2
N N
Where, f represents the frequency of the
class and x represents the midpoint.

Sample
 S2 = Σ(x - x)2 = Σ x2 – n*x2
n–1 n–1 n–1
Notice the change in formula, instead of N,
n – I has come as divisor. If we divide by n,
the result will have some bias as an
estimator. Using a divisor of n – 1 gives us an
unbiased estimation.

Uses of Standard Deviation

 It helps determining, where the values of a
frequency distribution are located in relation
to mean. (Standard Normal Curve)
 It is also useful in describing how far
individual items in a distribution depart from
the mean of the distribution.

A. measure of central tendency

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à A. measure of central tendency

Similaire à A. measure of central tendency (20)

A. measure of central tendency