SlideShare a Scribd company logo
1 of 41
MONDAY, DECEMBER 23
2013
INTRODUCTION TO THEORY OF
STATISTICS
BY
SRIRAM C
I sem M.Tech Geoinformatics
KSRSAC
Today:
Central Tendency , Dispersion & Probability
 From frequency tables to distributions
 Types of Distributions: Normal, Skewed
 Level of Measurement:
Nominal, Ordinal, Interval
 Central Tendency: Mode, Median, Mean
 Dispersion: Variance, Standard Deviation
Descriptive statistics are concerned with
describing the characteristics of frequency
distributions
 Where is the center?
 What is the range?
 What is the shape [of the
distribution]?
Frequency Distributions OR HISTOGRAMS
 Simple depiction of all the data
 Graphic — easy to understand
 Problems
 Not always precisely measured
 Not summarized in one number or datum
 Simple depiction of all the data
 Graphic — easy to understand
 Problems
 Not always precisely measured
 Not summarized in one number or datum
Frequency Table
Test Scores
Observation Frequency
65 1
70 2
75 3
80 4
85 3
90 2
95 1
Frequency Distributions
Test Score
Frequency
4
3
2
1
65 70 75 80 85 90 95
Normally Distributed Curve
Skewed Distributions
Summarizing Distributions
Two key characteristics of a frequency distribution
are especially important when summarizing
data or when making a prediction from one set
of results to another:
 Central Tendency
 What is in the “Middle”?
 What is most common?
 What would we use to predict?
 Dispersion
 How Spread out is the distribution?
 What Shape is it?
Three measures of central tendency are commonly
used in statistical analysis - the mode, the median,
and the mean
Each measure is designed to represent a typical score
The choice of which measure to use depends on:
• the shape of the distribution (whether normal or
skewed), and
• the variable’s “level of measurement” (data are
nominal, ordinal or interval).
Appropriate Measures of
Central Tendency
• Nominal variables Mode
• Ordinal variables Median
• Interval level variables Mean
- If the distribution is normal (median is better
with skewed distribution)
• Nominal variables Mode
• Ordinal variables Median
• Interval level variables Mean
- If the distribution is normal (median is better
with skewed distribution)
Mode
Most Common Outcome
Find the Mode
4 5 6 6 7 8 9 10 12
Ans:6
Median
Middle-most Value
50% of observations are above the Median, 50% are
below it
The difference in magnitude between the
observations does not matter
Therefore, it is not sensitive to outliers
Formula Median = n + 1 / 2
To compute the median
• first you rank order the values of X from low to
high:  85, 94, 94, 96, 96, 96, 96, 97, 97, 98
• then count number of observations = 10.
• add 1 = 11.
• divide by 2 to get the middle score  the 5 ½
score
here 96 is the middle score score
Mean - Average
 Most common measure of central tendency
 Best for making predictions
 Applicable under two conditions:
1. scores are measured at the interval level, and
2. distribution is more or less normal [symmetrical].
 Symbolized as:
 for the mean of a sample
 μ for the mean of a population
X
Finding the MeanFinding the Mean
• X = (Σ X / N)
• If X = {3, 5, 10, 4, 3}
X = (3 + 5 + 10 + 4 + 3) / 5
= 25 / 5
= 5
• X = (Σ X / N)
• If X = {3, 5, 10, 4, 3}
X = (3 + 5 + 10 + 4 + 3) / 5
= 25 / 5
= 5
Find the Mean
Q: 4, 5, 8, 7
A: 6
Median: 6.5
Q: 4, 5, 8, 1000
A: 254.25
Median: 6.5
Why can’t the mean tell us everything?
Mean describes Central Tendency, what the
average outcome is.
We also want to know something about how
accurate the mean is when making predictions.
The question becomes how good a representation
of the distribution is the mean? How good is the
mean as a description of central tendency -- or
how good is the mean as a predictor?
Answer -- it depends on the shape of the
distribution. Is the distribution normal or
skewed?
Measures of Variability
Central Tendency doesn’t tell us everything
Dispersion/Deviation/Spread tells us a lot about how a
variable is distributed.
We are most interested in Standard Deviations (σ) and
Variance (σ2
)
Dispersion
Once you determine that the variable of interest is
normally distributed, ideally by producing a
histogram of the scores, the next question to be
asked about the Normally Distributed Curve is its
dispersion: how spread out are the scores
around the mean.
Dispersion is a key concept in statistical thinking.
The basic question being asked is how much do the
scores deviate around the Mean? The more
“bunched up” around the mean the better your
ability to make accurate predictions.
How well does the mean represent the scores in a
distribution? The logic here is to determine
how much spread is in the scores. How much
do the scores "deviate" from the mean? Think
of the mean as the true score or as your best
guess. If every X were very close to the Mean,
the mean would be a very good predictor.
If the distribution is very sharply peaked then the
mean is a good measure of central tendency
and if you were to use the mean to make
predictions you would be right or close much of
the time.
Mean Deviation
The key concept for describing normal distributions
and making predictions from them is called
deviation from the mean.
We could just calculate the average distance between
each observation and the mean.
• We must take the absolute value of the distance,
otherwise they would just cancel out to zero!
Formula:
| |iX X
n
−
∑
Mean Deviation: An ExampleMean Deviation: An Example
X – Xi Abs. Dev.
7 – 6 1
7 – 10 3
7 – 5 2
7 – 4 3
7 – 9 2
7 – 8 1
1. Compute X (Average)
2. Compute X – X and take
the Absolute Value to get
Absolute Deviations
3. Sum the Absolute
Deviations
4. Divide the sum of the
absolute deviations by N
Data: X = {6, 10, 5, 4, 9, 8} X = 42 / 6 = 7
Total: 12 12 / 6 = 2
What Does it Mean?
On Average, each observation is two units
away from the mean.
Is it Really that Easy?
• No!
• Absolute values are difficult to manipulate algebraically
• Absolute values cause enormous problems for calculus
(Discontinuity)
• We need something else…
Variance and Standard Deviation
Instead of taking the absolute value, we square
the deviations from the mean. This yields a
positive value.
This will result in measures we call the Variance
and the Standard Deviation
Sample- Population-
s: Standard Deviation σ: Standard Deviation
s2
: Variance σ2
: Variance
Example:
-1 1
3 9
-2 4
-3 9
2 4
1 1
Data: X = {6, 10, 5, 4, 9, 8}; N = 6
Total: 42 Total: 28
Standard Deviation:
7
6
42
===
∑
N
X
X
Mean:
Variance:
2
2
( ) 28
4.67
6
X X
s
N
−
= = =
∑
16.267.42
=== ss
XX − 2
)( XX −X
6
10
5
4
9
8
Introduction to Probability
Experiments, Counting Rules,Experiments, Counting Rules,
and Assigning Probabilitiesand Assigning Probabilities
Events and Their ProbabilityEvents and Their Probability
Some Basic RelationshipsSome Basic Relationships
of Probabilityof Probability
Conditional ProbabilityConditional Probability
Probability as a Numerical MeasureProbability as a Numerical Measure
of the Likelihood of Occurrenceof the Likelihood of Occurrence
00 11.5.5
Increasing Likelihood of OccurrenceIncreasing Likelihood of Occurrence
Probability:Probability:
The eventThe event
is veryis very
unlikelyunlikely
to occur.to occur.
The occurrenceThe occurrence
of the event isof the event is
just as likely asjust as likely as
it is unlikely.it is unlikely.
The eventThe event
is almostis almost
certaincertain
to occur.to occur.
An Experiment and Its Sample SpaceAn Experiment and Its Sample Space
AnAn experimentexperiment is any process that generatesis any process that generates
well-defined outcomes.well-defined outcomes.
AnAn experimentexperiment is any process that generatesis any process that generates
well-defined outcomes.well-defined outcomes.
TheThe sample spacesample space for an experiment is the set offor an experiment is the set of
all experimental outcomes.all experimental outcomes.
TheThe sample spacesample space for an experiment is the set offor an experiment is the set of
all experimental outcomes.all experimental outcomes.
An experimental outcome is also called aAn experimental outcome is also called a samplesample
pointpoint..
An experimental outcome is also called aAn experimental outcome is also called a samplesample
pointpoint..
Events & Probabilities…
An individual outcome of a sample space is called a simple
event [cannot break it down into several other events],
An event is a collection or set of one or more simple events
in a sample space.
Roll of a die: S = {1, 2, 3, 4, 5, 6}
Simple event: the number “3” will be rolled
Event: an even number (one of 2, 4, or 6) will be rolled
Events & Probabilities…
The probability of an event is the sum of the probabilities of
the simple events that constitute the event.
E.g. (assuming a fair die) S = {1, 2, 3, 4, 5, 6} and
P(1) = P(2) = P(3) = P(4) = P(5) = P(6) = 1/6
Then:
P(EVEN) = P(2) + P(4) + P(6) = 1/6 + 1/6 + 1/6 = 3/6 = 1/2
Probability Rules:
Mathematical Notation
Random Variables
 A random variable is a variable whose value
is a numerical outcome of a random
phenomenon
 often denoted with capital alphabetic symbols
(X, Y, etc.)
 a normal random variable may be denoted as
X ~ N(µ, σ)
 The probability distribution of a random
variable X tells us what values X can take and
how to assign probabilities to those values
Random Variables
 Random variables that have a finite
(countable) list of possible outcomes, with
probabilities assigned to each of these
outcomes, are called discrete
 Random variables that can take on any
value in an interval, with probabilities
given as areas under a density curve, are
called continuous
Random Variables
 Discrete random variables
 number of pets owned (0, 1, 2, … )
 numerical day of the month (1, 2, …, 31)
 how many days of class missed
 Continuous random variables
 weight
 temperature
 time it takes to travel to work
Conditional Probability…
Conditional probability is used to determine how two events
are related; that is, we can determine the probability of one
event given the occurrence of another related event.
Experiment: random select one student in class.
P(randomly selected student is male) =
P(randomly selected student is male/student is on 3rd
row) =
Conditional probabilities are written as P(A | B) and read as
“the probability of A given B” and is calculated as:
Conditional Probability…
Again, the probability of an event given that another event
has occurred is called a conditional probability…
P( A and B) = P(A)*P(B/A) = P(B)*P(A/B) both are true
Keep this in mind!
Data ExplorationSUMMARY
Descriptive statistics help describe your data’s distribution
A measure of central tendency and dispersion are needed to
describe your data’s distribution statistically
Ideally your data fits the descriptions of a normal distribution
with data distributed evenly on either side of the measure of
central tendency.
The following are measures of central tendency: mean, median
and mode
The following are measure of dispersion: range, variance, and
standard deviation
Histograms and box plots can help you illustrate your data’s
distribution
Your descriptive statistics, histograms and/or box plots together
help you describe the nature of your data
After exploring your data using descriptive statistics it’s good to
reflect on your question and modify or refine it as needed.
Thanks
MONDAY, DECEMBER 23
2013

More Related Content

What's hot

Normal distribution
Normal distributionNormal distribution
Normal distributionGlobal Polis
 
M.Ed Tcs 2 seminar ppt npc to submit
M.Ed Tcs 2 seminar ppt npc   to submitM.Ed Tcs 2 seminar ppt npc   to submit
M.Ed Tcs 2 seminar ppt npc to submitBINCYKMATHEW
 
Thiyagu normal probability curve
Thiyagu   normal probability curveThiyagu   normal probability curve
Thiyagu normal probability curveThiyagu K
 
Normal distribution slide share
Normal distribution slide shareNormal distribution slide share
Normal distribution slide shareKate FLR
 
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...Gilbert Joseph Abueg
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervalsLayal Fahad
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersionsCapricorn
 
Central tendency
Central tendencyCentral tendency
Central tendencysaadinoor
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)jillmitchell8778
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscorenurun2010
 
Mba i qt unit-2.1_measures of variations
Mba i qt unit-2.1_measures of variationsMba i qt unit-2.1_measures of variations
Mba i qt unit-2.1_measures of variationsRai University
 
MEASURE OF DISPERSION
MEASURE OF DISPERSIONMEASURE OF DISPERSION
MEASURE OF DISPERSIONHasnan Naveed
 
Normal distribuation curve[1]
Normal distribuation curve[1]Normal distribuation curve[1]
Normal distribuation curve[1]Sherien Youssry
 
Standard deviationnormal distributionshow
Standard deviationnormal distributionshowStandard deviationnormal distributionshow
Standard deviationnormal distributionshowBiologyIB
 

What's hot (20)

Central tendency
Central tendencyCentral tendency
Central tendency
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
M.Ed Tcs 2 seminar ppt npc to submit
M.Ed Tcs 2 seminar ppt npc   to submitM.Ed Tcs 2 seminar ppt npc   to submit
M.Ed Tcs 2 seminar ppt npc to submit
 
Thiyagu normal probability curve
Thiyagu   normal probability curveThiyagu   normal probability curve
Thiyagu normal probability curve
 
Dispersion stati
Dispersion statiDispersion stati
Dispersion stati
 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statistics
 
Normal distribution slide share
Normal distribution slide shareNormal distribution slide share
Normal distribution slide share
 
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...
CABT SHS Statistics & Probability - The z-scores and Problems involving Norma...
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervals
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
 
Central tendency
Central tendencyCentral tendency
Central tendency
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)
 
Introduction to the t-test
Introduction to the t-testIntroduction to the t-test
Introduction to the t-test
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscore
 
Variability
VariabilityVariability
Variability
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Mba i qt unit-2.1_measures of variations
Mba i qt unit-2.1_measures of variationsMba i qt unit-2.1_measures of variations
Mba i qt unit-2.1_measures of variations
 
MEASURE OF DISPERSION
MEASURE OF DISPERSIONMEASURE OF DISPERSION
MEASURE OF DISPERSION
 
Normal distribuation curve[1]
Normal distribuation curve[1]Normal distribuation curve[1]
Normal distribuation curve[1]
 
Standard deviationnormal distributionshow
Standard deviationnormal distributionshowStandard deviationnormal distributionshow
Standard deviationnormal distributionshow
 

Viewers also liked

Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to StatisticsRobert Tinaro
 
Chapter 1 introduction to statistics
Chapter 1 introduction to statisticsChapter 1 introduction to statistics
Chapter 1 introduction to statisticsJohn Carlo Catacutan
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsKapil Dev Ghante
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to StatisticsSaurav Shrestha
 
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMariele Brutas
 
Introduction to statistics 2013
Introduction to statistics 2013Introduction to statistics 2013
Introduction to statistics 2013Mohammad Ihmeidan
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to StatisticsSr Edith Bogue
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to StatisticsAnjan Mahanta
 
Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Fatima Bianca Gueco
 
Chapter 1 introduction to statistics for engineers 1 (1)
Chapter 1 introduction to statistics for engineers 1 (1)Chapter 1 introduction to statistics for engineers 1 (1)
Chapter 1 introduction to statistics for engineers 1 (1)abfisho
 
Statistics lesson 1
Statistics   lesson 1Statistics   lesson 1
Statistics lesson 1Katrina Mae
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsmadan kumar
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsakbhanj
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsAhmed-Refat Refat
 
Introduction to statistics...ppt rahul
Introduction to statistics...ppt rahulIntroduction to statistics...ppt rahul
Introduction to statistics...ppt rahulRahul Dhaker
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...nszakir
 

Viewers also liked (20)

Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Statistics Introduction
Statistics IntroductionStatistics Introduction
Statistics Introduction
 
Chapter 1 introduction to statistics
Chapter 1 introduction to statisticsChapter 1 introduction to statistics
Chapter 1 introduction to statistics
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY AND VARIABILITY
 
Introduction to statistics 2013
Introduction to statistics 2013Introduction to statistics 2013
Introduction to statistics 2013
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Probability and statistics(exercise answers)
Probability and statistics(exercise answers)
 
Chapter 1 introduction to statistics for engineers 1 (1)
Chapter 1 introduction to statistics for engineers 1 (1)Chapter 1 introduction to statistics for engineers 1 (1)
Chapter 1 introduction to statistics for engineers 1 (1)
 
What Is Statistics
What Is StatisticsWhat Is Statistics
What Is Statistics
 
Statistics lesson 1
Statistics   lesson 1Statistics   lesson 1
Statistics lesson 1
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and Methods
 
Introduction to statistics...ppt rahul
Introduction to statistics...ppt rahulIntroduction to statistics...ppt rahul
Introduction to statistics...ppt rahul
 
Statistical ppt
Statistical pptStatistical ppt
Statistical ppt
 
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...Chapter 6 part2-Introduction to Inference-Tests of Significance,  Stating Hyp...
Chapter 6 part2-Introduction to Inference-Tests of Significance, Stating Hyp...
 

Similar to Sriram seminar on introduction to statistics

best for normal distribution.ppt
best for normal distribution.pptbest for normal distribution.ppt
best for normal distribution.pptDejeneDay
 
statical-data-1 to know how to measure.ppt
statical-data-1 to know how to measure.pptstatical-data-1 to know how to measure.ppt
statical-data-1 to know how to measure.pptNazarudinManik1
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersionKirti Gupta
 
Measures of Dispersion .pptx
Measures of Dispersion .pptxMeasures of Dispersion .pptx
Measures of Dispersion .pptxVishal543707
 
Standard deviation
Standard deviationStandard deviation
Standard deviationMai Ngoc Duc
 
Bio statistics
Bio statisticsBio statistics
Bio statisticsNc Das
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbersUlster BOCES
 
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhpolar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhNathanAndreiBoongali
 
ch-4-measures-of-variability-11 2.ppt for nursing
ch-4-measures-of-variability-11 2.ppt for nursingch-4-measures-of-variability-11 2.ppt for nursing
ch-4-measures-of-variability-11 2.ppt for nursingwindri3
 
measures-of-variability-11.ppt
measures-of-variability-11.pptmeasures-of-variability-11.ppt
measures-of-variability-11.pptNievesGuardian1
 
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxLecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxNabeelAli89
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptxVanmala Buchke
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersionSachin Shekde
 

Similar to Sriram seminar on introduction to statistics (20)

best for normal distribution.ppt
best for normal distribution.pptbest for normal distribution.ppt
best for normal distribution.ppt
 
statical-data-1 to know how to measure.ppt
statical-data-1 to know how to measure.pptstatical-data-1 to know how to measure.ppt
statical-data-1 to know how to measure.ppt
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersion
 
21.StatsLecture.07.ppt
21.StatsLecture.07.ppt21.StatsLecture.07.ppt
21.StatsLecture.07.ppt
 
statistics
statisticsstatistics
statistics
 
Statistics
StatisticsStatistics
Statistics
 
Measures of Dispersion .pptx
Measures of Dispersion .pptxMeasures of Dispersion .pptx
Measures of Dispersion .pptx
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
Statistics excellent
Statistics excellentStatistics excellent
Statistics excellent
 
Bio statistics
Bio statisticsBio statistics
Bio statistics
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbers
 
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhhpolar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
polar pojhjgfnbhggnbh hnhghgnhbhnhbjnhhhhhh
 
Inorganic CHEMISTRY
Inorganic CHEMISTRYInorganic CHEMISTRY
Inorganic CHEMISTRY
 
Chapter 11 Psrm
Chapter 11 PsrmChapter 11 Psrm
Chapter 11 Psrm
 
ch-4-measures-of-variability-11 2.ppt for nursing
ch-4-measures-of-variability-11 2.ppt for nursingch-4-measures-of-variability-11 2.ppt for nursing
ch-4-measures-of-variability-11 2.ppt for nursing
 
measures-of-variability-11.ppt
measures-of-variability-11.pptmeasures-of-variability-11.ppt
measures-of-variability-11.ppt
 
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxLecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
 
Basic stat review
Basic stat reviewBasic stat review
Basic stat review
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 

Recently uploaded

Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Recently uploaded (20)

Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

Sriram seminar on introduction to statistics

  • 2. INTRODUCTION TO THEORY OF STATISTICS BY SRIRAM C I sem M.Tech Geoinformatics KSRSAC
  • 3. Today: Central Tendency , Dispersion & Probability  From frequency tables to distributions  Types of Distributions: Normal, Skewed  Level of Measurement: Nominal, Ordinal, Interval  Central Tendency: Mode, Median, Mean  Dispersion: Variance, Standard Deviation
  • 4. Descriptive statistics are concerned with describing the characteristics of frequency distributions  Where is the center?  What is the range?  What is the shape [of the distribution]?
  • 5. Frequency Distributions OR HISTOGRAMS  Simple depiction of all the data  Graphic — easy to understand  Problems  Not always precisely measured  Not summarized in one number or datum  Simple depiction of all the data  Graphic — easy to understand  Problems  Not always precisely measured  Not summarized in one number or datum
  • 6. Frequency Table Test Scores Observation Frequency 65 1 70 2 75 3 80 4 85 3 90 2 95 1
  • 10. Summarizing Distributions Two key characteristics of a frequency distribution are especially important when summarizing data or when making a prediction from one set of results to another:  Central Tendency  What is in the “Middle”?  What is most common?  What would we use to predict?  Dispersion  How Spread out is the distribution?  What Shape is it?
  • 11. Three measures of central tendency are commonly used in statistical analysis - the mode, the median, and the mean Each measure is designed to represent a typical score The choice of which measure to use depends on: • the shape of the distribution (whether normal or skewed), and • the variable’s “level of measurement” (data are nominal, ordinal or interval).
  • 12. Appropriate Measures of Central Tendency • Nominal variables Mode • Ordinal variables Median • Interval level variables Mean - If the distribution is normal (median is better with skewed distribution) • Nominal variables Mode • Ordinal variables Median • Interval level variables Mean - If the distribution is normal (median is better with skewed distribution)
  • 13. Mode Most Common Outcome Find the Mode 4 5 6 6 7 8 9 10 12 Ans:6
  • 14. Median Middle-most Value 50% of observations are above the Median, 50% are below it The difference in magnitude between the observations does not matter Therefore, it is not sensitive to outliers Formula Median = n + 1 / 2
  • 15. To compute the median • first you rank order the values of X from low to high:  85, 94, 94, 96, 96, 96, 96, 97, 97, 98 • then count number of observations = 10. • add 1 = 11. • divide by 2 to get the middle score  the 5 ½ score here 96 is the middle score score
  • 16. Mean - Average  Most common measure of central tendency  Best for making predictions  Applicable under two conditions: 1. scores are measured at the interval level, and 2. distribution is more or less normal [symmetrical].  Symbolized as:  for the mean of a sample  μ for the mean of a population X
  • 17. Finding the MeanFinding the Mean • X = (Σ X / N) • If X = {3, 5, 10, 4, 3} X = (3 + 5 + 10 + 4 + 3) / 5 = 25 / 5 = 5 • X = (Σ X / N) • If X = {3, 5, 10, 4, 3} X = (3 + 5 + 10 + 4 + 3) / 5 = 25 / 5 = 5
  • 18. Find the Mean Q: 4, 5, 8, 7 A: 6 Median: 6.5 Q: 4, 5, 8, 1000 A: 254.25 Median: 6.5
  • 19. Why can’t the mean tell us everything? Mean describes Central Tendency, what the average outcome is. We also want to know something about how accurate the mean is when making predictions. The question becomes how good a representation of the distribution is the mean? How good is the mean as a description of central tendency -- or how good is the mean as a predictor? Answer -- it depends on the shape of the distribution. Is the distribution normal or skewed?
  • 20. Measures of Variability Central Tendency doesn’t tell us everything Dispersion/Deviation/Spread tells us a lot about how a variable is distributed. We are most interested in Standard Deviations (σ) and Variance (σ2 )
  • 21. Dispersion Once you determine that the variable of interest is normally distributed, ideally by producing a histogram of the scores, the next question to be asked about the Normally Distributed Curve is its dispersion: how spread out are the scores around the mean. Dispersion is a key concept in statistical thinking. The basic question being asked is how much do the scores deviate around the Mean? The more “bunched up” around the mean the better your ability to make accurate predictions.
  • 22. How well does the mean represent the scores in a distribution? The logic here is to determine how much spread is in the scores. How much do the scores "deviate" from the mean? Think of the mean as the true score or as your best guess. If every X were very close to the Mean, the mean would be a very good predictor. If the distribution is very sharply peaked then the mean is a good measure of central tendency and if you were to use the mean to make predictions you would be right or close much of the time.
  • 23. Mean Deviation The key concept for describing normal distributions and making predictions from them is called deviation from the mean. We could just calculate the average distance between each observation and the mean. • We must take the absolute value of the distance, otherwise they would just cancel out to zero! Formula: | |iX X n − ∑
  • 24. Mean Deviation: An ExampleMean Deviation: An Example X – Xi Abs. Dev. 7 – 6 1 7 – 10 3 7 – 5 2 7 – 4 3 7 – 9 2 7 – 8 1 1. Compute X (Average) 2. Compute X – X and take the Absolute Value to get Absolute Deviations 3. Sum the Absolute Deviations 4. Divide the sum of the absolute deviations by N Data: X = {6, 10, 5, 4, 9, 8} X = 42 / 6 = 7 Total: 12 12 / 6 = 2
  • 25. What Does it Mean? On Average, each observation is two units away from the mean. Is it Really that Easy? • No! • Absolute values are difficult to manipulate algebraically • Absolute values cause enormous problems for calculus (Discontinuity) • We need something else…
  • 26. Variance and Standard Deviation Instead of taking the absolute value, we square the deviations from the mean. This yields a positive value. This will result in measures we call the Variance and the Standard Deviation Sample- Population- s: Standard Deviation σ: Standard Deviation s2 : Variance σ2 : Variance
  • 27. Example: -1 1 3 9 -2 4 -3 9 2 4 1 1 Data: X = {6, 10, 5, 4, 9, 8}; N = 6 Total: 42 Total: 28 Standard Deviation: 7 6 42 === ∑ N X X Mean: Variance: 2 2 ( ) 28 4.67 6 X X s N − = = = ∑ 16.267.42 === ss XX − 2 )( XX −X 6 10 5 4 9 8
  • 28. Introduction to Probability Experiments, Counting Rules,Experiments, Counting Rules, and Assigning Probabilitiesand Assigning Probabilities Events and Their ProbabilityEvents and Their Probability Some Basic RelationshipsSome Basic Relationships of Probabilityof Probability Conditional ProbabilityConditional Probability
  • 29. Probability as a Numerical MeasureProbability as a Numerical Measure of the Likelihood of Occurrenceof the Likelihood of Occurrence 00 11.5.5 Increasing Likelihood of OccurrenceIncreasing Likelihood of Occurrence Probability:Probability: The eventThe event is veryis very unlikelyunlikely to occur.to occur. The occurrenceThe occurrence of the event isof the event is just as likely asjust as likely as it is unlikely.it is unlikely. The eventThe event is almostis almost certaincertain to occur.to occur.
  • 30. An Experiment and Its Sample SpaceAn Experiment and Its Sample Space AnAn experimentexperiment is any process that generatesis any process that generates well-defined outcomes.well-defined outcomes. AnAn experimentexperiment is any process that generatesis any process that generates well-defined outcomes.well-defined outcomes. TheThe sample spacesample space for an experiment is the set offor an experiment is the set of all experimental outcomes.all experimental outcomes. TheThe sample spacesample space for an experiment is the set offor an experiment is the set of all experimental outcomes.all experimental outcomes. An experimental outcome is also called aAn experimental outcome is also called a samplesample pointpoint.. An experimental outcome is also called aAn experimental outcome is also called a samplesample pointpoint..
  • 31. Events & Probabilities… An individual outcome of a sample space is called a simple event [cannot break it down into several other events], An event is a collection or set of one or more simple events in a sample space. Roll of a die: S = {1, 2, 3, 4, 5, 6} Simple event: the number “3” will be rolled Event: an even number (one of 2, 4, or 6) will be rolled
  • 32. Events & Probabilities… The probability of an event is the sum of the probabilities of the simple events that constitute the event. E.g. (assuming a fair die) S = {1, 2, 3, 4, 5, 6} and P(1) = P(2) = P(3) = P(4) = P(5) = P(6) = 1/6 Then: P(EVEN) = P(2) + P(4) + P(6) = 1/6 + 1/6 + 1/6 = 3/6 = 1/2
  • 34. Random Variables  A random variable is a variable whose value is a numerical outcome of a random phenomenon  often denoted with capital alphabetic symbols (X, Y, etc.)  a normal random variable may be denoted as X ~ N(µ, σ)  The probability distribution of a random variable X tells us what values X can take and how to assign probabilities to those values
  • 35. Random Variables  Random variables that have a finite (countable) list of possible outcomes, with probabilities assigned to each of these outcomes, are called discrete  Random variables that can take on any value in an interval, with probabilities given as areas under a density curve, are called continuous
  • 36. Random Variables  Discrete random variables  number of pets owned (0, 1, 2, … )  numerical day of the month (1, 2, …, 31)  how many days of class missed  Continuous random variables  weight  temperature  time it takes to travel to work
  • 37. Conditional Probability… Conditional probability is used to determine how two events are related; that is, we can determine the probability of one event given the occurrence of another related event. Experiment: random select one student in class. P(randomly selected student is male) = P(randomly selected student is male/student is on 3rd row) = Conditional probabilities are written as P(A | B) and read as “the probability of A given B” and is calculated as:
  • 38. Conditional Probability… Again, the probability of an event given that another event has occurred is called a conditional probability… P( A and B) = P(A)*P(B/A) = P(B)*P(A/B) both are true Keep this in mind!
  • 39. Data ExplorationSUMMARY Descriptive statistics help describe your data’s distribution A measure of central tendency and dispersion are needed to describe your data’s distribution statistically Ideally your data fits the descriptions of a normal distribution with data distributed evenly on either side of the measure of central tendency. The following are measures of central tendency: mean, median and mode The following are measure of dispersion: range, variance, and standard deviation Histograms and box plots can help you illustrate your data’s distribution Your descriptive statistics, histograms and/or box plots together help you describe the nature of your data After exploring your data using descriptive statistics it’s good to reflect on your question and modify or refine it as needed.
  • 40.