1
Basics of Statistics
A quick reference to the basics of statistics
by Shiva Dasharathi
2
Contents
1. Population & Sample
2. Descriptive Statistics
   1. Central tendency
   2. Spread (or Dispersion)
3. Normal Distribution
4. Central Limit Theorem
5. Correlation, covariance
6. Probability
7. Statistical tests & Significance level
8. Random Variables
9. Other Distributions
3
Population & Sample
Population: the complete set of items that share a characteristic of the
subject under analysis
Sample: a part of the population under study, selected so that
inferences about the population can be drawn from it
Primary data: data collected specifically for the analysis of the subject
Secondary data: data that was already collected and is reused to analyze
other factors of the subject
Measurement scales:
nominal, ordinal, interval, ratio
4
Descriptive statistics
Central Tendency: Mean
Mean: the average value
Arithmetic Mean $= \frac{\sum x_i}{N}$
Geometric Mean $= \sqrt[N]{\prod x_i}$
Harmonic Mean $= \frac{N}{\sum_{i=1}^{N} \frac{1}{x_i}}$
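The three means can be checked numerically with a short sketch. The sample `data` is a hypothetical one, chosen only because its results are easy to verify by hand; the cross-check uses Python's standard `statistics` module.

```python
import math
import statistics

data = [1, 2, 4]  # hypothetical sample, not taken from the slides
n = len(data)

arithmetic = sum(data) / n                    # (sum of x_i) / N
geometric = math.prod(data) ** (1 / n)        # N-th root of the product of x_i
harmonic = n / sum(1 / x for x in data)       # N / (sum of 1 / x_i)

print(arithmetic, geometric, harmonic)

# Cross-check against the standard library implementations.
print(statistics.mean(data), statistics.harmonic_mean(data))
```

For positive data the three values are always ordered harmonic <= geometric <= arithmetic, which is a quick sanity check on any implementation.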
5
Descriptive statistics
Central Tendency: Median
Median: the middle term of an ordered set
Median $= x_{\frac{N+1}{2}}$ : if N is an odd number
Median $= \frac{x_{\frac{N}{2}} + x_{\frac{N}{2}+1}}{2}$ : if N is an even number
Ex: S = {1,4,7,7,9,11,23}
median of S = 4th term of the set = 7
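A minimal sketch of the odd/even rule, assuming 1-based positions in the sorted data as in the formula. The helper name `median` is an illustrative choice; the standard library's `statistics.median` applies the same rule and is used here only as a cross-check.

```python
import statistics

def median(values):
    """Median by the odd/even rule; positions are 1-based in the sorted data."""
    ordered = sorted(values)
    n = len(ordered)
    if n % 2 == 1:
        # x_{(N+1)/2}; subtract 1 to convert to a 0-based index
        return ordered[(n + 1) // 2 - 1]
    # (x_{N/2} + x_{N/2+1}) / 2
    return (ordered[n // 2 - 1] + ordered[n // 2]) / 2

s = [1, 4, 7, 7, 9, 11, 23]
print(median(s))              # 7, the 4th term
print(median([1, 2, 5, 7]))   # 3.5, the average of the 2nd and 3rd terms
```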
6
Descriptive statistics
Central Tendency: Mode
Mode: the most frequently occurring term
Ex: S = {1,4,11,4,3,9,3,5,7,5,3}
mode of S = 3
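Counting occurrences makes the mode easy to verify. This sketch uses `collections.Counter` from the standard library on the example set above.

```python
from collections import Counter

s = [1, 4, 11, 4, 3, 9, 3, 5, 7, 5, 3]

counts = Counter(s)                          # value -> number of occurrences
mode, frequency = counts.most_common(1)[0]   # most frequent value and its count

print(mode, frequency)                       # 3 occurs 3 times
```

If several values tie for the highest count, `most_common(1)` returns only one of them, so data with ties needs separate handling.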
7
Descriptive statistics
Spread:
Variance of a sample: $\sigma^2 = \frac{\sum (X - \mu)^2}{N - 1}$, where $\mu$ is the mean of the sample
Standard Deviation: $\sigma = \sqrt{\text{Variance}}$
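The sample variance and standard deviation can be computed directly from the definition. The sample below is illustrative, and `statistics.variance` / `statistics.stdev` (which also divide by N - 1) serve as a cross-check.

```python
import math
import statistics

sample = [1, 2, 5, 7, 9, 11, 12, 15]   # illustrative sample
n = len(sample)
mean = sum(sample) / n

# Sum of squared deviations from the mean, divided by N - 1.
variance = sum((x - mean) ** 2 for x in sample) / (n - 1)
std_dev = math.sqrt(variance)

print(variance, std_dev)
print(statistics.variance(sample), statistics.stdev(sample))
```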
8
Descriptive statistics
For the given sample S = {1, 2, 5, 7, 9, 11, 12, 15}
Min: 1
Max: 15
Range = Max - Min
= 15 - 1 = 14
Q1: middle term in the 1st half of the ordered set = (2+5)/2 = 3.5
Q2: Median
Q3: middle term in the 2nd half of the ordered set = (11+12)/2 = 11.5
IQR: Inter Quartile Range
= Q3 - Q1
= 11.5 - 3.5
= 8
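The range and quartiles above follow the "median of each half" convention, sketched below. This is only one of several quartile conventions; library routines such as `statistics.quantiles` interpolate differently and can return other values for the same data.

```python
import statistics

s = sorted([1, 2, 5, 7, 9, 11, 12, 15])
half = len(s) // 2                     # size of each half (the middle term is skipped when N is odd)

q1 = statistics.median(s[:half])       # middle of the lower half
q2 = statistics.median(s)              # median of the whole set
q3 = statistics.median(s[-half:])      # middle of the upper half

data_range = s[-1] - s[0]              # Max - Min
iqr = q3 - q1

print(data_range, q1, q2, q3, iqr)
```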
9
Normal Distribution
Normal (or Gaussian) Distribution $N(\mu, \sigma)$:
- Is a bell curve with the mean at the center
- About 68% of the data lies within 1 standard deviation of the mean
- About 95.4% of the data lies within 2 standard deviations of the mean
- About 99.7% of the data lies within 3 standard deviations of the mean
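The 68 / 95.4 / 99.7 figures can be reproduced from the normal cumulative distribution function. The sketch uses `statistics.NormalDist` from the standard library; the helper name `coverage` is an illustrative choice.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def coverage(k):
    """Probability that a normal value lies within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(k, round(coverage(k), 4))
```

The result does not depend on the particular mean and standard deviation, which is why the standard normal is enough here.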
10
Normal Distribution
In a perfect normal distribution $N(\mu, \sigma)$,
1. the mean, median, and mode coincide
2. the data is distributed symmetrically around this common value
11
Standardized Normal Distribution
Standardized Normal Distribution $N(0, 1)$:
Is a normal distribution with mean = 0 and standard deviation = 1
To standardize the distribution,
Step 1. center the data (subtract the mean from every data point)
Step 2. divide each centered value by the standard deviation
Equivalently, $z = \frac{x - \mu}{\sigma}$
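The two standardization steps can be sketched as follows. The data set is hypothetical, and the population standard deviation (`statistics.pstdev`) is an assumption made so that the standardized values have standard deviation exactly 1.

```python
import statistics

def standardize(values):
    """Return z-scores: subtract the mean, then divide by the standard deviation."""
    mean = statistics.mean(values)
    std_dev = statistics.pstdev(values)    # population standard deviation (assumption)
    return [(x - mean) / std_dev for x in values]

data = [2, 4, 4, 4, 5, 5, 7, 9]            # hypothetical data: mean 5, standard deviation 2
z = standardize(data)

print(z)
print(statistics.mean(z), statistics.pstdev(z))
```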
12
Skewness
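Skewness: measures the asymmetry of the distribution; it is zero for a symmetric distribution, positive when the right tail is longer, and negative when the left tail is longer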
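One common way to quantify skewness is the moment coefficient, the third central moment divided by the second central moment raised to the power 1.5. The sketch below assumes that definition (other estimators exist) and uses hypothetical data sets.

```python
def skewness(values):
    """Moment coefficient of skewness: m3 / m2**1.5, using population moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n   # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n   # third central moment
    return m3 / m2 ** 1.5

symmetric = [1, 2, 3, 4, 5]          # hypothetical, symmetric around 3
right_tailed = [1, 1, 2, 2, 3, 10]   # hypothetical, long right tail

print(skewness(symmetric))           # zero for symmetric data
print(skewness(right_tailed))        # positive for a long right tail
```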
13
Central Limit Theorem
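The means of samples of size n are approximately distributed as $N\left(\mu_p, \frac{\sigma_p}{\sqrt{n}}\right)$, that is, as $N(\mu, \sigma)$ with
$\mu = \mu_p$
$\sigma = \frac{\sigma_p}{\sqrt{n}}$
where
$\mu_p$: population mean
$\sigma_p$: population standard deviation
$n$: sample size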
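A small simulation illustrates the theorem. Everything here is an assumption made for the sketch: the population is Uniform(0, 1), the sample size is 30, and 5000 samples are drawn with a fixed seed. The spread of the sample means should come out close to the population standard deviation divided by the square root of n.

```python
import math
import random
import statistics

random.seed(0)                        # fixed seed so the run is repeatable

n = 30                                # sample size
trials = 5000                         # number of samples drawn
mu_p = 0.5                            # mean of the Uniform(0, 1) population
sigma_p = math.sqrt(1 / 12)           # standard deviation of Uniform(0, 1)

# Mean of each sample of size n.
sample_means = [
    statistics.fmean(random.random() for _ in range(n))
    for _ in range(trials)
]

observed_mu = statistics.fmean(sample_means)
observed_sigma = statistics.stdev(sample_means)
expected_sigma = sigma_p / math.sqrt(n)

print(observed_mu, mu_p)
print(observed_sigma, expected_sigma)
```

The population is far from bell-shaped, yet a histogram of `sample_means` would look approximately normal, which is the point of the theorem.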
14
Correlation
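Correlation: $Cor(X,Y) \in [-1, 1]$
- Correlation indicates the strength of the linear association
between two variables
- it ranges from -1 to +1
cor(x,y) = 1 // perfectly positively correlated
cor(x,y) = -1 // perfectly negatively correlated
cor(x,y) = 0 // no linear association (independent variables have zero correlation, but zero correlation does not by itself imply independence)
$Cor(X,Y) = \frac{Cov(X,Y)}{\sigma_X \sigma_Y}$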
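The three reference values can be reproduced with a short sketch that uses the standard Pearson definition, covariance divided by the product of the standard deviations. The helper names and data are illustrative; the last case shows data that is clearly dependent (y is the square of x) and still has zero correlation.

```python
import math

def covariance(xs, ys):
    """Population covariance: the average product of deviations from the means."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / len(xs)

def correlation(xs, ys):
    """Covariance normalized by the standard deviations of both variables."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))

x = [-2, -1, 0, 1, 2]
rising = [2 * v + 1 for v in x]     # increases linearly with x
falling = [-v for v in x]           # decreases linearly with x
squared = [v * v for v in x]        # depends on x, but not linearly

print(correlation(x, rising), correlation(x, falling), correlation(x, squared))
```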
15
Correlation (cont.)
[Figure panels: perfect correlation; weak correlation]
16
Covariance
Covariance: a measure of how two variables change together,
$Cov(X,Y) = E[(X - \mu_X)(Y - \mu_Y)]$
- Correlation is the covariance normalized by the product of the standard
deviations of the 2 variables
17
Probability
Terminology:
Sample space: the set of all possible outcomes of an experiment
Event: a subset of the sample space
Probability: a measure of how likely an event is to occur,
relative to all the possible outcomes.
Probability of 'event x' (for equally likely outcomes):
$p(x) = \frac{\text{number of outcomes in which } x \text{ occurs}}{\text{number of all possible outcomes}}$
Example:
Experiment: tossing a fair coin
Sample Space = {HEAD, TAIL}
Probability of 'Head' = P(HEAD) = 1/2 = 0.5
Probability of 'Tail' = P(TAIL) = 1/2 = 0.5
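For equally likely outcomes the definition reduces to counting, as in this sketch. The helper name `probability` and the die example are illustrative additions; `Fraction` keeps the results exact.

```python
from fractions import Fraction

def probability(event, sample_space):
    """P(event) for equally likely outcomes: favorable outcomes / all outcomes."""
    favorable = event & sample_space          # ignore anything outside the sample space
    return Fraction(len(favorable), len(sample_space))

coin = {"HEAD", "TAIL"}
die = {1, 2, 3, 4, 5, 6}                      # illustrative second experiment

p_head = probability({"HEAD"}, coin)
p_even = probability({2, 4, 6}, die)

print(p_head, p_even)                         # 1/2 and 1/2
```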
18
Probability cont..
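Axioms and basic rules of probability
1. $p(x) \in [0, 1]$
2. $p^c(x) = 1 - p(x)$, where $p^c(x)$ is the probability that $x$ does not occur; an event and its complement are mutually exclusive
3. The probabilities of all the mutually exclusive outcomes in a sample space
sum to 1: $p(x_1) + p(x_2) + \dots = 1$
4. $P(A \cup B) = P(A) + P(B) - P(A \cap B)$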
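These rules can be verified on a small sample space. The sketch assumes a fair six-sided die and two illustrative events, A (even number) and B (greater than 3).

```python
from fractions import Fraction

die = {1, 2, 3, 4, 5, 6}

def p(event):
    """Probability of an event on a fair die (equally likely outcomes)."""
    return Fraction(len(event & die), len(die))

a = {2, 4, 6}          # even number
b = {4, 5, 6}          # greater than 3

union_direct = p(a | b)                      # P(A or B), counted directly
union_by_rule = p(a) + p(b) - p(a & b)       # P(A) + P(B) - P(A and B)
complement = p(die - a)                      # probability that A does not occur
total = sum(p({outcome}) for outcome in die) # all single outcomes together

print(union_direct, union_by_rule, complement, total)
```

Subtracting P(A and B) matters because the outcomes 4 and 6 belong to both events and would otherwise be counted twice.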
19
Conditional Probability
Conditional Probability: the probability of
event A given that event B has occurred
$P(A \mid B) = \frac{P(A \cap B)}{P(B)}$
Where,
$P(A \mid B)$: Probability of A conditioned on B
$P(A \cap B)$: Joint probability of A and B
$P(B)$: Probability of B
For independent events A, B:
$P(A \cap B) = P(A) \cdot P(B)$
$\therefore P(A \mid B) = \frac{P(A) \cdot P(B)}{P(B)} = P(A)$
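A fair die gives a compact check of both statements. The events below are illustrative: "even" and "high" are dependent, while "even" and "low" happen to be independent, so conditioning on "low" leaves the probability of "even" unchanged.

```python
from fractions import Fraction

die = {1, 2, 3, 4, 5, 6}

def p(event):
    """Probability of an event on a fair die."""
    return Fraction(len(event & die), len(die))

def conditional(a, b):
    """P(A | B) = P(A and B) / P(B); requires P(B) > 0."""
    return p(a & b) / p(b)

even = {2, 4, 6}
high = {4, 5, 6}
low = {1, 2}

print(conditional(even, high))               # 2/3, differs from P(even) = 1/2
print(conditional(even, low), p(even))       # both 1/2
print(p(even & low) == p(even) * p(low))     # True: the product rule holds
```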
20
Bayes Rule
Bayes Rule: Posterior * evidence = Prior * likelihood
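from conditional probability,
$P(A \mid B) = \frac{P(A \cap B)}{P(B)} \Rightarrow P(A \cap B) = P(A \mid B) \cdot P(B)$
$P(B \mid A) = \frac{P(A \cap B)}{P(A)} \Rightarrow P(A \cap B) = P(B \mid A) \cdot P(A)$
$\therefore P(A \mid B) \cdot P(B) = P(B \mid A) \cdot P(A)$
$P(A \mid B) = \frac{P(A) \cdot P(B \mid A)}{P(B)}$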
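A worked instance of Bayes' rule, with A = "has the condition" and B = "test is positive". All numbers are hypothetical and chosen only for illustration; the evidence P(B) is obtained by summing over both cases of A.

```python
from fractions import Fraction

# Hypothetical inputs (not from the slides).
p_a = Fraction(1, 100)               # prior P(A)
p_b_given_a = Fraction(95, 100)      # likelihood P(B | A)
p_b_given_not_a = Fraction(5, 100)   # P(B | not A)

# Evidence: P(B) = P(B | A) P(A) + P(B | not A) P(not A)
p_b = p_b_given_a * p_a + p_b_given_not_a * (1 - p_a)

# Posterior: P(A | B) = P(A) P(B | A) / P(B)
p_a_given_b = p_a * p_b_given_a / p_b

print(p_b, p_a_given_b, float(p_a_given_b))
```

With a small prior, the posterior stays modest even though the likelihood is high, because most positives come from the much larger "not A" group.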
21
Naïve Bayesian
Naïve Bayesian: Bayes Rule + Conditional independence
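If 'E' is the event conditioned on x1, x2, .. xn, then
by applying the chain rule and the independence condition we
get the final equation
$P(E \mid x_1, x_2, \dots, x_n) = \frac{P(E) \cdot P(x_1 \mid E) \cdot P(x_2 \mid E) \cdots P(x_n \mid E)}{P(x_1, x_2, \dots, x_n)}$
Since the denominator does not depend on E,
$P(E \mid x_1, x_2, \dots, x_n) \propto P(E) \cdot P(x_1 \mid E) \cdot P(x_2 \mid E) \cdots P(x_n \mid E)$
$\propto P(E) \cdot \prod_{i=1}^{n} P(x_i \mid E)$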
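The proportionality can be turned into a tiny classifier sketch: score each candidate event by its prior times the product of the per-feature likelihoods, then normalize so the scores sum to 1. The class names, features, and probabilities are all hypothetical.

```python
import math

# Hypothetical priors P(E) and per-feature likelihoods P(x_i | E).
priors = {"spam": 0.4, "ham": 0.6}
likelihoods = {
    "spam": {"offer": 0.5, "free": 0.6},
    "ham": {"offer": 0.1, "free": 0.2},
}
observed = ["offer", "free"]          # features x_1 .. x_n seen in one message

# Unnormalized score: P(E) * product of P(x_i | E)
scores = {
    label: priors[label] * math.prod(likelihoods[label][x] for x in observed)
    for label in priors
}

# Dividing by the sum of the scores plays the role of the evidence term.
evidence = sum(scores.values())
posteriors = {label: score / evidence for label, score in scores.items()}

print(posteriors)
```

Because the evidence is the same for every candidate, picking the most probable event only needs the unnormalized scores.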
22
Statistical tests & Significance level
t-test:
z-test:
Chi-square test:
ANOVA:
Factor ANOVA:
H0 – Null Hypothesis
H1 – Alternative Hypothesis
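p-value: the probability, assuming the null hypothesis is true, of observing a result at least as extreme as the one obtained; H0 is rejected when the p-value falls below the chosen significance level
Type-1 error: rejecting a null hypothesis that is true
Type-2 error: failing to reject a null hypothesis that is false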
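As one concrete instance of these terms, here is a sketch of a two-sided one-sample z-test. It assumes the population standard deviation is known; the function name, the inputs, and the 0.05 significance level are hypothetical choices for illustration.

```python
import math
from statistics import NormalDist

def one_sample_z_test(sample_mean, mu0, sigma, n):
    """Return the z statistic and two-sided p-value for H0: population mean == mu0."""
    z = (sample_mean - mu0) / (sigma / math.sqrt(n))
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))   # probability in both tails
    return z, p_value

# Hypothetical study: 25 observations averaging 52, with known sigma = 5.
z, p_value = one_sample_z_test(sample_mean=52, mu0=50, sigma=5, n=25)

alpha = 0.05                      # significance level chosen before the test
reject_h0 = p_value < alpha

print(z, round(p_value, 4), reject_h0)
```

The significance level is also the Type-1 error rate the analyst accepts: with alpha = 0.05, a true null hypothesis is rejected about 5% of the time.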
23
Random Variable
Random Variables & Probability Distribution:
Discrete:
Probability mass function (PMF)
Cumulative distribution function (CDF)
Continuous:
Probability density function (PDF)
Cumulative distribution function (CDF)
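The discrete and continuous cases can be contrasted in a few lines. The fair die and the standard normal are illustrative choices: for the die, the cumulative function sums point probabilities; for the normal, `NormalDist.cdf` gives the area under the density.

```python
from fractions import Fraction
from statistics import NormalDist

# Discrete: probability mass function of a fair die.
pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def cdf(x):
    """P(X <= x): sum of the probability mass over all values not exceeding x."""
    return sum((prob for value, prob in pmf.items() if value <= x), Fraction(0))

print(cdf(3), cdf(6))              # 1/2 and 1

# Continuous: density and cumulative probability of the standard normal.
normal = NormalDist()
density_at_zero = normal.pdf(0)    # a density value, not a probability
prob_below_zero = normal.cdf(0)    # P(X <= 0)

print(density_at_zero, prob_below_zero)
```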
24
A Few Other Distributions
Bernoulli distribution:
Binomial Distribution:
Poisson Distribution:
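The probability mass functions of these three distributions follow their standard textbook definitions and fit in a few lines. The function names and parameter values are illustrative.

```python
import math

def bernoulli_pmf(k, p):
    """P(X = k) for one trial with success probability p; k is 0 or 1."""
    if k == 1:
        return p
    if k == 0:
        return 1 - p
    return 0.0

def binomial_pmf(k, n, p):
    """P(X = k): exactly k successes in n independent trials."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k): k events when events occur at an average rate lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

print(bernoulli_pmf(1, 0.3))
print(binomial_pmf(2, 4, 0.5))
print(poisson_pmf(0, 2.0))
```

A binomial distribution with n = 1 is a Bernoulli distribution, which gives a quick consistency check between the first two functions.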