outliers

Arpan Paul, student at Brainware University
BBADMC
102
BWU-BBD-21-014
Introduction to Box Plots: what is a box plot
Representing the Data: via box plot
Effect on Mean, Median, Mode: example and theory
What Are Outliers: definition and example
Types of Outliers: describing 3 types
Reasons for Outliers: 4 reasons for outliers
How to Find Outliers: steps and example
What Are Outliers?
Outliers are extreme values: extremely high or low values in a data set.
In a data set, outliers may include the sample maximum, the sample minimum, or both.
They can indicate that the distribution is heavy-tailed or highly skewed.
Example 1: 1.5, 2.1, 2.5, 3, 3.1, 3.7, 4.1, 5, 9.5 (the extreme value is 9.5)
Example 2: 1, 3, 5, 6, 7, 9, 10, 12, 22 (the extreme value is 22)
Example 3: 15, 1, 3, 3.5, 2.1, 4.2, 3.1, 5 (the extreme value is 15)
Types of Outliers
[Diagram: outliers branch into global, contextual, and collective outliers]
a. Global outliers.
When a data object differs significantly from the rest of the data set, it is considered a global outlier.
b. Contextual outliers.
A contextual outlier is a data object that is anomalous within its context or neighborhood, even if the same value would be unremarkable elsewhere.
c. Collective outliers.
If a collection of related data instances is anomalous with respect to the entire data set, it is termed a collective outlier, even if the individual instances are not outliers on their own.
[Figure: three panels illustrating (a) a global outlier, (b) a contextual outlier, and (c) a collective outlier]
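To make the difference between a global and a contextual outlier concrete, here is a minimal Python sketch. The temperature readings, the season labels, and the helper names are hypothetical and do not come from the slides. The check is the 1.5 × IQR rule that this deck describes under "How to Identify Outliers", with quartiles assumed to be the medians of the lower and upper halves.

```python
from statistics import median

def iqr_outliers(values, k=1.5):
    """Return the values outside Q1 - k*IQR .. Q3 + k*IQR (needs >= 2 values)."""
    ordered = sorted(values)
    half = len(ordered) // 2
    q1 = median(ordered[:half])    # median of the lower half
    q3 = median(ordered[-half:])   # median of the upper half
    step = k * (q3 - q1)
    return [v for v in ordered if v < q1 - step or v > q3 + step]

# Hypothetical daily temperatures in degrees Celsius, grouped by context (season)
readings = {
    "winter": [2, 3, 4, 5, 6, 28],
    "summer": [25, 26, 27, 28, 29, 30],
}

# Global check: pool every reading and ignore the context
all_values = [v for group in readings.values() for v in group]
global_outliers = iqr_outliers(all_values)

# Contextual check: apply the same rule inside each season
contextual_outliers = {season: iqr_outliers(group) for season, group in readings.items()}

print("global:", global_outliers)
print("contextual:", contextual_outliers)
```

In this made-up data, 28 is flagged only inside the winter group; pooled with the summer readings it is not flagged at all, which is what makes it contextual rather than global.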
Reasons for Outliers
• Data entry error.
• Instrumental error.
• System faults.
• Measurement error.
How to Identify Outliers?
A data value less than Q1 – 1.5 × IQR or greater than Q3 + 1.5 × IQR can be considered an outlier, where IQR = Q3 – Q1 is the interquartile range.
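The rule above can be written as a small predicate. This is only a sketch: the function name `is_outlier`, the parameter `k`, and the quartile values in the usage lines are assumptions chosen for illustration, not part of the slides.

```python
def is_outlier(value, q1, q3, k=1.5):
    """Return True if value lies below Q1 - k*IQR or above Q3 + k*IQR."""
    iqr = q3 - q1                  # interquartile range
    lower_limit = q1 - k * iqr
    upper_limit = q3 + k * iqr
    return value < lower_limit or value > upper_limit

# Hypothetical quartiles: Q1 = 10 and Q3 = 20 give limits of -5 and 35
print(is_outlier(30, q1=10, q3=20))   # False, 30 is inside the limits
print(is_outlier(36, q1=10, q3=20))   # True, 36 is above the upper limit
```

The multiplier is left as a parameter because 1.5 is a convention rather than a law; larger values flag only more extreme points.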
Steps to Find Outliers
1. Arrange the data in order from lowest to highest and find Q1 and Q3.
2. Find the interquartile range: IQR = Q3 – Q1.
3. Multiply the IQR by 1.5.
4. Subtract the result of step 3 from Q1 and add it to Q3 to get the lower and upper limits.
5. Check the data set for any value that is smaller than Q1 – 1.5 × IQR or larger than Q3 + 1.5 × IQR.
Example
Data: 10, 11, 15, 25, 35, 30, 7, 68
1. Arrange the data and find Q1 and Q3.
7, 10, 11, 15, 25, 30, 35, 68
Q1 = (10 + 11) / 2 = 10.5 and Q3 = (30 + 35) / 2 = 32.5 (the medians of the lower and upper halves)
2. Find the IQR (Q3 – Q1).
= 32.5 – 10.5
= 22
3. Multiply the IQR by 1.5.
= 1.5 × 22 = 33
4. Subtract 1.5 × IQR from Q1 and add it to Q3.
10.5 – 33 = -22.5
32.5 + 33 = 65.5
5. Check the data set for any value that is smaller than -22.5 or larger than 65.5.
68 is the only outlier.
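The five steps can be sketched in Python and checked against the example data. The function name `find_outliers` and the returned keys are hypothetical. Quartiles are assumed to be the medians of the lower and upper halves, which is the convention that reproduces Q1 = 10.5 and Q3 = 32.5 here; library routines that interpolate between values may return slightly different quartiles.

```python
from statistics import median

def find_outliers(data, k=1.5):
    """Apply the 1.5 x IQR procedure step by step (needs at least 2 values)."""
    ordered = sorted(data)                 # step 1: arrange the data
    half = len(ordered) // 2
    q1 = median(ordered[:half])            # step 1: Q1, median of the lower half
    q3 = median(ordered[-half:])           # step 1: Q3, median of the upper half
    iqr = q3 - q1                          # step 2: interquartile range
    step = k * iqr                         # step 3: multiply the IQR by 1.5
    lower = q1 - step                      # step 4: lower limit
    upper = q3 + step                      # step 4: upper limit
    outliers = [x for x in ordered if x < lower or x > upper]   # step 5
    return {"q1": q1, "q3": q3, "iqr": iqr,
            "lower": lower, "upper": upper, "outliers": outliers}

result = find_outliers([10, 11, 15, 25, 35, 30, 7, 68])
print(result)
# q1 = 10.5, q3 = 32.5, iqr = 22.0, lower = -22.5, upper = 65.5, outliers = [68]
```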
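Box Plots at a Glance
[Figure: labeled box plot showing the minimum value, Q1, Q2 (the median), Q3, the maximum value, the IQR (Q3 – Q1) spanning the box, and outliers plotted beyond the whiskers on both sides]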
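Represent in a Box Plot
Q1 = 10.5
Q2 (median) = 20
Q3 = 32.5
Outlier = 68
Min value = 7
Max value = 35 (the largest value that is not an outlier)
[Figure: box plot of the example data, marked 7 (min), 10.5 (Q1), 20 (Q2, median), 32.5 (Q3), 35 (max), and 68 (outlier)]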
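The values needed to draw this box plot can be computed with a short Python sketch. The function name `boxplot_summary` is hypothetical. It assumes quartiles are the medians of the lower and upper halves, and it assumes the common convention that the whiskers end at the most extreme values that are not outliers.

```python
from statistics import median

def boxplot_summary(data, k=1.5):
    """Return the values needed to draw a box plot (needs at least 2 values)."""
    ordered = sorted(data)
    half = len(ordered) // 2
    q1 = median(ordered[:half])
    q2 = median(ordered)                   # Q2 is the median of all the data
    q3 = median(ordered[-half:])
    step = k * (q3 - q1)
    lower, upper = q1 - step, q3 + step
    inside = [x for x in ordered if lower <= x <= upper]
    outliers = [x for x in ordered if x < lower or x > upper]
    return {"min": inside[0], "q1": q1, "q2": q2, "q3": q3,
            "max": inside[-1], "outliers": outliers}

summary = boxplot_summary([10, 11, 15, 25, 35, 30, 7, 68])
print(summary)
# min = 7, q1 = 10.5, q2 = 20.0, q3 = 32.5, max = 35, outliers = [68]
```

Note that "min" and "max" here are the whisker ends, so 68 is reported separately rather than as the maximum.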
Effect on Mean, Median, Mode
[Figure: data without an outlier vs. with an outlier]
Effects
Mode is usually not affected by an outlier.
Median is only slightly affected.
Mean is affected the most, because it is computed from every value in the data.
• A low outlier tends to shift the mean downward more than the median.
• A high outlier tends to shift the mean upward more than the median.
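A quick way to see these effects is to compute the three measures with and without a high outlier. The data set below is hypothetical (it is not from the slides) and was chosen so that it has a clear mode; the helper name `summarize` is also an assumption.

```python
from statistics import mean, median, mode

def summarize(data):
    """Return the mean, median, and mode of the data."""
    return {"mean": mean(data), "median": median(data), "mode": mode(data)}

without_outlier = [2, 3, 3, 4, 5]          # hypothetical data
with_outlier = without_outlier + [50]      # same data plus one high outlier

before = summarize(without_outlier)
after = summarize(with_outlier)

for name in ("mean", "median", "mode"):
    print(f"{name}: {before[name]:.2f} -> {after[name]:.2f}")
# mean: 3.40 -> 11.17
# median: 3.00 -> 3.50
# mode: 3.00 -> 3.00
```

The single high value more than triples the mean, moves the median by half a unit, and leaves the mode unchanged.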
Quote
"A single death is a tragedy; a million deaths is a statistic."
- Joseph Stalin
Thank you…