SlideShare une entreprise Scribd logo
1  sur  11
Télécharger pour lire hors ligne
Naïve Bayes Classifiers
Dae-Ki Kang
Probability
• What is the probability that you get hit in the
head with bird droppings when you go outside?
▫ Initially shown in one episode of Korea TV show
▫ Asked in one BBS of an elite group in Korea
 no one gave the right answer
• It is difficult to define “probability”
▫ As difficult as defining “science”, “history”, etc.
• 16 different definitions
• Frequentist vs. Bayesian
Bayesian Probability
• Can assign probability on any statement
• Can be used to show the degree of one’s belief, or
degree of one’s knowledge.
• Example of subjectiveness: Alice, Bob, and Carol
▫ Alice tosses a new coin 10 times  3/10 heads
▫ Bob tosses the coin 10 times  4/10 heads
▫ Carol has watched them all  7/20 heads
• Probabilities in Monty Hall
▫ MC knows where the car is  1
▫ Guest asked again after MC opened the door  2/3
▫ Viewer started watching the show after MC opened the
door  1/2
Bayes Theorem
• Probability – P(A)
• Conditional Probability – P(A|C)
• P(A|B) = P(A,B)/P(B)
• P(B|A) = P(A,B)/P(A)
• P(A|B) = {P(B|A)P(A)}/P(B)
• P(C|D) = {P(D|C)P(C)}/P(D)
▫ P(C|D) : The probability of class given data (Posterior
Probability)
▫ P(C) : Prior Probability
▫ P(D|C) : Likelihood
▫ P(D) : constant, so ignored
Does patient have cancer or not?
• A patient takes a lab test and the result comes back positive. It
is known that the test returns a correct positive result in only
99% of the cases and a correct negative result in only 95% of t
he cases. Furthermore, only 0.03 of the entire population has
this disease.
1. What is the probability that this patient has cancer?
2. What is the probability that he does not have cancer?
3. What is the diagnosis?
Bayes Classifier
• Recall P(C|X1,X2,…Xn) =
P(C)P(X1,X2,…Xn|C)/P(X1,X2,…Xn)
•  P(C|X1,X2,…Xn) α P(C)P(X1,X2,…Xn|C)
• Maximum A Posteriori hypothesis
▫ hMAP = argmax P(D|h)P(h)
• Maximum Likelihood hypothesis
▫ hML = argmax P(D|h)
Naïve Bayes Classifier
• Recall P(C|X1,X2,…Xn) =
P(C)P(X1,X2,…Xn|C)/P(X1,X2,…Xn)
•  P(C|X1,X2,…Xn) α P(C)P(X1,X2,…Xn|C)
• It is hard to calculate joint probabilities
▫ Too many cases
▫ Too little data
• If we ignore the dependences among X1,X2,…Xn?
▫ Why? Because we are naïve.
▫ More precisely, suppose X1,X2,…Xn are conditionally
independent of each other given C
• P(C|X1,X2,…Xn) α P(C) * Π P(Xi|C)
Question: For the day <sunny, cool, high, strong>, what’s
the play prediction?
Wind
Humidity
Temp
Outlook
PlayP(Play=Y) 9/14
P(Play=N) 4/14
P(Sunny|P) P(Overcast|P) P(Rain|P)
Play=
Y
2/9 4/9 3/9
Play=
N
3/5 0/5 2/5
P(Hot|P) P(Mild|P) P(Cool|P)
Play=
Y
2/9 4/9 3/9
Play=
N
2/5 2/5 1/5
Zero Occurrence
• When a feature is never occurred in the training
set  zero frequency  PANIC: makes all terms
zero
• Smoothing the distribution
▫ Laplacian Smoothing
▫ Dirichlet Priors Smoothing
▫ and many more (Absolute Discouting, Jelinek-
Mercer smoothing, Katz smoothing, Good-Turing
smoothing, etc.)
Extension of NBC
• NBC is actually very effective
• Selective Bayesian Classifiers (SBC)
• Tree Augmented Naïve Bayes (TAN)
▫ ChowLiu algorithm (CL-TAN)
▫ SuperParent algorithm (SP-TAN)
• Attribute Value Taxonomy Guided Naïve Bayes
Learner (AVT-NBL)
• n-gram Augmented Naïve Bayes

Contenu connexe

Tendances

Probability&Bayes theorem
Probability&Bayes theoremProbability&Bayes theorem
Probability&Bayes theoremimran iqbal
 
Probability & Bayesian Theorem
Probability & Bayesian TheoremProbability & Bayesian Theorem
Probability & Bayesian TheoremAzmi Mohd Tamil
 
An introduction to Bayesian Statistics using Python
An introduction to Bayesian Statistics using PythonAn introduction to Bayesian Statistics using Python
An introduction to Bayesian Statistics using Pythonfreshdatabos
 
Introduction of Probability
Introduction of ProbabilityIntroduction of Probability
Introduction of Probabilityrey castro
 
Pre-Cal 40S Slides May 17, 2007
Pre-Cal 40S Slides May 17, 2007Pre-Cal 40S Slides May 17, 2007
Pre-Cal 40S Slides May 17, 2007Darren Kuropatwa
 
Complements conditional probability bayes theorem
Complements  conditional probability bayes theorem  Complements  conditional probability bayes theorem
Complements conditional probability bayes theorem Long Beach City College
 
Formulas of Probability :Class 12 maths
Formulas of Probability :Class 12 mathsFormulas of Probability :Class 12 maths
Formulas of Probability :Class 12 mathssumanmathews
 
Probability By Ms Aarti
Probability By Ms AartiProbability By Ms Aarti
Probability By Ms Aartikulachihansraj
 
1616 probability-the foundation of probability theory
1616 probability-the foundation of probability theory1616 probability-the foundation of probability theory
1616 probability-the foundation of probability theoryDr Fereidoun Dejahang
 
Probability basics and bayes' theorem
Probability basics and bayes' theoremProbability basics and bayes' theorem
Probability basics and bayes' theoremBalaji P
 
Conditional probability, and probability trees
Conditional probability, and probability treesConditional probability, and probability trees
Conditional probability, and probability treesGlobal Polis
 

Tendances (20)

Probability
ProbabilityProbability
Probability
 
Probability[1]
Probability[1]Probability[1]
Probability[1]
 
Bayes theorem
Bayes theoremBayes theorem
Bayes theorem
 
Nossi ch 10
Nossi ch 10Nossi ch 10
Nossi ch 10
 
Probability&Bayes theorem
Probability&Bayes theoremProbability&Bayes theorem
Probability&Bayes theorem
 
Probability & Bayesian Theorem
Probability & Bayesian TheoremProbability & Bayesian Theorem
Probability & Bayesian Theorem
 
Probability Concepts
Probability ConceptsProbability Concepts
Probability Concepts
 
Pre-Cal 40S June 1, 2009
Pre-Cal 40S June 1, 2009Pre-Cal 40S June 1, 2009
Pre-Cal 40S June 1, 2009
 
An introduction to Bayesian Statistics using Python
An introduction to Bayesian Statistics using PythonAn introduction to Bayesian Statistics using Python
An introduction to Bayesian Statistics using Python
 
Lecture on Statistics 1
Lecture on Statistics 1Lecture on Statistics 1
Lecture on Statistics 1
 
Counting
Counting  Counting
Counting
 
Introduction of Probability
Introduction of ProbabilityIntroduction of Probability
Introduction of Probability
 
Pre-Cal 40S Slides May 17, 2007
Pre-Cal 40S Slides May 17, 2007Pre-Cal 40S Slides May 17, 2007
Pre-Cal 40S Slides May 17, 2007
 
Complements conditional probability bayes theorem
Complements  conditional probability bayes theorem  Complements  conditional probability bayes theorem
Complements conditional probability bayes theorem
 
Formulas of Probability :Class 12 maths
Formulas of Probability :Class 12 mathsFormulas of Probability :Class 12 maths
Formulas of Probability :Class 12 maths
 
Probability By Ms Aarti
Probability By Ms AartiProbability By Ms Aarti
Probability By Ms Aarti
 
1616 probability-the foundation of probability theory
1616 probability-the foundation of probability theory1616 probability-the foundation of probability theory
1616 probability-the foundation of probability theory
 
Probability basics and bayes' theorem
Probability basics and bayes' theoremProbability basics and bayes' theorem
Probability basics and bayes' theorem
 
Probability
ProbabilityProbability
Probability
 
Conditional probability, and probability trees
Conditional probability, and probability treesConditional probability, and probability trees
Conditional probability, and probability trees
 

En vedette

An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier ananth
 
Naive bayes
Naive bayesNaive bayes
Naive bayesabaldove
 
Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027ankitgadgil
 

En vedette (6)

Lecture10 - Naïve Bayes
Lecture10 - Naïve BayesLecture10 - Naïve Bayes
Lecture10 - Naïve Bayes
 
An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier An Overview of Naïve Bayes Classifier
An Overview of Naïve Bayes Classifier
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027Dwdm naive bayes_ankit_gadgil_027
Dwdm naive bayes_ankit_gadgil_027
 
Naive bayes
Naive bayesNaive bayes
Naive bayes
 
Naive Bayes Presentation
Naive Bayes PresentationNaive Bayes Presentation
Naive Bayes Presentation
 

Similaire à 2013-1 Machine Learning Lecture 03 - Naïve Bayes Classifiers

Similaire à 2013-1 Machine Learning Lecture 03 - Naïve Bayes Classifiers (20)

Discrete PDs(1).pdf
Discrete PDs(1).pdfDiscrete PDs(1).pdf
Discrete PDs(1).pdf
 
2.statistical DEcision makig.pptx
2.statistical DEcision makig.pptx2.statistical DEcision makig.pptx
2.statistical DEcision makig.pptx
 
Probability 23321
Probability 23321Probability 23321
Probability 23321
 
Probability 23321
Probability 23321Probability 23321
Probability 23321
 
Probability
ProbabilityProbability
Probability
 
Probablity
ProbablityProbablity
Probablity
 
Bayes NIPT
Bayes NIPTBayes NIPT
Bayes NIPT
 
Probability Theory for Data Scientists
Probability Theory for Data ScientistsProbability Theory for Data Scientists
Probability Theory for Data Scientists
 
Probabilty1.pptx
Probabilty1.pptxProbabilty1.pptx
Probabilty1.pptx
 
Statistics 1 (FPN) QP
Statistics 1 (FPN) QPStatistics 1 (FPN) QP
Statistics 1 (FPN) QP
 
Probability 4.2
Probability 4.2Probability 4.2
Probability 4.2
 
5. RV and Distributions.pptx
5. RV and Distributions.pptx5. RV and Distributions.pptx
5. RV and Distributions.pptx
 
PROBABILITY THEORIES.pptx
PROBABILITY THEORIES.pptxPROBABILITY THEORIES.pptx
PROBABILITY THEORIES.pptx
 
Lecture_5Conditional_Probability_Bayes_T.pptx
Lecture_5Conditional_Probability_Bayes_T.pptxLecture_5Conditional_Probability_Bayes_T.pptx
Lecture_5Conditional_Probability_Bayes_T.pptx
 
Basic Probability Distribution
Basic Probability Distribution Basic Probability Distribution
Basic Probability Distribution
 
Ch7 gene disorders and pedigree analysis
Ch7 gene disorders and pedigree analysisCh7 gene disorders and pedigree analysis
Ch7 gene disorders and pedigree analysis
 
Probability and Distribution
Probability and DistributionProbability and Distribution
Probability and Distribution
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
 
Sqqs1013 ch5-a122
Sqqs1013 ch5-a122Sqqs1013 ch5-a122
Sqqs1013 ch5-a122
 
Chapter7ppt.pdf
Chapter7ppt.pdfChapter7ppt.pdf
Chapter7ppt.pdf
 

Plus de Dongseo University

Lecture_NaturalPolicyGradientsTRPOPPO.pdf
Lecture_NaturalPolicyGradientsTRPOPPO.pdfLecture_NaturalPolicyGradientsTRPOPPO.pdf
Lecture_NaturalPolicyGradientsTRPOPPO.pdfDongseo University
 
Evolutionary Computation Lecture notes03
Evolutionary Computation Lecture notes03Evolutionary Computation Lecture notes03
Evolutionary Computation Lecture notes03Dongseo University
 
Evolutionary Computation Lecture notes02
Evolutionary Computation Lecture notes02Evolutionary Computation Lecture notes02
Evolutionary Computation Lecture notes02Dongseo University
 
Evolutionary Computation Lecture notes01
Evolutionary Computation Lecture notes01Evolutionary Computation Lecture notes01
Evolutionary Computation Lecture notes01Dongseo University
 
Average Linear Selection Algorithm
Average Linear Selection AlgorithmAverage Linear Selection Algorithm
Average Linear Selection AlgorithmDongseo University
 
Lower Bound of Comparison Sort
Lower Bound of Comparison SortLower Bound of Comparison Sort
Lower Bound of Comparison SortDongseo University
 
Running Time of Building Binary Heap using Array
Running Time of Building Binary Heap using ArrayRunning Time of Building Binary Heap using Array
Running Time of Building Binary Heap using ArrayDongseo University
 
Proof By Math Induction Example
Proof By Math Induction ExampleProof By Math Induction Example
Proof By Math Induction ExampleDongseo University
 
Estimating probability distributions
Estimating probability distributionsEstimating probability distributions
Estimating probability distributionsDongseo University
 
2018-2 Machine Learning (Wasserstein GAN and BEGAN)
2018-2 Machine Learning (Wasserstein GAN and BEGAN)2018-2 Machine Learning (Wasserstein GAN and BEGAN)
2018-2 Machine Learning (Wasserstein GAN and BEGAN)Dongseo University
 
2018-2 Machine Learning (Linear regression, Logistic regression)
2018-2 Machine Learning (Linear regression, Logistic regression)2018-2 Machine Learning (Linear regression, Logistic regression)
2018-2 Machine Learning (Linear regression, Logistic regression)Dongseo University
 
2017-2 ML W9 Reinforcement Learning #5
2017-2 ML W9 Reinforcement Learning #52017-2 ML W9 Reinforcement Learning #5
2017-2 ML W9 Reinforcement Learning #5Dongseo University
 

Plus de Dongseo University (20)

Lecture_NaturalPolicyGradientsTRPOPPO.pdf
Lecture_NaturalPolicyGradientsTRPOPPO.pdfLecture_NaturalPolicyGradientsTRPOPPO.pdf
Lecture_NaturalPolicyGradientsTRPOPPO.pdf
 
Evolutionary Computation Lecture notes03
Evolutionary Computation Lecture notes03Evolutionary Computation Lecture notes03
Evolutionary Computation Lecture notes03
 
Evolutionary Computation Lecture notes02
Evolutionary Computation Lecture notes02Evolutionary Computation Lecture notes02
Evolutionary Computation Lecture notes02
 
Evolutionary Computation Lecture notes01
Evolutionary Computation Lecture notes01Evolutionary Computation Lecture notes01
Evolutionary Computation Lecture notes01
 
Markov Chain Monte Carlo
Markov Chain Monte CarloMarkov Chain Monte Carlo
Markov Chain Monte Carlo
 
Simplex Lecture Notes
Simplex Lecture NotesSimplex Lecture Notes
Simplex Lecture Notes
 
Reinforcement Learning
Reinforcement LearningReinforcement Learning
Reinforcement Learning
 
Median of Medians
Median of MediansMedian of Medians
Median of Medians
 
Average Linear Selection Algorithm
Average Linear Selection AlgorithmAverage Linear Selection Algorithm
Average Linear Selection Algorithm
 
Lower Bound of Comparison Sort
Lower Bound of Comparison SortLower Bound of Comparison Sort
Lower Bound of Comparison Sort
 
Running Time of Building Binary Heap using Array
Running Time of Building Binary Heap using ArrayRunning Time of Building Binary Heap using Array
Running Time of Building Binary Heap using Array
 
Running Time of MergeSort
Running Time of MergeSortRunning Time of MergeSort
Running Time of MergeSort
 
Binary Trees
Binary TreesBinary Trees
Binary Trees
 
Proof By Math Induction Example
Proof By Math Induction ExampleProof By Math Induction Example
Proof By Math Induction Example
 
TRPO and PPO notes
TRPO and PPO notesTRPO and PPO notes
TRPO and PPO notes
 
Estimating probability distributions
Estimating probability distributionsEstimating probability distributions
Estimating probability distributions
 
2018-2 Machine Learning (Wasserstein GAN and BEGAN)
2018-2 Machine Learning (Wasserstein GAN and BEGAN)2018-2 Machine Learning (Wasserstein GAN and BEGAN)
2018-2 Machine Learning (Wasserstein GAN and BEGAN)
 
2018-2 Machine Learning (Linear regression, Logistic regression)
2018-2 Machine Learning (Linear regression, Logistic regression)2018-2 Machine Learning (Linear regression, Logistic regression)
2018-2 Machine Learning (Linear regression, Logistic regression)
 
2017-2 ML W11 GAN #1
2017-2 ML W11 GAN #12017-2 ML W11 GAN #1
2017-2 ML W11 GAN #1
 
2017-2 ML W9 Reinforcement Learning #5
2017-2 ML W9 Reinforcement Learning #52017-2 ML W9 Reinforcement Learning #5
2017-2 ML W9 Reinforcement Learning #5
 

Dernier

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 

Dernier (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

2013-1 Machine Learning Lecture 03 - Naïve Bayes Classifiers

  • 2. Probability • What is the probability that you get hit in the head with bird droppings when you go outside? ▫ Initially shown in one episode of Korea TV show ▫ Asked in one BBS of an elite group in Korea  no one gave the right answer • It is difficult to define “probability” ▫ As difficult as defining “science”, “history”, etc. • 16 different definitions • Frequentist vs. Bayesian
  • 3. Bayesian Probability • Can assign probability on any statement • Can be used to show the degree of one’s belief, or degree of one’s knowledge. • Example of subjectiveness: Alice, Bob, and Carol ▫ Alice tosses a new coin 10 times  3/10 heads ▫ Bob tosses the coin 10 times  4/10 heads ▫ Carol has watched them all  7/20 heads • Probabilities in Monty Hall ▫ MC knows where the car is  1 ▫ Guest asked again after MC opened the door  2/3 ▫ Viewer started watching the show after MC opened the door  1/2
  • 4. Bayes Theorem • Probability – P(A) • Conditional Probability – P(A|C) • P(A|B) = P(A,B)/P(B) • P(B|A) = P(A,B)/P(A) • P(A|B) = {P(B|A)P(A)}/P(B) • P(C|D) = {P(D|C)P(C)}/P(D) ▫ P(C|D) : The probability of class given data (Posterior Probability) ▫ P(C) : Prior Probability ▫ P(D|C) : Likelihood ▫ P(D) : constant, so ignored
  • 5. Does patient have cancer or not? • A patient takes a lab test and the result comes back positive. It is known that the test returns a correct positive result in only 99% of the cases and a correct negative result in only 95% of t he cases. Furthermore, only 0.03 of the entire population has this disease. 1. What is the probability that this patient has cancer? 2. What is the probability that he does not have cancer? 3. What is the diagnosis?
  • 6. Bayes Classifier • Recall P(C|X1,X2,…Xn) = P(C)P(X1,X2,…Xn|C)/P(X1,X2,…Xn) •  P(C|X1,X2,…Xn) α P(C)P(X1,X2,…Xn|C) • Maximum A Posteriori hypothesis ▫ hMAP = argmax P(D|h)P(h) • Maximum Likelihood hypothesis ▫ hML = argmax P(D|h)
  • 7. Naïve Bayes Classifier • Recall P(C|X1,X2,…Xn) = P(C)P(X1,X2,…Xn|C)/P(X1,X2,…Xn) •  P(C|X1,X2,…Xn) α P(C)P(X1,X2,…Xn|C) • It is hard to calculate joint probabilities ▫ Too many cases ▫ Too little data • If we ignore the dependences among X1,X2,…Xn? ▫ Why? Because we are naïve. ▫ More precisely, suppose X1,X2,…Xn are conditionally independent of each other given C • P(C|X1,X2,…Xn) α P(C) * Π P(Xi|C)
  • 8. Question: For the day <sunny, cool, high, strong>, what’s the play prediction?
  • 9. Wind Humidity Temp Outlook PlayP(Play=Y) 9/14 P(Play=N) 4/14 P(Sunny|P) P(Overcast|P) P(Rain|P) Play= Y 2/9 4/9 3/9 Play= N 3/5 0/5 2/5 P(Hot|P) P(Mild|P) P(Cool|P) Play= Y 2/9 4/9 3/9 Play= N 2/5 2/5 1/5
  • 10. Zero Occurrence • When a feature is never occurred in the training set  zero frequency  PANIC: makes all terms zero • Smoothing the distribution ▫ Laplacian Smoothing ▫ Dirichlet Priors Smoothing ▫ and many more (Absolute Discouting, Jelinek- Mercer smoothing, Katz smoothing, Good-Turing smoothing, etc.)
  • 11. Extension of NBC • NBC is actually very effective • Selective Bayesian Classifiers (SBC) • Tree Augmented Naïve Bayes (TAN) ▫ ChowLiu algorithm (CL-TAN) ▫ SuperParent algorithm (SP-TAN) • Attribute Value Taxonomy Guided Naïve Bayes Learner (AVT-NBL) • n-gram Augmented Naïve Bayes