SlideShare une entreprise Scribd logo
1  sur  1
Briefly describe a sign of overfitting in Naive Bayes learning, and how it can be avoided.
Solution
Briefly, with the Naive Bayes (NB) algorithm the 'naive' conditional independence assumption
means that interactions between variables can be ignored. What follows is:
i) it has a simpler hypothesis function (compared with other algorithms e.g. logistic regression)
ii) since the interactions are not modeled, some of the information in the data is ignored. This
makes it an inherently high bias model; it has a high approximation error but as a result it also
does not overfit. (A model with high variance attempts to model all of the data including the
noise in the data).
iii) Since the interactions are not modeled, less training data is needed. This is why the NB
classifier is known to perform well both with small data sets and with missing data. Hereis a
small experiment I did to see effect missing data and training data size have on the NB classifier.

Contenu connexe

Similaire à Briefly describe a sign of overfitting in Naive Bayes learning- and ho.docx

Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdf
AnanthReddy38
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .ppt
butest
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
ijtsrd
 
renewed-poster-presentation (12)
renewed-poster-presentation (12)renewed-poster-presentation (12)
renewed-poster-presentation (12)
Kofi Forson
 
SELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODSSELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODS
KAMIL MAJEED
 
Dealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdfDealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdf
NagaVarthini
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
butest
 
An advance extended binomial GLMBoost ensemble method with synthetic minorit...
An advance extended binomial GLMBoost ensemble method  with synthetic minorit...An advance extended binomial GLMBoost ensemble method  with synthetic minorit...
An advance extended binomial GLMBoost ensemble method with synthetic minorit...
IJECEIAES
 

Similaire à Briefly describe a sign of overfitting in Naive Bayes learning- and ho.docx (20)

Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdf
 
notes as .ppt
notes as .pptnotes as .ppt
notes as .ppt
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
 
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCEA SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
A SURVEY OF METHODS FOR HANDLING DISK DATA IMBALANCE
 
Issues in DTL.pptx
Issues in DTL.pptxIssues in DTL.pptx
Issues in DTL.pptx
 
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...SPSS GuideAssessing Normality, Handling Missing Data, and Calculating  Scores...
SPSS GuideAssessing Normality, Handling Missing Data, and Calculating Scores...
 
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
 
renewed-poster-presentation (12)
renewed-poster-presentation (12)renewed-poster-presentation (12)
renewed-poster-presentation (12)
 
SELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODSSELECTED DATA PREPARATION METHODS
SELECTED DATA PREPARATION METHODS
 
Dealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdfDealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdf
 
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
COMPARISION OF PERCENTAGE ERROR BY USING IMPUTATION METHOD ON MID TERM EXAMIN...
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
 
An advance extended binomial GLMBoost ensemble method with synthetic minorit...
An advance extended binomial GLMBoost ensemble method  with synthetic minorit...An advance extended binomial GLMBoost ensemble method  with synthetic minorit...
An advance extended binomial GLMBoost ensemble method with synthetic minorit...
 
IRJET- Predicting Customers Churn in Telecom Industry using Centroid Oversamp...
IRJET- Predicting Customers Churn in Telecom Industry using Centroid Oversamp...IRJET- Predicting Customers Churn in Telecom Industry using Centroid Oversamp...
IRJET- Predicting Customers Churn in Telecom Industry using Centroid Oversamp...
 
Neural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learningNeural networks, naïve bayes and decision tree machine learning
Neural networks, naïve bayes and decision tree machine learning
 
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
 
Regularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptxRegularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptx
 
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
Prognosticating Autism Spectrum Disorder Using Artificial Neural Network: Lev...
 
Machine Learning Approaches and its Challenges
Machine Learning Approaches and its ChallengesMachine Learning Approaches and its Challenges
Machine Learning Approaches and its Challenges
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 

Plus de marions12

C++ help!! Im not familiar with how to save- read file so that the pro.docx
C++ help!! Im not familiar with how to save- read file so that the pro.docxC++ help!! Im not familiar with how to save- read file so that the pro.docx
C++ help!! Im not familiar with how to save- read file so that the pro.docx
marions12
 
Briefly describe some of the similarities and differences between U-S-.docx
Briefly describe some of the similarities and differences between U-S-.docxBriefly describe some of the similarities and differences between U-S-.docx
Briefly describe some of the similarities and differences between U-S-.docx
marions12
 
Briefly explain the purposes of adjustments Briefly explain the purp.docx
Briefly explain the purposes of adjustments   Briefly explain the purp.docxBriefly explain the purposes of adjustments   Briefly explain the purp.docx
Briefly explain the purposes of adjustments Briefly explain the purp.docx
marions12
 

Plus de marions12 (20)

C++ program Revising the Array-Based List ADT Given the data structure.docx
C++ program Revising the Array-Based List ADT Given the data structure.docxC++ program Revising the Array-Based List ADT Given the data structure.docx
C++ program Revising the Array-Based List ADT Given the data structure.docx
 
C++ help!! Im not familiar with how to save- read file so that the pro.docx
C++ help!! Im not familiar with how to save- read file so that the pro.docxC++ help!! Im not familiar with how to save- read file so that the pro.docx
C++ help!! Im not familiar with how to save- read file so that the pro.docx
 
c++ computer architecture and memory What is fragmentation- Briefly ex.docx
c++ computer architecture and memory What is fragmentation- Briefly ex.docxc++ computer architecture and memory What is fragmentation- Briefly ex.docx
c++ computer architecture and memory What is fragmentation- Briefly ex.docx
 
C# There are two schools of thought on the use of implicitly typed var.docx
C# There are two schools of thought on the use of implicitly typed var.docxC# There are two schools of thought on the use of implicitly typed var.docx
C# There are two schools of thought on the use of implicitly typed var.docx
 
Brief introduction of the WorldComSolutionWorldCom was started by Bill.docx
Brief introduction of the WorldComSolutionWorldCom was started by Bill.docxBrief introduction of the WorldComSolutionWorldCom was started by Bill.docx
Brief introduction of the WorldComSolutionWorldCom was started by Bill.docx
 
Brief Exercise 5-5 Crane Corporation has the following accounts includ.docx
Brief Exercise 5-5 Crane Corporation has the following accounts includ.docxBrief Exercise 5-5 Crane Corporation has the following accounts includ.docx
Brief Exercise 5-5 Crane Corporation has the following accounts includ.docx
 
Briefly describe some of the similarities and differences between U-S-.docx
Briefly describe some of the similarities and differences between U-S-.docxBriefly describe some of the similarities and differences between U-S-.docx
Briefly describe some of the similarities and differences between U-S-.docx
 
Briefly describe what the query evaluation engine performs inside a DB.docx
Briefly describe what the query evaluation engine performs inside a DB.docxBriefly describe what the query evaluation engine performs inside a DB.docx
Briefly describe what the query evaluation engine performs inside a DB.docx
 
Briefly explain the purposes of adjustments Briefly explain the purp.docx
Briefly explain the purposes of adjustments   Briefly explain the purp.docxBriefly explain the purposes of adjustments   Briefly explain the purp.docx
Briefly explain the purposes of adjustments Briefly explain the purp.docx
 
Budget Martin Corporation granted a nonqualified stock option to e.docx
Budget    Martin Corporation granted a nonqualified stock option to  e.docxBudget    Martin Corporation granted a nonqualified stock option to  e.docx
Budget Martin Corporation granted a nonqualified stock option to e.docx
 
Calculate fost of goods sold and ending inventory and analyze effect o (1).docx
Calculate fost of goods sold and ending inventory and analyze effect o (1).docxCalculate fost of goods sold and ending inventory and analyze effect o (1).docx
Calculate fost of goods sold and ending inventory and analyze effect o (1).docx
 
Calculate fost of goods sold and ending inventory and analyze effect o.docx
Calculate fost of goods sold and ending inventory and analyze effect o.docxCalculate fost of goods sold and ending inventory and analyze effect o.docx
Calculate fost of goods sold and ending inventory and analyze effect o.docx
 
Ca(OH)2 is insoluble- What is the total concentration of ions if 2M of.docx
Ca(OH)2 is insoluble- What is the total concentration of ions if 2M of.docxCa(OH)2 is insoluble- What is the total concentration of ions if 2M of.docx
Ca(OH)2 is insoluble- What is the total concentration of ions if 2M of.docx
 
c- Since BEEF hooks to Metasploit- is there any safe way to use the we.docx
c- Since BEEF hooks to Metasploit- is there any safe way to use the we.docxc- Since BEEF hooks to Metasploit- is there any safe way to use the we.docx
c- Since BEEF hooks to Metasploit- is there any safe way to use the we.docx
 
9- Fill in the following table with names or formulas- as appropriate-.docx
9- Fill in the following table with names or formulas- as appropriate-.docx9- Fill in the following table with names or formulas- as appropriate-.docx
9- Fill in the following table with names or formulas- as appropriate-.docx
 
9- A combination of sand- salt- and water is an example of a A) homoge.docx
9- A combination of sand- salt- and water is an example of a A) homoge.docx9- A combination of sand- salt- and water is an example of a A) homoge.docx
9- A combination of sand- salt- and water is an example of a A) homoge.docx
 
8-6- Which of the highlighted elements in Figure P8-6 has the greatest.docx
8-6- Which of the highlighted elements in Figure P8-6 has the greatest.docx8-6- Which of the highlighted elements in Figure P8-6 has the greatest.docx
8-6- Which of the highlighted elements in Figure P8-6 has the greatest.docx
 
870 The plates of a capacitor are not quite parallel- the distance bet.docx
870 The plates of a capacitor are not quite parallel- the distance bet.docx870 The plates of a capacitor are not quite parallel- the distance bet.docx
870 The plates of a capacitor are not quite parallel- the distance bet.docx
 
8- What are concurrency design patterns- Explain-Solution8) we have fi.docx
8- What are concurrency design patterns- Explain-Solution8) we have fi.docx8- What are concurrency design patterns- Explain-Solution8) we have fi.docx
8- What are concurrency design patterns- Explain-Solution8) we have fi.docx
 
8- Calculate the distance from the donor (D) to the acceptor (A) in th.docx
8- Calculate the distance from the donor (D) to the acceptor (A) in th.docx8- Calculate the distance from the donor (D) to the acceptor (A) in th.docx
8- Calculate the distance from the donor (D) to the acceptor (A) in th.docx
 

Dernier

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Dernier (20)

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 

Briefly describe a sign of overfitting in Naive Bayes learning- and ho.docx

  • 1. Briefly describe a sign of overfitting in Naive Bayes learning, and how it can be avoided. Solution Briefly, with the Naive Bayes (NB) algorithm the 'naive' conditional independence assumption means that interactions between variables can be ignored. What follows is: i) it has a simpler hypothesis function (compared with other algorithms e.g. logistic regression) ii) since the interactions are not modeled, some of the information in the data is ignored. This makes it an inherently high bias model; it has a high approximation error but as a result it also does not overfit. (A model with high variance attempts to model all of the data including the noise in the data). iii) Since the interactions are not modeled, less training data is needed. This is why the NB classifier is known to perform well both with small data sets and with missing data. Hereis a small experiment I did to see effect missing data and training data size have on the NB classifier.