SlideShare une entreprise Scribd logo
1  sur  21
PRESENTATION
On
K-Fold Cross Validation Method
• Under the Guidance –Mrs.Divya Gupta
(M.tech)
Assistant Professor Computer Science
Department IERT ALD
• Made By Shubham Gupta
Aktu-Roll No-1511010041
B.Tech CSE 3rd Year 39
TABLE OF CONTENT
• Data Sets
1-Training Data Sets
2-Testing Data Sets
3-Data Set Figure Representation
• Cross validation
1-Definition
2-Methods of Cross Validation
• Hold –Out Method For Cross Validation
1-Definition
2-Need
3-Advantages
4-Disadvantages
• K-Fold Cross Validation methods
1-Definition
2-Need
3-Advantages
4-Disadvantages
* References
2
DATA SETS
• In machine Learning, the study and construction of algorithm that can learn from and
make prediction on the data is a common task. Such algorithms works by making
data- driven predictions or decision through building a mathematical model from
input data.
• The data used is called Data Sets.
• Date Sets are classified into two types.
1-Training Data set
2-Testing Data set.
3
TRAINING DATA SET
• Type of data set in which we know the solution or in other words we can say we
know the input and output data both such type of data is called Training Data set.
Eg-History (We know the outcome of that).
• It is used for leaning of result and making algorithm or pattern. Hence it should be in
large amount say 70% of the initial data.
• Also know as Development data set.
4
TESTING DATA SET
• Type of Data Set in which we don't know the solution or in other words we don’t
now the output of that input set. Such type of Data Set is called Training Data sets.
Eg. Future (We don’t know the outcomes of events that will occur in future.
• It is used for Data validation. Hence it should be maximum say 30% of initial data.
• Also know as Validation data sets.
5
DATA SET FIGURE REPRESENTATION
6
CROSS VALIDATION
• Cross Validation is a model validation technique for accessing how the result of
statistical analysis will generalize to an independent data set.
• So we can say cross validation is used for-
1-Finding or estimating expected error.
2-Helps in selecting the best fit Model (Model which fit the data set best).
3-Avoiding Over-Fit Model.(e.g. time fit Model like Earthquake.)
7
METHOD USED FOR CROSS
VALIDATION
• There are four methods used for Cross Validation. These are-
1-Hold out sample Validation.
2-K-Fold Cross Validation
3-Leave one out Cross Validation
4-Bootstraps Methods
Here we will discuss only 2 methods Hold out sample Validation and K-Fold Cross
Validation only.
8
HOLD OUT CROSS VALIDATION
• Step by Step-
• Step 1:- Took all data
• Step 2-Randomly divided into two parts
(say 70% 30)
• Step 3: Use Part1 as development
(training data set) and Part2 as
testing data set.
9
WHY WE DID SO IN HOLD OUT
METHOD
• To ensure that we learn the generalized
pattern without much error.
• Pattern obtained from the training set data
must show similar results in test/validation
data.
10
ADVANTAGES /DISADVANTAGES
OF HOLD OUT METHOD
• Advantages
1-Simplest method
2-Easily can work on large Data.
3-Fast method as compared to other method.
Disadvantage
1-Not working for small data set.(here it comes the Role of K-Fold Cross validation.
11
WHY WE NEED K-FOLD CROSS
VALIDATION METHOD
• Suppose a situation in which we have a short data
set say 500 data sets.
• Now we split the data into 70 :30 % as hold out
method says.
• Hence we only get 150 records which is too low.
• To increase it we make it 50:50 %Ratio.
• Now if we make 50:50 ratio than the training data
will become too low.
• If we don’t have much training data the model
develop will have more error and will not be accurate.
12
DILEMMA STATE IN TRAINING
AND TESTING DATA
• #More Training data more
accurate model will develop.
• #Less error in the model.
• Here it comes the role of K-Fold
CV.
• #more Testing data more Value to
check data.
13
K-FOLD CROSS VALIDATION
• Let assume k=5.So it will be 5-Fold validation.
• First take the data and divide it into 5
equal parts.
• Each part will have 20% of the data set values.
14
K-FOLD CROSS VALIDATION
CONTD
• Now used 4 parts as
development and 1 parts
for validation.
See the given figure
15
K-FOLD CROSS VALIDATION
CONTD
• Similar we can
done the same
thing for next
four.
See the
Figure
16
K-FOLD CROSS VALIDATION
CONTD
• Points to be noted
• Each part become available for 1 time in validation set.
• Similar Each part will become 4 times in the training Set.
• Hence we have increased both validation set and training.
17
ADVANTAGES OF K-FOLD CROSS VALIDATION
METHOD
• Given We have big data for model Development as in the Hold out method we have
only 500 data set now we have 500x5=2500 data sets in the K-Fold Cross validation
method .
• Given We have now a big data for validation. In case of Hold out method we have
only 150 data sets now in case of K-Fold cross validation method we have
100x5=500 data sets for validation.
• Hence we Have big data so it will more accurate as compared to other methods.
18
DISADVANTAGES OF K-FOLD CROSS
VALIDATION METHOD
• Only the Disadvantage that the K-Fold Cross Validation method has is it calculation.
• As we Repeat the model-K-times Hence it required More heavy calculation. Infact
it required K-times more calculation as compared to Hold –Out Cross Validation
method.
• Hence it is K-times slower.
19
REFERENCES
• Wikipedia-
https://en.wikipedia.org/wiki/Training,_test,_and_validation_sets
• Geeks for Geeks
https://www.geeksforgeeks.org/cross-validation-machine-learning/
• Udacity
https://www.youtube.com/watch?v=TIgfjmp-4BA
20
21

Contenu connexe

Tendances

Machine Learning with Decision trees
Machine Learning with Decision treesMachine Learning with Decision trees
Machine Learning with Decision treesKnoldus Inc.
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade offVARUN KUMAR
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
Linear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaLinear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaEdureka!
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierNeha Kulkarni
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)Abhimanyu Dwivedi
 
Feature selection
Feature selectionFeature selection
Feature selectionDong Guo
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)Pravinkumar Landge
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsMd. Main Uddin Rony
 
Deep Dive into Hyperparameter Tuning
Deep Dive into Hyperparameter TuningDeep Dive into Hyperparameter Tuning
Deep Dive into Hyperparameter TuningShubhmay Potdar
 
Overfitting & Underfitting
Overfitting & UnderfittingOverfitting & Underfitting
Overfitting & UnderfittingSOUMIT KAR
 
Over fitting underfitting
Over fitting underfittingOver fitting underfitting
Over fitting underfittingSivapriyaS12
 
A Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnA Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnSarah Guido
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluationeShikshak
 

Tendances (20)

Machine Learning with Decision trees
Machine Learning with Decision treesMachine Learning with Decision trees
Machine Learning with Decision trees
 
Machine learning clustering
Machine learning clusteringMachine learning clustering
Machine learning clustering
 
Machine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-offMachine Learning: Bias and Variance Trade-off
Machine Learning: Bias and Variance Trade-off
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
Linear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | EdurekaLinear Regression vs Logistic Regression | Edureka
Linear Regression vs Logistic Regression | Edureka
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Unsupervised learning (clustering)
Unsupervised learning (clustering)Unsupervised learning (clustering)
Unsupervised learning (clustering)
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 
Hierarchical Clustering
Hierarchical ClusteringHierarchical Clustering
Hierarchical Clustering
 
Deep Dive into Hyperparameter Tuning
Deep Dive into Hyperparameter TuningDeep Dive into Hyperparameter Tuning
Deep Dive into Hyperparameter Tuning
 
Overfitting & Underfitting
Overfitting & UnderfittingOverfitting & Underfitting
Overfitting & Underfitting
 
Over fitting underfitting
Over fitting underfittingOver fitting underfitting
Over fitting underfitting
 
A Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-LearnA Beginner's Guide to Machine Learning with Scikit-Learn
A Beginner's Guide to Machine Learning with Scikit-Learn
 
Machine learning
Machine learningMachine learning
Machine learning
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
 
K - Nearest neighbor ( KNN )
K - Nearest neighbor  ( KNN )K - Nearest neighbor  ( KNN )
K - Nearest neighbor ( KNN )
 
Decision tree
Decision treeDecision tree
Decision tree
 

Similaire à K-Folds Cross Validation Method

shubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxshubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxABINASHPADHY6
 
Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxrajalakshmi5921
 
6 Evaluating Predictive Performance and ensemble.pptx
6 Evaluating Predictive Performance and ensemble.pptx6 Evaluating Predictive Performance and ensemble.pptx
6 Evaluating Predictive Performance and ensemble.pptxmohammedalherwi1
 
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptx
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptxLETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptx
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptxshamsul2010
 
Cross validation.pptx
Cross validation.pptxCross validation.pptx
Cross validation.pptxYouKnowwho28
 
Week 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model EvaluationWeek 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model Evaluationkhairulhuda242
 
Cmpe 255 cross validation
Cmpe 255 cross validationCmpe 255 cross validation
Cmpe 255 cross validationAbraham Kong
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10Roger Barga
 
MACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptxMACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptxNAGARAJANS68
 
Model Selection Techniques
Model Selection TechniquesModel Selection Techniques
Model Selection TechniquesSwati .
 
Information Retrieval 08
Information Retrieval 08 Information Retrieval 08
Information Retrieval 08 Jeet Das
 
Improving machine learning models unit 5.pptx
Improving machine learning models unit 5.pptxImproving machine learning models unit 5.pptx
Improving machine learning models unit 5.pptxSomnathMule5
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining ProcessMarc Berman
 
Machine learning project_promotion
Machine learning project_promotionMachine learning project_promotion
Machine learning project_promotionkahhuey
 

Similaire à K-Folds Cross Validation Method (20)

shubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptxshubhampresentation-180430060134.pptx
shubhampresentation-180430060134.pptx
 
crossvalidation.pptx
crossvalidation.pptxcrossvalidation.pptx
crossvalidation.pptx
 
Statistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptxStatistical Learning and Model Selection (1).pptx
Statistical Learning and Model Selection (1).pptx
 
6 Evaluating Predictive Performance and ensemble.pptx
6 Evaluating Predictive Performance and ensemble.pptx6 Evaluating Predictive Performance and ensemble.pptx
6 Evaluating Predictive Performance and ensemble.pptx
 
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptx
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptxLETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptx
LETS PUBLISH WITH MORE RELIABLE & PRESENTABLE MODELLING.pptx
 
Cross validation.pptx
Cross validation.pptxCross validation.pptx
Cross validation.pptx
 
evaluation and credibility-Part 1
evaluation and credibility-Part 1evaluation and credibility-Part 1
evaluation and credibility-Part 1
 
Week 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model EvaluationWeek 11 Model Evalaution Model Evaluation
Week 11 Model Evalaution Model Evaluation
 
Cmpe 255 cross validation
Cmpe 255 cross validationCmpe 255 cross validation
Cmpe 255 cross validation
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
 
MACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptxMACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptx
 
Model Selection Techniques
Model Selection TechniquesModel Selection Techniques
Model Selection Techniques
 
4.1.pptx
4.1.pptx4.1.pptx
4.1.pptx
 
Information Retrieval 08
Information Retrieval 08 Information Retrieval 08
Information Retrieval 08
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
 
Improving machine learning models unit 5.pptx
Improving machine learning models unit 5.pptxImproving machine learning models unit 5.pptx
Improving machine learning models unit 5.pptx
 
The 8 Step Data Mining Process
The 8 Step Data Mining ProcessThe 8 Step Data Mining Process
The 8 Step Data Mining Process
 
Models ABC
Models ABCModels ABC
Models ABC
 
Machine learning project_promotion
Machine learning project_promotionMachine learning project_promotion
Machine learning project_promotion
 
ai4.ppt
ai4.pptai4.ppt
ai4.ppt
 

Dernier

the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Dernier (20)

the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 

K-Folds Cross Validation Method

  • 1. PRESENTATION On K-Fold Cross Validation Method • Under the Guidance –Mrs.Divya Gupta (M.tech) Assistant Professor Computer Science Department IERT ALD • Made By Shubham Gupta Aktu-Roll No-1511010041 B.Tech CSE 3rd Year 39
  • 2. TABLE OF CONTENT • Data Sets 1-Training Data Sets 2-Testing Data Sets 3-Data Set Figure Representation • Cross validation 1-Definition 2-Methods of Cross Validation • Hold –Out Method For Cross Validation 1-Definition 2-Need 3-Advantages 4-Disadvantages • K-Fold Cross Validation methods 1-Definition 2-Need 3-Advantages 4-Disadvantages * References 2
  • 3. DATA SETS • In machine Learning, the study and construction of algorithm that can learn from and make prediction on the data is a common task. Such algorithms works by making data- driven predictions or decision through building a mathematical model from input data. • The data used is called Data Sets. • Date Sets are classified into two types. 1-Training Data set 2-Testing Data set. 3
  • 4. TRAINING DATA SET • Type of data set in which we know the solution or in other words we can say we know the input and output data both such type of data is called Training Data set. Eg-History (We know the outcome of that). • It is used for leaning of result and making algorithm or pattern. Hence it should be in large amount say 70% of the initial data. • Also know as Development data set. 4
  • 5. TESTING DATA SET • Type of Data Set in which we don't know the solution or in other words we don’t now the output of that input set. Such type of Data Set is called Training Data sets. Eg. Future (We don’t know the outcomes of events that will occur in future. • It is used for Data validation. Hence it should be maximum say 30% of initial data. • Also know as Validation data sets. 5
  • 6. DATA SET FIGURE REPRESENTATION 6
  • 7. CROSS VALIDATION • Cross Validation is a model validation technique for accessing how the result of statistical analysis will generalize to an independent data set. • So we can say cross validation is used for- 1-Finding or estimating expected error. 2-Helps in selecting the best fit Model (Model which fit the data set best). 3-Avoiding Over-Fit Model.(e.g. time fit Model like Earthquake.) 7
  • 8. METHOD USED FOR CROSS VALIDATION • There are four methods used for Cross Validation. These are- 1-Hold out sample Validation. 2-K-Fold Cross Validation 3-Leave one out Cross Validation 4-Bootstraps Methods Here we will discuss only 2 methods Hold out sample Validation and K-Fold Cross Validation only. 8
  • 9. HOLD OUT CROSS VALIDATION • Step by Step- • Step 1:- Took all data • Step 2-Randomly divided into two parts (say 70% 30) • Step 3: Use Part1 as development (training data set) and Part2 as testing data set. 9
  • 10. WHY WE DID SO IN HOLD OUT METHOD • To ensure that we learn the generalized pattern without much error. • Pattern obtained from the training set data must show similar results in test/validation data. 10
  • 11. ADVANTAGES /DISADVANTAGES OF HOLD OUT METHOD • Advantages 1-Simplest method 2-Easily can work on large Data. 3-Fast method as compared to other method. Disadvantage 1-Not working for small data set.(here it comes the Role of K-Fold Cross validation. 11
  • 12. WHY WE NEED K-FOLD CROSS VALIDATION METHOD • Suppose a situation in which we have a short data set say 500 data sets. • Now we split the data into 70 :30 % as hold out method says. • Hence we only get 150 records which is too low. • To increase it we make it 50:50 %Ratio. • Now if we make 50:50 ratio than the training data will become too low. • If we don’t have much training data the model develop will have more error and will not be accurate. 12
  • 13. DILEMMA STATE IN TRAINING AND TESTING DATA • #More Training data more accurate model will develop. • #Less error in the model. • Here it comes the role of K-Fold CV. • #more Testing data more Value to check data. 13
  • 14. K-FOLD CROSS VALIDATION • Let assume k=5.So it will be 5-Fold validation. • First take the data and divide it into 5 equal parts. • Each part will have 20% of the data set values. 14
  • 15. K-FOLD CROSS VALIDATION CONTD • Now used 4 parts as development and 1 parts for validation. See the given figure 15
  • 16. K-FOLD CROSS VALIDATION CONTD • Similar we can done the same thing for next four. See the Figure 16
  • 17. K-FOLD CROSS VALIDATION CONTD • Points to be noted • Each part become available for 1 time in validation set. • Similar Each part will become 4 times in the training Set. • Hence we have increased both validation set and training. 17
  • 18. ADVANTAGES OF K-FOLD CROSS VALIDATION METHOD • Given We have big data for model Development as in the Hold out method we have only 500 data set now we have 500x5=2500 data sets in the K-Fold Cross validation method . • Given We have now a big data for validation. In case of Hold out method we have only 150 data sets now in case of K-Fold cross validation method we have 100x5=500 data sets for validation. • Hence we Have big data so it will more accurate as compared to other methods. 18
  • 19. DISADVANTAGES OF K-FOLD CROSS VALIDATION METHOD • Only the Disadvantage that the K-Fold Cross Validation method has is it calculation. • As we Repeat the model-K-times Hence it required More heavy calculation. Infact it required K-times more calculation as compared to Hold –Out Cross Validation method. • Hence it is K-times slower. 19
  • 20. REFERENCES • Wikipedia- https://en.wikipedia.org/wiki/Training,_test,_and_validation_sets • Geeks for Geeks https://www.geeksforgeeks.org/cross-validation-machine-learning/ • Udacity https://www.youtube.com/watch?v=TIgfjmp-4BA 20
  • 21. 21