SlideShare a Scribd company logo
1 of 43
Download to read offline
Machine Learning with
Classification & Regression Trees
© 2020 Minitab, LLC.
• 25+ years of
experience
• Minitab Trainer
• Statistical Consultant
• Minitab Software
Designer
• Master's in Statistics
Meet the Presenter:
Cheryl Pammer
Senior Advisory Statistician
© 2020 Minitab, LLC.
Learning Objectives
4
►Overview of Classification and Regression Trees
►Model Selection
►Visualizing Information from Large Trees
►Prediction
© 2020 Minitab, LLC.
Basic Supervised Machine Learning Algorithms
►Continuous Y: Regression, CART Regression Trees
►Categorical Y: Logistic Regression, CART Classification Trees
5
© 2020 Minitab, LLC.
Example: Pulp Bleaching
►A paper manufacturer needs to
determine which predictors
contribute to defects in a pulp
bleaching process.
►Several months of observational
process data are mined to
determine the root cause of
defects.
6
© 2020 Minitab, LLC.
Logistic Regression
7
?
© 2020 Minitab, LLC.
Challenges with Regression
Regression and logistic regression often don’t work well with larger
observational data sets.
►Everything is significant.
►Assumptions are not met.
►Relationships are nonlinear.
►Local interactions exist.
►Extreme outliers.
►Many missing values.
8
© 2020 Minitab, LLC.
Classification and Regression Trees (CART)
Find X’s that best partition the data into similar groupings of Y.
►CART Classification Tree
 Partitions data into high percentage of Group A and high percentage
of Group B
 Example: Separate data into defective and non-defective groupings
►CART Regression Tree
 Partitions data into high values of Y and low values of Y
 Example: Find conditions that lead to optimal values of Y
9
© 2020 Minitab, LLC.
CART Classification Tree
10
© 2020 Minitab, LLC.
CART Classification Tree
11
© 2020 Minitab, LLC.
CART Classification Tree
►Will continue to split
data into groupings to
separate defects from
nondefects.
►Continues splitting until:
 Runs out of data
 Hits a stopping rule
such as terminal
nodes need at least
3 observations.
12
© 2020 Minitab, LLC.
Model Selection
►Simpler models (fewest nodes) may be adequate.
►Look for elbow in plot.
13
© 2020 Minitab, LLC.
10-Node Tree
►Simpler tree is easier to
interpret.
►Less tendency toward
over-fitting.
►Will be more
misclassifications.
►Here, most defect
hotspots involve just 2
variables.
14
© 2020 Minitab, LLC.
Receiver Operating Characteristic (ROC) Curve
►Plot of True Positive Rate vs False
Positive Rate
►For a random classifier True Positive
Rate = False Positive Rate
15
Sensitivity = TP/(TP + FN)
Specificity = TN/(FP + TN)
Model = Fail Model = Pass
True Fail #TP #FN
True Pass #FP #TN
© 2020 Minitab, LLC.
Confusion Matrix
16
Model = Fail Model = Pass
True Fail #TP #FN
True Pass #FP #TN
Sensitivity = TP/(TP + FN)
Specificity = TN/(FP + TN)
© 2020 Minitab, LLC.
Variable Importance
17
© 2020 Minitab, LLC.
Hot Spots
►Overall defect rate = 2.9%
►Four nodes had defect
rates > 15%
18
© 2020 Minitab, LLC.
Example: Glass Manufacturing Process
A glass bottle manufacturer wants
to use past data to:
►Determine what increases per
unit cost of manufacture.
►Predict the per unit cost of
manufacture of custom orders to
determine pricing.
19
© 2020 Minitab, LLC.
Relationships Between Continuous Variables
20
© 2020 Minitab, LLC.
Counts by Plant, Segment and Color
21
© 2020 Minitab, LLC.
CART Regression Tree
►Find predictor and split point
that best separates the
cases into two groups with
similar response values.
►Splitting criterion is usually
least squared error. For this
method, predictions (fits) are
mean of data in each
terminal node.
22
© 2020 Minitab, LLC.
CART Regression Tree (Least Squares Loss)
►For each predictor:
1. Sort smallest to largest
2. Calculate split improvement for each possible split point
3. Select best split point for variable
4. Compare best split improvement across all variables
►For continuous predictor, split point is midpoint between distinct
values
►For categorical predictor, split point is a set of values
23
© 2020 Minitab, LLC.
CART Regression Tree Example
1. Sort by X1
24
© 2020 Minitab, LLC.
CART Regression Tree Example
2. Split at each possible split point and calculate split improvement.
25
© 2020 Minitab, LLC.
CART Regression Tree Example
3. Select the best split point for variable.
4. Compare best split improvement across all variables.
26
© 2020 Minitab, LLC.
CART Regression Tree Example
27
© 2020 Minitab, LLC.
CART Regression Tree
28
© 2020 Minitab, LLC.
CART Regression Tree
►Even with small number of predictors, optimal tree can be large.
►Interpret the tree using visualizations based on stored fits and
terminal node information.
29
© 2020 Minitab, LLC.
Model Selection
“Optimal” model within 1 SE of highest Test R2.
30
© 2020 Minitab, LLC.
Using CART for Prediction (Fit)
►Classification tree: Can use
either predicted class or
predicted probability of
terminal node.
►Regression tree: Fit is mean
or median of terminal node,
depending on estimation
method.
31
© 2020 Minitab, LLC.
Diagnostics
32
© 2020 Minitab, LLC.
Exploring the CART Results
►Store the Fits and
Terminal Node Ids when
creating the model.
►Use these stored values
and Minitab’s
visualization tools to
explore results.
33
© 2020 Minitab, LLC.
Terminal Node Sample Sizes
34
© 2020 Minitab, LLC.
Variable Importance
35
© 2020 Minitab, LLC.
Predicted Cost by Segment
36
© 2020 Minitab, LLC.
Highest Predicted Cost
37
Brushing Mode
Add Variables and
Make Indicator Variable
© 2020 Minitab, LLC.
Terminal Node 71 vs Other Nodes
38
© 2020 Minitab, LLC.
Terminal Node 71 Predictor Demographics
39
© 2020 Minitab, LLC.
What is the Predicted Manufacturing Cost?
40
© 2020 Minitab, LLC.
Take-aways
►Overview of Classification and Regression Trees
►Model Selection
►Visualizing Information from Large Trees
►Prediction
Questions!
Cheryl Pammer
cpammer@minitab.com
41
© 2020 Minitab, LLC.
Upcoming Webinars and Virtual Events
• Maximize Efficiency: Using Minitab Workspace and Minitab
Statistical Software to Amplify Output and Meet Critical
Demand
Time: Wednesday 22 July, 12PM AEST (10AM HKT / 2PM NZST)
• Turning Words in to Numbers and Numbers into Data-Driven
Insights with Minitab, Python and Text Mining
Time: Wednesday 29 July, 12PM AEST (10AM HKT / 2PM NZST)
See all the details and sign up at:
https://info.minitab.com/resources/webinars/webinar-wednesdays-apac
© 2020 Minitab, LLC.
Upcoming Webinars and Virtual Events
• Online/Virtual Training
Minitab is now offering virtual training taught by
Minitab experts – perfect for remote/home workers.
Visit www.minitab.com/training/training for more info.
• Talk to Minitab
Complimentary resources to help you deal quickly with today's challenges and changing environment.
Visit www.minitab.com and click on the Talk to Minitab button and a Minitab representative will be in touch!
© 2020 Minitab, LLC.
Our Approach: More Than Business Analytics… Solutions Analytics
Software
Services
Training
Learn first-hand by attending public or
customized trainings in your facilities
according to your requirements.
Statistical
Consulting
Personalized help with statistical
challenges from collecting the right data
to interpreting analysis more.
Support
Assistance with installation,
implementation, version updates
and license management.
Master statistics and
Minitab anywhere
with online training
Machine learning and
predictive analytics
software
Start, track, manage
and execute
improvement projects
with real-time
dashboards
Powerful statistical
software everyone
can use.
Data Analysis Predictive Modeling Visual Business Tools Project Oversight
Visual tools to
process and product
excellence.
Online Training
Solutions analytics is our integrated approach to providing software and services that enable organizations to make better decisions that drive business excellence.

More Related Content

What's hot

Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Parth Khare
 
Decision tree presentation
Decision tree presentationDecision tree presentation
Decision tree presentationVijay Yadav
 
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...Simplilearn
 
Slide3.ppt
Slide3.pptSlide3.ppt
Slide3.pptbutest
 
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...Edureka!
 
Feature selection
Feature selectionFeature selection
Feature selectionDong Guo
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Simplilearn
 
Introduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele CutlerIntroduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele CutlerSalford Systems
 
Introduction to Some Tree based Learning Method
Introduction to Some Tree based Learning MethodIntroduction to Some Tree based Learning Method
Introduction to Some Tree based Learning MethodHonglin Yu
 
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...Simplilearn
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Simplilearn
 
Introduction to XGboost
Introduction to XGboostIntroduction to XGboost
Introduction to XGboostShuai Zhang
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsMd. Main Uddin Rony
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression TreesHemant Chetwani
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018HJ van Veen
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysisGramener
 

What's hot (20)

Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
 
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
Machine learning basics using trees algorithm (Random forest, Gradient Boosting)
 
Decision tree presentation
Decision tree presentationDecision tree presentation
Decision tree presentation
 
Decision tree
Decision treeDecision tree
Decision tree
 
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...
Decision Tree In R | Decision Tree Algorithm | Data Science Tutorial | Machin...
 
Slide3.ppt
Slide3.pptSlide3.ppt
Slide3.ppt
 
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...
Decision Tree Algorithm | Decision Tree in Python | Machine Learning Algorith...
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
Decision Tree Algorithm With Example | Decision Tree In Machine Learning | Da...
 
Decision tree
Decision treeDecision tree
Decision tree
 
Introduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele CutlerIntroduction to Random Forests by Dr. Adele Cutler
Introduction to Random Forests by Dr. Adele Cutler
 
Introduction to Some Tree based Learning Method
Introduction to Some Tree based Learning MethodIntroduction to Some Tree based Learning Method
Introduction to Some Tree based Learning Method
 
Decision tree
Decision treeDecision tree
Decision tree
 
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
Random Forest In R | Random Forest Algorithm | Random Forest Tutorial |Machin...
 
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
Random Forest Algorithm - Random Forest Explained | Random Forest In Machine ...
 
Introduction to XGboost
Introduction to XGboostIntroduction to XGboost
Introduction to XGboost
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018Hacking Predictive Modeling - RoadSec 2018
Hacking Predictive Modeling - RoadSec 2018
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 

Similar to Machine Learning Classification & Regression Trees

Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMinitab, LLC
 
Machine Learning with Multiple Regression - APAC
Machine Learning with Multiple Regression - APACMachine Learning with Multiple Regression - APAC
Machine Learning with Multiple Regression - APACMinitab, LLC
 
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...Minitab, LLC
 
Boost Your Data Expertise - What's New in Minitab 19.2020.1
Boost Your Data Expertise -  What's New in Minitab 19.2020.1Boost Your Data Expertise -  What's New in Minitab 19.2020.1
Boost Your Data Expertise - What's New in Minitab 19.2020.1Minitab, LLC
 
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...Abbie Wong
 
Using Minitab for Superior Quality in Medical Device Manufacturing
Using Minitab for Superior Quality in Medical Device ManufacturingUsing Minitab for Superior Quality in Medical Device Manufacturing
Using Minitab for Superior Quality in Medical Device ManufacturingMinitab, LLC
 
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...Minitab, LLC
 
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...Minitab, LLC
 
Datascience101presentation4
Datascience101presentation4Datascience101presentation4
Datascience101presentation4Salford Systems
 
Machinelearning: The next step in manufacturing performance
Machinelearning: The next step in manufacturing performance Machinelearning: The next step in manufacturing performance
Machinelearning: The next step in manufacturing performance Blackberry&Cross
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Greg Makowski
 
Performing at your best turning words into numbers and numbers into data driv...
Performing at your best turning words into numbers and numbers into data driv...Performing at your best turning words into numbers and numbers into data driv...
Performing at your best turning words into numbers and numbers into data driv...Minitab, LLC
 
The New Toolkit for Continuous Improvement webinar presentation slides
The New Toolkit for Continuous Improvement webinar presentation slidesThe New Toolkit for Continuous Improvement webinar presentation slides
The New Toolkit for Continuous Improvement webinar presentation slidesMinitab, LLC
 
Visualizations that make an impact - see what s new in minitab statistical s...
Visualizations that make an impact  - see what s new in minitab statistical s...Visualizations that make an impact  - see what s new in minitab statistical s...
Visualizations that make an impact - see what s new in minitab statistical s...Minitab, LLC
 
Minitab webinar presentation See the unknown with monte carlo simulation
Minitab webinar presentation See the unknown with monte carlo simulationMinitab webinar presentation See the unknown with monte carlo simulation
Minitab webinar presentation See the unknown with monte carlo simulationMinitab, LLC
 
Modern Toolkit for Process Excellence
Modern Toolkit for Process ExcellenceModern Toolkit for Process Excellence
Modern Toolkit for Process ExcellenceMinitab, LLC
 
Modern tool kit for process excellence, gracias a Minitab Inc.
Modern tool kit for process excellence, gracias a Minitab Inc.Modern tool kit for process excellence, gracias a Minitab Inc.
Modern tool kit for process excellence, gracias a Minitab Inc.Blackberry&Cross
 
Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDatabricks
 
Predictive analytics retention
Predictive analytics retentionPredictive analytics retention
Predictive analytics retentionQubit
 
Driving customer retention using predictive analytics
Driving customer retention using predictive analyticsDriving customer retention using predictive analytics
Driving customer retention using predictive analyticsVanessa Beeswanger
 

Similar to Machine Learning Classification & Regression Trees (20)

Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APAC
 
Machine Learning with Multiple Regression - APAC
Machine Learning with Multiple Regression - APACMachine Learning with Multiple Regression - APAC
Machine Learning with Multiple Regression - APAC
 
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
Boost Your Data Expertise with the Latest Release of Minitab Statistical Soft...
 
Boost Your Data Expertise - What's New in Minitab 19.2020.1
Boost Your Data Expertise -  What's New in Minitab 19.2020.1Boost Your Data Expertise -  What's New in Minitab 19.2020.1
Boost Your Data Expertise - What's New in Minitab 19.2020.1
 
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
Gaining a Competitive Advantage using Analytics to Optimize your Digital Mark...
 
Using Minitab for Superior Quality in Medical Device Manufacturing
Using Minitab for Superior Quality in Medical Device ManufacturingUsing Minitab for Superior Quality in Medical Device Manufacturing
Using Minitab for Superior Quality in Medical Device Manufacturing
 
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
Companion by Minitab - Seeing the unknown identifying risk and quantifying pr...
 
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
Maximize Efficiency with Minitab Workspace and Minitab Statistical Software -...
 
Datascience101presentation4
Datascience101presentation4Datascience101presentation4
Datascience101presentation4
 
Machinelearning: The next step in manufacturing performance
Machinelearning: The next step in manufacturing performance Machinelearning: The next step in manufacturing performance
Machinelearning: The next step in manufacturing performance
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09
 
Performing at your best turning words into numbers and numbers into data driv...
Performing at your best turning words into numbers and numbers into data driv...Performing at your best turning words into numbers and numbers into data driv...
Performing at your best turning words into numbers and numbers into data driv...
 
The New Toolkit for Continuous Improvement webinar presentation slides
The New Toolkit for Continuous Improvement webinar presentation slidesThe New Toolkit for Continuous Improvement webinar presentation slides
The New Toolkit for Continuous Improvement webinar presentation slides
 
Visualizations that make an impact - see what s new in minitab statistical s...
Visualizations that make an impact  - see what s new in minitab statistical s...Visualizations that make an impact  - see what s new in minitab statistical s...
Visualizations that make an impact - see what s new in minitab statistical s...
 
Minitab webinar presentation See the unknown with monte carlo simulation
Minitab webinar presentation See the unknown with monte carlo simulationMinitab webinar presentation See the unknown with monte carlo simulation
Minitab webinar presentation See the unknown with monte carlo simulation
 
Modern Toolkit for Process Excellence
Modern Toolkit for Process ExcellenceModern Toolkit for Process Excellence
Modern Toolkit for Process Excellence
 
Modern tool kit for process excellence, gracias a Minitab Inc.
Modern tool kit for process excellence, gracias a Minitab Inc.Modern tool kit for process excellence, gracias a Minitab Inc.
Modern tool kit for process excellence, gracias a Minitab Inc.
 
Democratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn CreatorDemocratizing Machine Learning: Perspective from a scikit-learn Creator
Democratizing Machine Learning: Perspective from a scikit-learn Creator
 
Predictive analytics retention
Predictive analytics retentionPredictive analytics retention
Predictive analytics retention
 
Driving customer retention using predictive analytics
Driving customer retention using predictive analyticsDriving customer retention using predictive analytics
Driving customer retention using predictive analytics
 

More from Minitab, LLC

L'art de la visualisation pour une meilleure compréhension des données
L'art de la visualisation pour une meilleure compréhension des donnéesL'art de la visualisation pour une meilleure compréhension des données
L'art de la visualisation pour une meilleure compréhension des donnéesMinitab, LLC
 
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...Minitab, LLC
 
Introducing Graph Builder: Visualizations Built to Move You Forward
Introducing Graph Builder: Visualizations Built to Move You ForwardIntroducing Graph Builder: Visualizations Built to Move You Forward
Introducing Graph Builder: Visualizations Built to Move You ForwardMinitab, LLC
 
Les solutions Minitab pour développer vos produits selon les réglementations ...
Les solutions Minitab pour développer vos produits selon les réglementations ...Les solutions Minitab pour développer vos produits selon les réglementations ...
Les solutions Minitab pour développer vos produits selon les réglementations ...Minitab, LLC
 
Concrétisez votre transformation digitale avec Minitab et Minitab Connect
Concrétisez votre transformation digitale avec Minitab et Minitab ConnectConcrétisez votre transformation digitale avec Minitab et Minitab Connect
Concrétisez votre transformation digitale avec Minitab et Minitab ConnectMinitab, LLC
 
En route vers l'excellence avec les solutions Minitab
En route vers l'excellence avec les solutions MinitabEn route vers l'excellence avec les solutions Minitab
En route vers l'excellence avec les solutions MinitabMinitab, LLC
 
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...Minitab, LLC
 
La puissance du machine learning et des algorithmes cart au service des métiers
La puissance du machine learning et des algorithmes cart au service des métiersLa puissance du machine learning et des algorithmes cart au service des métiers
La puissance du machine learning et des algorithmes cart au service des métiersMinitab, LLC
 
Strukturierte problemloesung mit datenunterstuetzung
Strukturierte problemloesung mit datenunterstuetzungStrukturierte problemloesung mit datenunterstuetzung
Strukturierte problemloesung mit datenunterstuetzungMinitab, LLC
 
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...
Visualizaciones que crean impacto:  Vea las novedades de Minitab Statistical ...Visualizaciones que crean impacto:  Vea las novedades de Minitab Statistical ...
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...Minitab, LLC
 
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...Minitab, LLC
 
Statistical solutions to help you with 5 FDA medical devices stages
Statistical solutions to help you with 5 FDA medical devices stagesStatistical solutions to help you with 5 FDA medical devices stages
Statistical solutions to help you with 5 FDA medical devices stagesMinitab, LLC
 
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...Minitab, LLC
 
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...Minitab, LLC
 
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...Minitab, LLC
 
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...Minitab, LLC
 
Pilotez, structurez et cartographiez vos processus avec minitab workspace
Pilotez, structurez et cartographiez vos processus avec minitab workspacePilotez, structurez et cartographiez vos processus avec minitab workspace
Pilotez, structurez et cartographiez vos processus avec minitab workspaceMinitab, LLC
 
Minitab Preview Training: Introduction to t-Tests for Manufacturing
Minitab Preview Training: Introduction to t-Tests for ManufacturingMinitab Preview Training: Introduction to t-Tests for Manufacturing
Minitab Preview Training: Introduction to t-Tests for ManufacturingMinitab, LLC
 
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...Minitab, LLC
 
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...Minitab, LLC
 

More from Minitab, LLC (20)

L'art de la visualisation pour une meilleure compréhension des données
L'art de la visualisation pour une meilleure compréhension des donnéesL'art de la visualisation pour une meilleure compréhension des données
L'art de la visualisation pour une meilleure compréhension des données
 
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
Pilotez le développement de vos produits et de vos procédés avec Minitab et M...
 
Introducing Graph Builder: Visualizations Built to Move You Forward
Introducing Graph Builder: Visualizations Built to Move You ForwardIntroducing Graph Builder: Visualizations Built to Move You Forward
Introducing Graph Builder: Visualizations Built to Move You Forward
 
Les solutions Minitab pour développer vos produits selon les réglementations ...
Les solutions Minitab pour développer vos produits selon les réglementations ...Les solutions Minitab pour développer vos produits selon les réglementations ...
Les solutions Minitab pour développer vos produits selon les réglementations ...
 
Concrétisez votre transformation digitale avec Minitab et Minitab Connect
Concrétisez votre transformation digitale avec Minitab et Minitab ConnectConcrétisez votre transformation digitale avec Minitab et Minitab Connect
Concrétisez votre transformation digitale avec Minitab et Minitab Connect
 
En route vers l'excellence avec les solutions Minitab
En route vers l'excellence avec les solutions MinitabEn route vers l'excellence avec les solutions Minitab
En route vers l'excellence avec les solutions Minitab
 
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
Meet Minitab Engage Your End-to-End Improvement Solution From Idea Generation...
 
La puissance du machine learning et des algorithmes cart au service des métiers
La puissance du machine learning et des algorithmes cart au service des métiersLa puissance du machine learning et des algorithmes cart au service des métiers
La puissance du machine learning et des algorithmes cart au service des métiers
 
Strukturierte problemloesung mit datenunterstuetzung
Strukturierte problemloesung mit datenunterstuetzungStrukturierte problemloesung mit datenunterstuetzung
Strukturierte problemloesung mit datenunterstuetzung
 
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...
Visualizaciones que crean impacto:  Vea las novedades de Minitab Statistical ...Visualizaciones que crean impacto:  Vea las novedades de Minitab Statistical ...
Visualizaciones que crean impacto: Vea las novedades de Minitab Statistical ...
 
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
Powerful Statistical Tools in the Pharmaceutical and Medical Devices Industry...
 
Statistical solutions to help you with 5 FDA medical devices stages
Statistical solutions to help you with 5 FDA medical devices stagesStatistical solutions to help you with 5 FDA medical devices stages
Statistical solutions to help you with 5 FDA medical devices stages
 
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
Unleashing the Power of Python Using the New Minitab/Python Integration Modul...
 
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
Einführung in den Minitab Workspace_Visuelle Toolkit zur Verbesserung Ihrer A...
 
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
Discover Minitab Workspace - The Ultimate Visual Toolkit to Elevate Your Work...
 
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
Melhore seu conhecimento sobre analise de dados com a versao mais recente do ...
 
Pilotez, structurez et cartographiez vos processus avec minitab workspace
Pilotez, structurez et cartographiez vos processus avec minitab workspacePilotez, structurez et cartographiez vos processus avec minitab workspace
Pilotez, structurez et cartographiez vos processus avec minitab workspace
 
Minitab Preview Training: Introduction to t-Tests for Manufacturing
Minitab Preview Training: Introduction to t-Tests for ManufacturingMinitab Preview Training: Introduction to t-Tests for Manufacturing
Minitab Preview Training: Introduction to t-Tests for Manufacturing
 
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
Praesentation - Identifizieren und eliminieren sie ihre analytischen schwachp...
 
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
Nouveautes Minitab-Integration Python, Arbres de decision, Validation de mode...
 

Recently uploaded

Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 

Machine Learning Classification & Regression Trees

  • 2. © 2020 Minitab, LLC. • 25+ years of experience • Minitab Trainer • Statistical Consultant • Minitab Software Designer • Master's in Statistics Meet the Presenter: Cheryl Pammer Senior Advisory Statistician
  • 3. © 2020 Minitab, LLC. Learning Objectives 4 ►Overview of Classification and Regression Trees ►Model Selection ►Visualizing Information from Large Trees ►Prediction
  • 4. © 2020 Minitab, LLC. Basic Supervised Machine Learning Algorithms ►Continuous Y: Regression, CART Regression Trees ►Categorical Y: Logistic Regression, CART Classification Trees 5
  • 5. © 2020 Minitab, LLC. Example: Pulp Bleaching ►A paper manufacturer needs to determine which predictors contribute to defects in a pulp bleaching process. ►Several months of observational process data are mined to determine the root cause of defects. 6
  • 6. © 2020 Minitab, LLC. Logistic Regression 7 ?
  • 7. © 2020 Minitab, LLC. Challenges with Regression Regression and logistic regression often don’t work well with larger observational data sets. ►Everything is significant. ►Assumptions are not met. ►Relationships are nonlinear. ►Local interactions exist. ►Extreme outliers. ►Many missing values. 8
  • 8. © 2020 Minitab, LLC. Classification and Regression Trees (CART) Find X’s that best partition the data into similar groupings of Y. ►CART Classification Tree  Partitions data into high percentage of Group A and high percentage of Group B  Example: Separate data into defective and non-defective groupings ►CART Regression Tree  Partitions data into high values of Y and low values of Y  Example: Find conditions that lead to optimal values of Y 9
  • 9. © 2020 Minitab, LLC. CART Classification Tree 10
  • 10. © 2020 Minitab, LLC. CART Classification Tree 11
  • 11. © 2020 Minitab, LLC. CART Classification Tree ►Will continue to split data into groupings to separate defects from nondefects. ►Continues splitting until:  Runs out of data  Hits a stopping rule such as terminal nodes need at least 3 observations. 12
  • 12. © 2020 Minitab, LLC. Model Selection ►Simpler models (fewest nodes) may be adequate. ►Look for elbow in plot. 13
  • 13. © 2020 Minitab, LLC. 10-Node Tree ►Simpler tree is easier to interpret. ►Less tendency toward over-fitting. ►Will be more misclassifications. ►Here, most defect hotspots involve just 2 variables. 14
  • 14. © 2020 Minitab, LLC. Receiver Operating Characteristic (ROC) Curve ►Plot of True Positive Rate vs False Positive Rate ►For a random classifier True Positive Rate = False Positive Rate 15 Sensitivity = TP/(TP + FN) Specificity = TN/(FP + TN) Model = Fail Model = Pass True Fail #TP #FN True Pass #FP #TN
  • 15. © 2020 Minitab, LLC. Confusion Matrix 16 Model = Fail Model = Pass True Fail #TP #FN True Pass #FP #TN Sensitivity = TP/(TP + FN) Specificity = TN/(FP + TN)
  • 16. © 2020 Minitab, LLC. Variable Importance 17
  • 17. © 2020 Minitab, LLC. Hot Spots ►Overall defect rate = 2.9% ►Four nodes had defect rates > 15% 18
  • 18. © 2020 Minitab, LLC. Example: Glass Manufacturing Process A glass bottle manufacturer wants to use past data to: ►Determine what increases per unit cost of manufacture. ►Predict the per unit cost of manufacture of custom orders to determine pricing. 19
  • 19. © 2020 Minitab, LLC. Relationships Between Continuous Variables 20
  • 20. © 2020 Minitab, LLC. Counts by Plant, Segment and Color 21
  • 21. © 2020 Minitab, LLC. CART Regression Tree ►Find predictor and split point that best separates the cases into two groups with similar response values. ►Splitting criterion is usually least squared error. For this method, predictions (fits) are mean of data in each terminal node. 22
  • 22. © 2020 Minitab, LLC. CART Regression Tree (Least Squares Loss) ►For each predictor: 1. Sort smallest to largest 2. Calculate split improvement for each possible split point 3. Select best split point for variable 4. Compare best split improvement across all variables ►For continuous predictor, split point is midpoint between distinct values ►For categorical predictor, split point is a set of values 23
  • 23. © 2020 Minitab, LLC. CART Regression Tree Example 1. Sort by X1 24
  • 24. © 2020 Minitab, LLC. CART Regression Tree Example 2. Split at each possible split point and calculate split improvement. 25
  • 25. © 2020 Minitab, LLC. CART Regression Tree Example 3. Select the best split point for variable. 4. Compare best split improvement across all variables. 26
  • 26. © 2020 Minitab, LLC. CART Regression Tree Example 27
  • 27. © 2020 Minitab, LLC. CART Regression Tree 28
  • 28. © 2020 Minitab, LLC. CART Regression Tree ►Even with small number of predictors, optimal tree can be large. ►Interpret the tree using visualizations based on stored fits and terminal node information. 29
  • 29. © 2020 Minitab, LLC. Model Selection “Optimal” model within 1 SE of highest Test R2. 30
  • 30. © 2020 Minitab, LLC. Using CART for Prediction (Fit) ►Classification tree: Can use either predicted class or predicted probability of terminal node. ►Regression tree: Fit is mean or median of terminal node, depending on estimation method. 31
  • 31. © 2020 Minitab, LLC. Diagnostics 32
  • 32. © 2020 Minitab, LLC. Exploring the CART Results ►Store the Fits and Terminal Node Ids when creating the model. ►Use these stored values and Minitab’s visualization tools to explore results. 33
  • 33. © 2020 Minitab, LLC. Terminal Node Sample Sizes 34
  • 34. © 2020 Minitab, LLC. Variable Importance 35
  • 35. © 2020 Minitab, LLC. Predicted Cost by Segment 36
  • 36. © 2020 Minitab, LLC. Highest Predicted Cost 37 Brushing Mode Add Variables and Make Indicator Variable
  • 37. © 2020 Minitab, LLC. Terminal Node 71 vs Other Nodes 38
  • 38. © 2020 Minitab, LLC. Terminal Node 71 Predictor Demographics 39
  • 39. © 2020 Minitab, LLC. What is the Predicted Manufacturing Cost? 40
  • 40. © 2020 Minitab, LLC. Take-aways ►Overview of Classification and Regression Trees ►Model Selection ►Visualizing Information from Large Trees ►Prediction Questions! Cheryl Pammer cpammer@minitab.com 41
  • 41. © 2020 Minitab, LLC. Upcoming Webinars and Virtual Events • Maximize Efficiency: Using Minitab Workspace and Minitab Statistical Software to Amplify Output and Meet Critical Demand Time: Wednesday 22 July, 12PM AEST (10AM HKT / 2PM NZST) • Turning Words in to Numbers and Numbers into Data-Driven Insights with Minitab, Python and Text Mining Time: Wednesday 29 July, 12PM AEST (10AM HKT / 2PM NZST) See all the details and sign up at: https://info.minitab.com/resources/webinars/webinar-wednesdays-apac
  • 42. © 2020 Minitab, LLC. Upcoming Webinars and Virtual Events • Online/Virtual Training Minitab is now offering virtual training taught by Minitab experts – perfect for remote/home workers. Visit www.minitab.com/training/training for more info. • Talk to Minitab Complimentary resources to help you deal quickly with today's challenges and changing environment. Visit www.minitab.com and click on the Talk to Minitab button and a Minitab representative will be in touch!
  • 43. © 2020 Minitab, LLC. Our Approach: More Than Business Analytics… Solutions Analytics Software Services Training Learn first-hand by attending public or customized trainings in your facilities according to your requirements. Statistical Consulting Personalized help with statistical challenges from collecting the right data to interpreting analysis more. Support Assistance with installation, implementation, version updates and license management. Master statistics and Minitab anywhere with online training Machine learning and predictive analytics software Start, track, manage and execute improvement projects with real-time dashboards Powerful statistical software everyone can use. Data Analysis Predictive Modeling Visual Business Tools Project Oversight Visual tools to process and product excellence. Online Training Solutions analytics is our integrated approach to providing software and services that enable organizations to make better decisions that drive business excellence.