SlideShare a Scribd company logo
1 of 30
Introduction to CART Dan Steinberg Mykhaylo Golovnya [email_address] August, 2009
In The Beginning… ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Years of Struggle  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Final Triumph ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
CART ®  Overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Heart Disease Classification Problem
Typical CART Solution ,[object Object],[object Object],[object Object],[object Object],PATIENTS = 215 SURVIVE 178 82.8% DEAD 37 17.2% Is BP<=91? PATIENTS = 195 SURVIVE 172 88.2% DEAD 23 11.8% Is AGE<=62.5? PATIENTS = 91 SURVIVE 70 76.9% DEAD 21 23.1% Is SINUS<=.5? <= 91 > 91 <= 62.5 > 62.5 >.5 <=.5 ,[object Object],[object Object],[object Object],[object Object],Terminal Node A SURVIVE 6 30.0% DEAD 14 70.0% NODE = DEAD Terminal Node B SURVIVE 102 98.1% DEAD 2   1.9% NODE = SURVIVE Terminal Node C SURVIVE 14 50.0% DEAD 14 50.0% NODE = DEAD Terminal Node D SURVIVE 56 88.9% DEAD 7 11.1% NODE = SURVIVE
General Workflow Stage 1 Stage 2  Stage 3 Historical Data Learn Test Validate Build a Sequence of Nested Trees Monitor Performance Best Confirm Findings
Decision Questions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Tree is a Classifier ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Importance of Binary Splits ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Competitors and Surrogates ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Utility of Surrogates ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Competitors and Surrogates Are Different   ,[object Object],[object Object],[object Object],[object Object],A B C A B C Split X A B C A C B Split Y ,[object Object],[object Object],[object Object]
Tree Interpretation and Use ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],CART ® - Pros and Cons
Example: Marketing Study ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Cell Phone Study: Root Node ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Optimal Model
Variable Importance and Predictive Accuracy ,[object Object],[object Object],[object Object],[object Object]
Introduction to Hot Spot Detection ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Internal Class Assignment Rule ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
General Rules ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Impact of Priors ,[object Object],[object Object],[object Object], r =0.1 ,   b =0.9  r =0.9 ,   b =0.1  r =0.5 ,   b =0.5
Varying Priors – the Key to Hot Spot Detection ,[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],Hot Spot Detection ,[object Object],[object Object]
Improving Feature Selection Process ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Constrained Trees ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Further Development of CART
References ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Unit 4 - BRM
Unit 4 - BRMUnit 4 - BRM
Unit 4 - BRM
Ritu_3
 
Discrete And Continuous Simulation
Discrete And Continuous SimulationDiscrete And Continuous Simulation
Discrete And Continuous Simulation
Nguyen Chien
 

What's hot (20)

Decision tree
Decision treeDecision tree
Decision tree
 
Decision tree
Decision treeDecision tree
Decision tree
 
Decision tree induction
Decision tree inductionDecision tree induction
Decision tree induction
 
Textmining Predictive Models
Textmining Predictive ModelsTextmining Predictive Models
Textmining Predictive Models
 
2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 
Unit 4 - BRM
Unit 4 - BRMUnit 4 - BRM
Unit 4 - BRM
 
Research methods module 5 msf
Research methods module 5 msfResearch methods module 5 msf
Research methods module 5 msf
 
83 learningdecisiontree
83 learningdecisiontree83 learningdecisiontree
83 learningdecisiontree
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Exploratory data analysis project
Exploratory data analysis project Exploratory data analysis project
Exploratory data analysis project
 
Decision tree presentation
Decision tree presentationDecision tree presentation
Decision tree presentation
 
Dsa unit 1
Dsa unit 1Dsa unit 1
Dsa unit 1
 
Discrete And Continuous Simulation
Discrete And Continuous SimulationDiscrete And Continuous Simulation
Discrete And Continuous Simulation
 
Classification
ClassificationClassification
Classification
 
Bbs11 ppt ch02
Bbs11 ppt ch02Bbs11 ppt ch02
Bbs11 ppt ch02
 
13 random forest
13 random forest13 random forest
13 random forest
 
Eda sri
Eda sriEda sri
Eda sri
 
Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...
Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...
Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...
 
Bbs11 ppt ch01
Bbs11 ppt ch01Bbs11 ppt ch01
Bbs11 ppt ch01
 

Viewers also liked

Forgiveness
ForgivenessForgiveness
Forgiveness
Educator
 
Citizens copenhagen dk_2013_june-2
Citizens copenhagen dk_2013_june-2Citizens copenhagen dk_2013_june-2
Citizens copenhagen dk_2013_june-2
consumerenergy
 
How Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
How Chinese Teens Use Digital: Getting to Know Your Customers of TomorrowHow Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
How Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
Labbrand
 
Information about planets and houses
Information about planets and housesInformation about planets and houses
Information about planets and houses
BASKARAN P
 
Gasteizko irteera 2B BOLIPORT
Gasteizko irteera 2B BOLIPORTGasteizko irteera 2B BOLIPORT
Gasteizko irteera 2B BOLIPORT
arbelar
 
Offshore Software Development, Software Testing by CAMO Solutions
Offshore Software Development, Software Testing by CAMO SolutionsOffshore Software Development, Software Testing by CAMO Solutions
Offshore Software Development, Software Testing by CAMO Solutions
CAMO Solutions LLC
 

Viewers also liked (20)

Neural network & its applications
Neural network & its applications Neural network & its applications
Neural network & its applications
 
Forgiveness
ForgivenessForgiveness
Forgiveness
 
12 days of_christmas
12 days of_christmas12 days of_christmas
12 days of_christmas
 
Easy lift application
Easy lift applicationEasy lift application
Easy lift application
 
Incubix
IncubixIncubix
Incubix
 
Citizens copenhagen dk_2013_june-2
Citizens copenhagen dk_2013_june-2Citizens copenhagen dk_2013_june-2
Citizens copenhagen dk_2013_june-2
 
How Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
How Chinese Teens Use Digital: Getting to Know Your Customers of TomorrowHow Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
How Chinese Teens Use Digital: Getting to Know Your Customers of Tomorrow
 
Daftarhadir&nilai kapita selekta
Daftarhadir&nilai kapita selektaDaftarhadir&nilai kapita selekta
Daftarhadir&nilai kapita selekta
 
The dark Flood
The dark FloodThe dark Flood
The dark Flood
 
Lex analysis
Lex analysisLex analysis
Lex analysis
 
презентация Microsoft office power point
презентация Microsoft office power pointпрезентация Microsoft office power point
презентация Microsoft office power point
 
2003
20032003
2003
 
Information about planets and houses
Information about planets and housesInformation about planets and houses
Information about planets and houses
 
Viterbi2
Viterbi2Viterbi2
Viterbi2
 
Prospectus 2013 final
Prospectus 2013 finalProspectus 2013 final
Prospectus 2013 final
 
Gasteizko irteera 2B BOLIPORT
Gasteizko irteera 2B BOLIPORTGasteizko irteera 2B BOLIPORT
Gasteizko irteera 2B BOLIPORT
 
Elemzés
ElemzésElemzés
Elemzés
 
Esco and schools dec. 13
Esco and schools dec. 13Esco and schools dec. 13
Esco and schools dec. 13
 
Mini Neons
Mini Neons Mini Neons
Mini Neons
 
Offshore Software Development, Software Testing by CAMO Solutions
Offshore Software Development, Software Testing by CAMO SolutionsOffshore Software Development, Software Testing by CAMO Solutions
Offshore Software Development, Software Testing by CAMO Solutions
 

Similar to Introduction to cart_2009

The Use Of Decision Trees For Adaptive Item
The Use Of Decision Trees For Adaptive ItemThe Use Of Decision Trees For Adaptive Item
The Use Of Decision Trees For Adaptive Item
barthriley
 
On cascading small decision trees
On cascading small decision treesOn cascading small decision trees
On cascading small decision trees
Julià Minguillón
 
Module III - Classification Decision tree (1).pptx
Module III - Classification Decision tree (1).pptxModule III - Classification Decision tree (1).pptx
Module III - Classification Decision tree (1).pptx
Shivakrishnan18
 
Data Mining in Market Research
Data Mining in Market ResearchData Mining in Market Research
Data Mining in Market Research
butest
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
kevinlan
 

Similar to Introduction to cart_2009 (20)

Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests
 
Classification
ClassificationClassification
Classification
 
decisiontrees (3).ppt
decisiontrees (3).pptdecisiontrees (3).ppt
decisiontrees (3).ppt
 
decisiontrees.ppt
decisiontrees.pptdecisiontrees.ppt
decisiontrees.ppt
 
decisiontrees.ppt
decisiontrees.pptdecisiontrees.ppt
decisiontrees.ppt
 
The Use Of Decision Trees For Adaptive Item
The Use Of Decision Trees For Adaptive ItemThe Use Of Decision Trees For Adaptive Item
The Use Of Decision Trees For Adaptive Item
 
DIY market segmentation 20170125
DIY market segmentation 20170125DIY market segmentation 20170125
DIY market segmentation 20170125
 
Decision Trees for Classification: A Machine Learning Algorithm
Decision Trees for Classification: A Machine Learning AlgorithmDecision Trees for Classification: A Machine Learning Algorithm
Decision Trees for Classification: A Machine Learning Algorithm
 
ADAN Symposium
ADAN SymposiumADAN Symposium
ADAN Symposium
 
On cascading small decision trees
On cascading small decision treesOn cascading small decision trees
On cascading small decision trees
 
Data collection,tabulation,processing and analysis
Data collection,tabulation,processing and analysisData collection,tabulation,processing and analysis
Data collection,tabulation,processing and analysis
 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)
 
Store segmentation progresso
Store segmentation progressoStore segmentation progresso
Store segmentation progresso
 
6238578.ppt
6238578.ppt6238578.ppt
6238578.ppt
 
Novel Frequency Domain Classification Algorithm Based On Parameter Weight Fac...
Novel Frequency Domain Classification Algorithm Based On Parameter Weight Fac...Novel Frequency Domain Classification Algorithm Based On Parameter Weight Fac...
Novel Frequency Domain Classification Algorithm Based On Parameter Weight Fac...
 
Module III - Classification Decision tree (1).pptx
Module III - Classification Decision tree (1).pptxModule III - Classification Decision tree (1).pptx
Module III - Classification Decision tree (1).pptx
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
 
Data Mining in Market Research
Data Mining in Market ResearchData Mining in Market Research
Data Mining in Market Research
 
Data Mining In Market Research
Data Mining In Market ResearchData Mining In Market Research
Data Mining In Market Research
 
Classification and decision tree classifier machine learning
Classification and decision tree classifier machine learningClassification and decision tree classifier machine learning
Classification and decision tree classifier machine learning
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Introduction to cart_2009

  • 1. Introduction to CART Dan Steinberg Mykhaylo Golovnya [email_address] August, 2009
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. General Workflow Stage 1 Stage 2 Stage 3 Historical Data Learn Test Validate Build a Sequence of Nested Trees Monitor Performance Best Confirm Findings
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.