SlideShare une entreprise Scribd logo
1  sur  39
www.edureka.co/decision-tree-Modeling-using-r
Decision Tree for predictive modeling
Slide 2 www.edureka.co/decision-tree-Modeling-using-r
Agenda
 Business need of a model
 Anatomy of a decision tree
 Advantage of using decision tree in the business scenario
 Usage of decision tree techniques in business
 Key decision tree features
 Course framework
At the end of the session we would learn about :
Slide 3 www.edureka.co/decision-tree-Modeling-using-r
Business Scenario – Need of a Model
Slide 4 www.edureka.co/decision-tree-Modeling-using-rSlide 4
Business Scenario – Need of a Model?
Business is unhappy
with such a poor
response rate
Say 100,000 prospect
Say 1,000 takes up the product
Slide 5 www.edureka.co/decision-tree-Modeling-using-rSlide 5
Business Scenario – Need of a Model?
Think of – if $2 is the cost of mailer then one has spend
$200 per new customer acquisition, right?
Can we find a base where by working on less number of
prospect, we can still get almost all the responder
Business is unhappy
with such a poor
response rate
Say 100,000 prospect
Say 1,000 takes up the product
Slide 6 www.edureka.co/decision-tree-Modeling-using-rSlide 6
Business Scenario – Need of a Model?
Say by working on 20000 prospect
Can we get 900 responder
Think of – if $2 is the cost of mailer then one has spend
$200 per new customer acquisition, right?
Can we find a base where by working on less number of
prospect, we can still get almost all the responder
Business is unhappy
with such a poor
response rate
Say 100,000 prospect
Say 1,000 takes up the product
Slide 7 www.edureka.co/decision-tree-Modeling-using-rSlide 7
Business Scenario – Need of a Model?
Say by working on 20000 prospect
Can we get 900 responder
Note – no possibility of exact match in real life scenarios
Also very rare possibility of getting all the responder by
working on part of population
Target is to get almost all the responder by working on
only small portion of the population
Think of – if $2 is the cost of mailer then one has spend
$200 per new customer acquisition, right?
Can we find a base where by working on less number of
prospect, we can still get almost all the responder
Business is unhappy
with such a poor
response rate
Say 100,000 prospect
Say 1,000 takes up the product
Slide 8 www.edureka.co/decision-tree-Modeling-using-rSlide 8
So the Target is …..
Target is to get almost all the responder by working on only part of the population
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
Slide 9 www.edureka.co/decision-tree-Modeling-using-rSlide 9
So the Target is …..
Target is to get almost all the responder by working on only part of the population
Population – N
Responder – K
X % of Population N
Y %– of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
Slide 10 www.edureka.co/decision-tree-Modeling-using-rSlide 10
So the Target is …..
Target is to get almost all the responder by working on only part of the population
Note RGB concept
» Green the bench mark response rate
» more response rate – red
» Less response rate – blue
Work on red / blue– higher response/lower response rate section
Population – N
Responder – K
X % of Population N
Y %– of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
Slide 11 www.edureka.co/decision-tree-Modeling-using-r
Decision Tree Example –
Understand the Anatomy
Slide 12 www.edureka.co/decision-tree-Modeling-using-rSlide 12
Decision Tree Example
Send files to bureau for credit worthiness of existing customers
70% gets good rating, 30% bad rating
30%
70%
N
Y
Credit Rating Y: Good, N: Bad
Slide 13 www.edureka.co/decision-tree-Modeling-using-rSlide 13
Send files to bureau for credit worthiness of existing customers
70% gets good rating, 30% bad rating
Say $5 is the cost of sending each record for check to bureau
Can we send records selectively to only those base where we have doubts
Because ultimately, we want to stop loss and want to know, who will get bad rating hence risky
Decision Tree Example (Contd.)
30%
70%
N
Y
Credit Rating Y: Good, N: Bad
Slide 14 www.edureka.co/decision-tree-Modeling-using-rSlide 14
Decision Tree Example (Contd.)
Can we forecast, among current population, who will Have good credit rating
Decision tree improves the accuracy of decisioning
A
30%
70%
N
Y
Credit Rating Y: Good, N: Bad
Slide 15 www.edureka.co/decision-tree-Modeling-using-rSlide 15
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT Root Note
2
3
Decision Tree Example (Contd.)
Slide 16 www.edureka.co/decision-tree-Modeling-using-rSlide 16
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT Root Note
Leaf Node
2
3
Decision Tree Example (Contd.)
Slide 17 www.edureka.co/decision-tree-Modeling-using-rSlide 17
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT Root Note
Leaf Node
CHK_ACCT < 1.5 and
Duration >= 22.5 and
SAV_ACCT < 2.5
2
3
Decision Tree Example (Contd.)
Slide 18 www.edureka.co/decision-tree-Modeling-using-rSlide 18
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
Root Note
Leaf Node
CHK_ACCT < 1.5 and
Duration >= 22.5 and
SAV_ACCT < 2.5
2
3
Decision Tree Example (Contd.)
Slide 19 www.edureka.co/decision-tree-Modeling-using-rSlide 19
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
Root Note
Leaf Node
CHK_ACCT < 1.5 and
Duration >= 22.5 and
SAV_ACCT < 2.5
Node Size
Depth
2
3
Decision Tree Example (Contd.)
Slide 20 www.edureka.co/decision-tree-Modeling-using-r
Decision Tree Example –
Understand the Gain from Decision Tree
Slide 21 www.edureka.co/decision-tree-Modeling-using-rSlide 21
Decision Tree Example
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
2
3
>=1.5<1.5
<22.5>=22.5
>=2.5
Node 4
(37%)
Node 5
(71%)
Node 6
(65%)
SAV_ACCT
Duration NODE 7
(87%)
CHK_ACCT
(70%)
<2.5
Slide 22 www.edureka.co/decision-tree-Modeling-using-rSlide 22
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Decision Tree Example (Contd.)
Slide 23 www.edureka.co/decision-tree-Modeling-using-rSlide 23
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Decision Tree Example (Contd.)
Understand gain by working on different nodes
Slide 24 www.edureka.co/decision-tree-Modeling-using-rSlide 24
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Decision Tree Example (Contd.)
Understand gain by working on different nodes
Now we can keep a documentation cell to demand more document from a subset of population and then send
them to bureau after receipt of documents
Slide 25 www.edureka.co/decision-tree-Modeling-using-rSlide 25
RGB Concepts
Decision Tree Example (Contd.)
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
Slide 26 www.edureka.co/decision-tree-Modeling-using-rSlide 26
C1 = 3, C2=3
RGB Concepts
C1 = 1, C2=2C1 = 2, C2=1
Decision Tree Example (Contd.)
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
Slide 27 www.edureka.co/decision-tree-Modeling-using-rSlide 27
RGB Concepts
Decision Tree Example (Contd.)
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
Slide 28 www.edureka.co/decision-tree-Modeling-using-rSlide 28
RGB Concepts
Decision Tree Example (Contd.)
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Slide 29 www.edureka.co/decision-tree-Modeling-using-rSlide 29
RGB Concepts
Decision Tree Example (Contd.)
Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
70%
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Slide 30 www.edureka.co/decision-tree-Modeling-using-rSlide 30
RGB Concepts
Decision Tree Example (Contd.)
70%Population – N
Responder – K
X % of Population N
Y % – of Responder K
Y > X
1 – X% of Population – N
1 – Y% of Responder – K
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0.2
0
1
0.8
0.6
0.4
0
0.2
1
0.8
0.6
0.4
0
0.2
Z
Y
Z
Y
Z
Y
Z
Y
Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457)
<2.5 ≥2.5
≥22.5 <22.5
<1.5 ≥1.5
1
SAV_ACCT
DURATION
CHK_ACCT
37%
71% 65% 87%
2
3
70%
Slide 31 www.edureka.co/decision-tree-Modeling-using-r
Business Applications of a Decision Tree
– Use of a Model
Slide 32 www.edureka.co/decision-tree-Modeling-using-rSlide 32
Business Scenario and Advantage
Among prospect, Find who will default vs. non defaulter
» So by not giving loan to set of prospect, you avoid lots of bad loan
Slide 33 www.edureka.co/decision-tree-Modeling-using-rSlide 33
Business Scenario and Advantage
Among prospect, Find who will default vs. non defaulter
Slide 34 www.edureka.co/decision-tree-Modeling-using-rSlide 34
Business Scenario and Advantage (Contd.)
Among patients profile, who will respond better with such treatment
» So by putting rest of them into another kind of treatment
Among customers, Find profile of those who will attrite vs. those will stay with the business
» So by targeting such customer you can reduce attrition?
Among applicants, Find which are the applicants, who can be fraud (such as cases of account take over)
» So by working on few selected applications you can avoid lots of account take over fraud cases
Among prospect of home loan pool, Find who are the prospects customer, who will switch over their home loan
» So by not working on few prospect, bank can quickly grow their portfolio by taking over existing home
loans
Find who among current base will move into delinquency
» So that their credit limit can be reduced to reduce exposure and losses
Slide 35 www.edureka.co/decision-tree-Modeling-using-r
Key decision tree features
Slide 36 www.edureka.co/decision-tree-Modeling-using-rSlide 36
Key Decision Tree features
Automated field selection
» handles any number of fields
» automatically selects relevant fields
Little data preprocessing needed
» Does not require any kind of variable transforms
» Impervious to outliers
Missing value tolerant
» Moderate loss of accuracy due to missing values
Quick development and validation
Slide 37 www.edureka.co/decision-tree-Modeling-using-r
Questions
Slide 38 www.edureka.co/decision-tree-Modeling-using-r
Your feedback is important to us, be it a compliment, a suggestion or a complaint. It helps us to make
the course better!
Please spare few seconds to take the survey after the webinar.
www.edureka.co/
Survey
Slide 39 www.edureka.co/decision-tree-Modeling-using-r

Contenu connexe

Similaire à Decision Tree for Predictive Modeling

Customer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustCustomer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustJim Porzak
 
Tangible Assets Expenditure PowerPoint Presentation Slides
Tangible Assets Expenditure PowerPoint Presentation Slides Tangible Assets Expenditure PowerPoint Presentation Slides
Tangible Assets Expenditure PowerPoint Presentation Slides SlideTeam
 
RST Official Presentation - RST Ecoenergy Private Limited
RST Official Presentation - RST Ecoenergy Private LimitedRST Official Presentation - RST Ecoenergy Private Limited
RST Official Presentation - RST Ecoenergy Private LimitedRST ECOENERGY PVT. LTD.
 
Example Budget Presentation Powerpoint Presentation Slides
Example Budget Presentation Powerpoint Presentation SlidesExample Budget Presentation Powerpoint Presentation Slides
Example Budget Presentation Powerpoint Presentation SlidesSlideTeam
 
Fixed Asset Expenditure PowerPoint Presentation Slides
Fixed Asset Expenditure PowerPoint Presentation Slides Fixed Asset Expenditure PowerPoint Presentation Slides
Fixed Asset Expenditure PowerPoint Presentation Slides SlideTeam
 
Example Budget Presentation PowerPoint Presentation Slides
Example Budget Presentation PowerPoint Presentation SlidesExample Budget Presentation PowerPoint Presentation Slides
Example Budget Presentation PowerPoint Presentation SlidesSlideTeam
 
3 scenarios where predictive analytics is a must!
3 scenarios where predictive analytics is a must!3 scenarios where predictive analytics is a must!
3 scenarios where predictive analytics is a must!Edureka!
 
Business Proposal For Recruitment Agency Powerpoint Presentation Slides
Business Proposal For Recruitment Agency Powerpoint Presentation SlidesBusiness Proposal For Recruitment Agency Powerpoint Presentation Slides
Business Proposal For Recruitment Agency Powerpoint Presentation SlidesSlideTeam
 
Advanced Strategies and Analytics for Campus Green Revolving Funds
Advanced Strategies and Analytics for Campus Green Revolving FundsAdvanced Strategies and Analytics for Campus Green Revolving Funds
Advanced Strategies and Analytics for Campus Green Revolving FundsJoe Indvik
 
Risk And Relevance 20080414ppt
Risk And Relevance 20080414pptRisk And Relevance 20080414ppt
Risk And Relevance 20080414pptgregoryg
 
Risk And Relevance 20080414ppt
Risk And Relevance 20080414pptRisk And Relevance 20080414ppt
Risk And Relevance 20080414pptgregoryg
 
New Product Cost Assessment PowerPoint Presentation Slides
New Product Cost Assessment PowerPoint Presentation SlidesNew Product Cost Assessment PowerPoint Presentation Slides
New Product Cost Assessment PowerPoint Presentation SlidesSlideTeam
 
New Product Performance Cost Analysis PowerPoint Presentation Slides
New Product Performance Cost Analysis PowerPoint Presentation SlidesNew Product Performance Cost Analysis PowerPoint Presentation Slides
New Product Performance Cost Analysis PowerPoint Presentation SlidesSlideTeam
 
The ROI of UX Research - Featuring Susan Weinshenk of The Team W
The ROI of UX Research - Featuring Susan Weinshenk of The Team WThe ROI of UX Research - Featuring Susan Weinshenk of The Team W
The ROI of UX Research - Featuring Susan Weinshenk of The Team WUserZoom
 
Fall, 2013STA 2122, Social Science StatisticsD. SahaName ___.docx
Fall, 2013STA 2122, Social Science StatisticsD. SahaName  ___.docxFall, 2013STA 2122, Social Science StatisticsD. SahaName  ___.docx
Fall, 2013STA 2122, Social Science StatisticsD. SahaName ___.docxmydrynan
 
Cost and schedule risk modelling - Range analysis
Cost and schedule risk modelling - Range analysisCost and schedule risk modelling - Range analysis
Cost and schedule risk modelling - Range analysisStephen Grey
 
Implementing Design Thinking Powerpoint Presentation Slides
Implementing Design Thinking Powerpoint Presentation SlidesImplementing Design Thinking Powerpoint Presentation Slides
Implementing Design Thinking Powerpoint Presentation SlidesSlideTeam
 
Implementing Design Thinking PowerPoint Presentation Slides
Implementing Design Thinking PowerPoint Presentation Slides Implementing Design Thinking PowerPoint Presentation Slides
Implementing Design Thinking PowerPoint Presentation Slides SlideTeam
 

Similaire à Decision Tree for Predictive Modeling (20)

Customer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustCustomer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclust
 
Tangible Assets Expenditure PowerPoint Presentation Slides
Tangible Assets Expenditure PowerPoint Presentation Slides Tangible Assets Expenditure PowerPoint Presentation Slides
Tangible Assets Expenditure PowerPoint Presentation Slides
 
RST Official Presentation - RST Ecoenergy Private Limited
RST Official Presentation - RST Ecoenergy Private LimitedRST Official Presentation - RST Ecoenergy Private Limited
RST Official Presentation - RST Ecoenergy Private Limited
 
Example Budget Presentation Powerpoint Presentation Slides
Example Budget Presentation Powerpoint Presentation SlidesExample Budget Presentation Powerpoint Presentation Slides
Example Budget Presentation Powerpoint Presentation Slides
 
Fixed Asset Expenditure PowerPoint Presentation Slides
Fixed Asset Expenditure PowerPoint Presentation Slides Fixed Asset Expenditure PowerPoint Presentation Slides
Fixed Asset Expenditure PowerPoint Presentation Slides
 
Example Budget Presentation PowerPoint Presentation Slides
Example Budget Presentation PowerPoint Presentation SlidesExample Budget Presentation PowerPoint Presentation Slides
Example Budget Presentation PowerPoint Presentation Slides
 
3 scenarios where predictive analytics is a must!
3 scenarios where predictive analytics is a must!3 scenarios where predictive analytics is a must!
3 scenarios where predictive analytics is a must!
 
Linear Programming-1.ppt
Linear Programming-1.pptLinear Programming-1.ppt
Linear Programming-1.ppt
 
Business Proposal For Recruitment Agency Powerpoint Presentation Slides
Business Proposal For Recruitment Agency Powerpoint Presentation SlidesBusiness Proposal For Recruitment Agency Powerpoint Presentation Slides
Business Proposal For Recruitment Agency Powerpoint Presentation Slides
 
Advanced Strategies and Analytics for Campus Green Revolving Funds
Advanced Strategies and Analytics for Campus Green Revolving FundsAdvanced Strategies and Analytics for Campus Green Revolving Funds
Advanced Strategies and Analytics for Campus Green Revolving Funds
 
Risk And Relevance 20080414ppt
Risk And Relevance 20080414pptRisk And Relevance 20080414ppt
Risk And Relevance 20080414ppt
 
Risk And Relevance 20080414ppt
Risk And Relevance 20080414pptRisk And Relevance 20080414ppt
Risk And Relevance 20080414ppt
 
New Product Cost Assessment PowerPoint Presentation Slides
New Product Cost Assessment PowerPoint Presentation SlidesNew Product Cost Assessment PowerPoint Presentation Slides
New Product Cost Assessment PowerPoint Presentation Slides
 
New Product Performance Cost Analysis PowerPoint Presentation Slides
New Product Performance Cost Analysis PowerPoint Presentation SlidesNew Product Performance Cost Analysis PowerPoint Presentation Slides
New Product Performance Cost Analysis PowerPoint Presentation Slides
 
The ROI of UX Research - Featuring Susan Weinshenk of The Team W
The ROI of UX Research - Featuring Susan Weinshenk of The Team WThe ROI of UX Research - Featuring Susan Weinshenk of The Team W
The ROI of UX Research - Featuring Susan Weinshenk of The Team W
 
Fall, 2013STA 2122, Social Science StatisticsD. SahaName ___.docx
Fall, 2013STA 2122, Social Science StatisticsD. SahaName  ___.docxFall, 2013STA 2122, Social Science StatisticsD. SahaName  ___.docx
Fall, 2013STA 2122, Social Science StatisticsD. SahaName ___.docx
 
Cost and schedule risk modelling - Range analysis
Cost and schedule risk modelling - Range analysisCost and schedule risk modelling - Range analysis
Cost and schedule risk modelling - Range analysis
 
Decision Trees
Decision TreesDecision Trees
Decision Trees
 
Implementing Design Thinking Powerpoint Presentation Slides
Implementing Design Thinking Powerpoint Presentation SlidesImplementing Design Thinking Powerpoint Presentation Slides
Implementing Design Thinking Powerpoint Presentation Slides
 
Implementing Design Thinking PowerPoint Presentation Slides
Implementing Design Thinking PowerPoint Presentation Slides Implementing Design Thinking PowerPoint Presentation Slides
Implementing Design Thinking PowerPoint Presentation Slides
 

Plus de Edureka!

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaEdureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaEdureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaEdureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaEdureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaEdureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaEdureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaEdureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaEdureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaEdureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaEdureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | EdurekaEdureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEdureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEdureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaEdureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaEdureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaEdureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaEdureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaEdureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | EdurekaEdureka!
 

Plus de Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Dernier

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Dernier (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Decision Tree for Predictive Modeling

  • 2. Slide 2 www.edureka.co/decision-tree-Modeling-using-r Agenda  Business need of a model  Anatomy of a decision tree  Advantage of using decision tree in the business scenario  Usage of decision tree techniques in business  Key decision tree features  Course framework At the end of the session we would learn about :
  • 4. Slide 4 www.edureka.co/decision-tree-Modeling-using-rSlide 4 Business Scenario – Need of a Model? Business is unhappy with such a poor response rate Say 100,000 prospect Say 1,000 takes up the product
  • 5. Slide 5 www.edureka.co/decision-tree-Modeling-using-rSlide 5 Business Scenario – Need of a Model? Think of – if $2 is the cost of mailer then one has spend $200 per new customer acquisition, right? Can we find a base where by working on less number of prospect, we can still get almost all the responder Business is unhappy with such a poor response rate Say 100,000 prospect Say 1,000 takes up the product
  • 6. Slide 6 www.edureka.co/decision-tree-Modeling-using-rSlide 6 Business Scenario – Need of a Model? Say by working on 20000 prospect Can we get 900 responder Think of – if $2 is the cost of mailer then one has spend $200 per new customer acquisition, right? Can we find a base where by working on less number of prospect, we can still get almost all the responder Business is unhappy with such a poor response rate Say 100,000 prospect Say 1,000 takes up the product
  • 7. Slide 7 www.edureka.co/decision-tree-Modeling-using-rSlide 7 Business Scenario – Need of a Model? Say by working on 20000 prospect Can we get 900 responder Note – no possibility of exact match in real life scenarios Also very rare possibility of getting all the responder by working on part of population Target is to get almost all the responder by working on only small portion of the population Think of – if $2 is the cost of mailer then one has spend $200 per new customer acquisition, right? Can we find a base where by working on less number of prospect, we can still get almost all the responder Business is unhappy with such a poor response rate Say 100,000 prospect Say 1,000 takes up the product
  • 8. Slide 8 www.edureka.co/decision-tree-Modeling-using-rSlide 8 So the Target is ….. Target is to get almost all the responder by working on only part of the population Population – N Responder – K X % of Population N Y % – of Responder K Y > X
  • 9. Slide 9 www.edureka.co/decision-tree-Modeling-using-rSlide 9 So the Target is ….. Target is to get almost all the responder by working on only part of the population Population – N Responder – K X % of Population N Y %– of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K
  • 10. Slide 10 www.edureka.co/decision-tree-Modeling-using-rSlide 10 So the Target is ….. Target is to get almost all the responder by working on only part of the population Note RGB concept » Green the bench mark response rate » more response rate – red » Less response rate – blue Work on red / blue– higher response/lower response rate section Population – N Responder – K X % of Population N Y %– of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K
  • 11. Slide 11 www.edureka.co/decision-tree-Modeling-using-r Decision Tree Example – Understand the Anatomy
  • 12. Slide 12 www.edureka.co/decision-tree-Modeling-using-rSlide 12 Decision Tree Example Send files to bureau for credit worthiness of existing customers 70% gets good rating, 30% bad rating 30% 70% N Y Credit Rating Y: Good, N: Bad
  • 13. Slide 13 www.edureka.co/decision-tree-Modeling-using-rSlide 13 Send files to bureau for credit worthiness of existing customers 70% gets good rating, 30% bad rating Say $5 is the cost of sending each record for check to bureau Can we send records selectively to only those base where we have doubts Because ultimately, we want to stop loss and want to know, who will get bad rating hence risky Decision Tree Example (Contd.) 30% 70% N Y Credit Rating Y: Good, N: Bad
  • 14. Slide 14 www.edureka.co/decision-tree-Modeling-using-rSlide 14 Decision Tree Example (Contd.) Can we forecast, among current population, who will Have good credit rating Decision tree improves the accuracy of decisioning A 30% 70% N Y Credit Rating Y: Good, N: Bad
  • 15. Slide 15 www.edureka.co/decision-tree-Modeling-using-rSlide 15 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT Root Note 2 3 Decision Tree Example (Contd.)
  • 16. Slide 16 www.edureka.co/decision-tree-Modeling-using-rSlide 16 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT Root Note Leaf Node 2 3 Decision Tree Example (Contd.)
  • 17. Slide 17 www.edureka.co/decision-tree-Modeling-using-rSlide 17 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT Root Note Leaf Node CHK_ACCT < 1.5 and Duration >= 22.5 and SAV_ACCT < 2.5 2 3 Decision Tree Example (Contd.)
  • 18. Slide 18 www.edureka.co/decision-tree-Modeling-using-rSlide 18 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% Root Note Leaf Node CHK_ACCT < 1.5 and Duration >= 22.5 and SAV_ACCT < 2.5 2 3 Decision Tree Example (Contd.)
  • 19. Slide 19 www.edureka.co/decision-tree-Modeling-using-rSlide 19 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% Root Note Leaf Node CHK_ACCT < 1.5 and Duration >= 22.5 and SAV_ACCT < 2.5 Node Size Depth 2 3 Decision Tree Example (Contd.)
  • 20. Slide 20 www.edureka.co/decision-tree-Modeling-using-r Decision Tree Example – Understand the Gain from Decision Tree
  • 21. Slide 21 www.edureka.co/decision-tree-Modeling-using-rSlide 21 Decision Tree Example 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 2 3 >=1.5<1.5 <22.5>=22.5 >=2.5 Node 4 (37%) Node 5 (71%) Node 6 (65%) SAV_ACCT Duration NODE 7 (87%) CHK_ACCT (70%) <2.5
  • 22. Slide 22 www.edureka.co/decision-tree-Modeling-using-rSlide 22 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70% Decision Tree Example (Contd.)
  • 23. Slide 23 www.edureka.co/decision-tree-Modeling-using-rSlide 23 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70% Decision Tree Example (Contd.) Understand gain by working on different nodes
  • 24. Slide 24 www.edureka.co/decision-tree-Modeling-using-rSlide 24 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70% Decision Tree Example (Contd.) Understand gain by working on different nodes Now we can keep a documentation cell to demand more document from a subset of population and then send them to bureau after receipt of documents
  • 25. Slide 25 www.edureka.co/decision-tree-Modeling-using-rSlide 25 RGB Concepts Decision Tree Example (Contd.) Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K
  • 26. Slide 26 www.edureka.co/decision-tree-Modeling-using-rSlide 26 C1 = 3, C2=3 RGB Concepts C1 = 1, C2=2C1 = 2, C2=1 Decision Tree Example (Contd.) Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K
  • 27. Slide 27 www.edureka.co/decision-tree-Modeling-using-rSlide 27 RGB Concepts Decision Tree Example (Contd.) Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K
  • 28. Slide 28 www.edureka.co/decision-tree-Modeling-using-rSlide 28 RGB Concepts Decision Tree Example (Contd.) Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70%
  • 29. Slide 29 www.edureka.co/decision-tree-Modeling-using-rSlide 29 RGB Concepts Decision Tree Example (Contd.) Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K 70% 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70%
  • 30. Slide 30 www.edureka.co/decision-tree-Modeling-using-rSlide 30 RGB Concepts Decision Tree Example (Contd.) 70%Population – N Responder – K X % of Population N Y % – of Responder K Y > X 1 – X% of Population – N 1 – Y% of Responder – K 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0.2 0 1 0.8 0.6 0.4 0 0.2 1 0.8 0.6 0.4 0 0.2 Z Y Z Y Z Y Z Y Node 4 (n = 196) Node 5 (n = 41) Node 6 (n = 306) Node 7 (n = 457) <2.5 ≥2.5 ≥22.5 <22.5 <1.5 ≥1.5 1 SAV_ACCT DURATION CHK_ACCT 37% 71% 65% 87% 2 3 70%
  • 31. Slide 31 www.edureka.co/decision-tree-Modeling-using-r Business Applications of a Decision Tree – Use of a Model
  • 32. Slide 32 www.edureka.co/decision-tree-Modeling-using-rSlide 32 Business Scenario and Advantage Among prospect, Find who will default vs. non defaulter » So by not giving loan to set of prospect, you avoid lots of bad loan
  • 33. Slide 33 www.edureka.co/decision-tree-Modeling-using-rSlide 33 Business Scenario and Advantage Among prospect, Find who will default vs. non defaulter
  • 34. Slide 34 www.edureka.co/decision-tree-Modeling-using-rSlide 34 Business Scenario and Advantage (Contd.) Among patients profile, who will respond better with such treatment » So by putting rest of them into another kind of treatment Among customers, Find profile of those who will attrite vs. those will stay with the business » So by targeting such customer you can reduce attrition? Among applicants, Find which are the applicants, who can be fraud (such as cases of account take over) » So by working on few selected applications you can avoid lots of account take over fraud cases Among prospect of home loan pool, Find who are the prospects customer, who will switch over their home loan » So by not working on few prospect, bank can quickly grow their portfolio by taking over existing home loans Find who among current base will move into delinquency » So that their credit limit can be reduced to reduce exposure and losses
  • 36. Slide 36 www.edureka.co/decision-tree-Modeling-using-rSlide 36 Key Decision Tree features Automated field selection » handles any number of fields » automatically selects relevant fields Little data preprocessing needed » Does not require any kind of variable transforms » Impervious to outliers Missing value tolerant » Moderate loss of accuracy due to missing values Quick development and validation
  • 38. Slide 38 www.edureka.co/decision-tree-Modeling-using-r Your feedback is important to us, be it a compliment, a suggestion or a complaint. It helps us to make the course better! Please spare few seconds to take the survey after the webinar. www.edureka.co/ Survey