SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Credit Risk Model Building
                     Venkat Reddy
Note
•   This presentation is just the lecture notes for the talk on Credit Risk Model building
    conducted for MBA students
•   The best way to treat this is as a high-level summary; the actual session went more
    in depth and contained other information.
•   Most of this material was written as informal notes, not intended for publication
•   Please send your questions/comments/corrections to
    venkat@trenwiseanalytics.com or 21.venkat@gmail.com
•   Please check my website for latest version of this document
                                                       -Venkat Reddy
Contents


• Applications of Statistics in Business
• The Model Building Problem
• Credit Risk Model Building
• Other Applications of Model Building
Applications of Statistics in Business
Increasingly, business rely on intelligent tools and techniques to analyze data systematically to
improve decision-making.

    Retail sales analytics
    Financial services analytics
    Telecommunications
    Supply Chain analytics
    Transportation analytics
    Risk & Credit analytics
    Talent analytics
    Marketing analytics
    Behavioral analytics
    Collections analytics
    Fraud analytics
    Pricing analytics
The Problem
Who will run away with my money?
•   Citi Bank : Present in more than 90 countries.
•   More than 100,000 customers apply for credit
    cards/loans every month
•   All of them have different characteristics
•   Out of 10o,000 customers, who all have the higher
    probability of default/ Charge off?
•   Basically, who will run away with my money?
•   We need to predict the probability of “Running away”
•   Who all have ‘Gupta Bank’ credit card applications in
    this room?
Bank builds a model that gives a score to each
customer
         Applicants




“Developing set of equations or mathematical formulation to forecast future
behaviors based on current or historical data.”
Predictive Modeling
Lets try to understand predictive modeling

Predictive Modeling – Fitting a model to the data to predict the future.

                                       •   Predicting the future –and it is so easy some times
                                       •   Who is going to score more runs in IPL-2013?
                                       •   That’s it you predicted the future..
                                       •   BTW how did you predict?
                                       •   Predicting the future based on historical data is
                                           nothing but Predictive modeling
Predictive Modeling
Lets try to understand predictive modeling

Predictive Modeling – Fitting a model to the data to predict the future.

                                       • Who is going to score more runs in IPL 2013?
                                       • Predicting the future …well it is not that easy …
Predictive Modeling
Horse Race Analogy

                      How to bet on best horse in a horse race
The Historical Data
Win vs. Loss record in past 2 years

• Long legs: 75% (Horses with long legs won 75% of the times)
• Breed A: 55%, Breed B: 15 % Others : 30%
• T/L (Tummy to length) ratio <1/2 :75 %
• Gender: Male -68%
• Head size: Small 10%, Medium 15% Large 75%
• Country: Africa -65%
Given the historical data
Which one of these two horses would you bet on?
                        Kalyan   Chethak
       Length of legs   150 cm    110 cm
           Breed          A         F
         T/L ratio       0.3       0.6
          Gender         Male    Female
         Head size      Large     Small
          Country        India    India
Given the historical data
Which one would you bet on….now?
                        Kalyan   Chethak
       Length of legs   110 cm    150 cm
           Breed          C         A
         T/L ratio       0.45      0.60
          Gender         Male    Female
         Head size      Small     Large
          Country       Africa    India
Given the historical data
 What about best one in this lot?

            Horse-1   Horse-2   Horse-3 Horse-4 Horse-5 Horse-6 Horse-7 Horse-8 Horse-9 Horse-10
Length of
              109       114      134      130     149      120     104      117     115     135
   legs
  Breed        C        A         B       A        F        K       L        B       C       A
T/L ratio     0.1       0.8       0.5     1.0     0.3      0.3     0.3      0.6     0.7     0.9
 Gender      Male     Female     Male   Female   Male     Female   Male    Female   Male   Female
Head size      L        S         M       M        L        L       S        M       L       M
Country      Africa    India     Aus      NZ     Africa   Africa   India    India   Aus    Africa
Citi has a similar problem?
Who is going to run away with my money?
• Given Historical of the customers we want to predict the probability of bad
• We have the data of each customer on
   •   Customer previous loans, customer previous payments, length of account credit
       history, other credit cards and loans, job type, income band etc.,
• We want to predict the probability of default
Credit Risk Model Building
   Four main steps


1. Study historical
data
                                                3. Find the exact
• What are the          2. Identify the most                              4. Use these
  causes(Customer                                impact of each
                         impacting factors                           coefficients for future
  Characteristics)                             factor(Quantify it)
• What are the
  effects(Charge off)
The Historical Data of Customers
                                                         Attribute                               Value

•   Contains all the information about customers
                                                         SSN                                     111259005
                                                         Age                                     27
•   Contains information across more than 500
    variables                                            Number of dependents                    2

•   Portion of data is present in the application form   Number of current loans                 1

•   Portion of the data is available with bank
                                                         Number of credit cards                  1
                                                         Number Installments 30days late in      4
•   Lot of data is maintained in bureau                  last 2 years
     •   Social Security number –in US                   Average utilization % in last 2 years   30%
     •   PAN Number in India                             Time since accounts opened              60 months
                                                         Number of previous applications for     2
                                                         credit card
                                                         Bankrupt                                NO
Identifying most impacting factors
Out of 500 what are those 20 important attributes
Model Building
logistic regression model to predict the probability of default

• Probability of bad = w1(Var1)+w2(Var2) +w3(Var1)……+w20(var20)
• Logistic regression gives us those weights
• Predicting the probability
   •   Probability of bad=0.13(number of cards)+ 0.21 (utilization)+…….+0.06(number of loan
       applications)
• That’s it ….we are done
Credit Risk Model Building
Real-time Example
Attributes used on the model
1. MonthlyIncome
2. Number of loans
3. Number of times 30days late in last 2 years
4. Utilization in last 2 years
5. Age
6. DebtRatio – Monthly Debt / Monthly Income
‘Gupta Bank’ Credit Cards Approval

• Lets use above model for gupta bank
• Who are the applicants here?
• Lets get the bureau information for the applicants
Actual model Building Steps
 Objective &     Customer score                   Observation
  Portfolio        vs. account     Exclusions     point and/or
identification        score                         window



Performance                                      Validation plan
                 Bad definition   Segmentation
  window                                          and samples




  Variable                         Scoring and        Score
                 Model Building
  Selection                         Validation   Implementation
Marketing Example
Predicting the response probability to a marketing Campaign

•   Selling Mobile phones – Marketing campaign
•   Who should we target?
     •   Consider historical data of mobile phone buyers
     •   See their characteristics
     •   Find top impacting characteristics
     •   Find weight of each characteristic
     •   Score new population
     •   Decide on the cut off
     •   Try to sell people who score more than cut off
Other Applications of Model Building


• Fraud transactions scorecard – Fraud identification based on attributes like
  transaction amount, place, time, frequency of transactions etc.,

• Attrition modeling – Predicting employee attrition based on their characteristics
Thank You
Venkat Reddy
21.venkat@gmail.com

Contenu connexe

Tendances

Case Study: Loan default prediction
Case Study: Loan default predictionCase Study: Loan default prediction
Case Study: Loan default predictionALTEN Calsoft Labs
 
Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsSagar Tupkar
 
Loan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningLoan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningAlibaba Cloud
 
Default Probability Prediction using Artificial Neural Networks in R Programming
Default Probability Prediction using Artificial Neural Networks in R ProgrammingDefault Probability Prediction using Artificial Neural Networks in R Programming
Default Probability Prediction using Artificial Neural Networks in R ProgrammingVineet Ojha
 
Kaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewKaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewAdam Pah
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskQuantUniversity
 
Exploratory Data Analysis Bank Fraud Case Study
Exploratory  Data Analysis Bank Fraud Case StudyExploratory  Data Analysis Bank Fraud Case Study
Exploratory Data Analysis Bank Fraud Case StudyLumbiniSardare
 
Analytics in banking services
Analytics in banking servicesAnalytics in banking services
Analytics in banking servicesMariyageorge
 
Exploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentExploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentVishalPatil527
 
Loan default prediction with machine language
Loan  default  prediction with  machine  language Loan  default  prediction with  machine  language
Loan default prediction with machine language Aayush Kumar
 
Credit Scoring
Credit ScoringCredit Scoring
Credit ScoringMABSIV
 
Taiwanese Credit Card Client Fraud detection
Taiwanese Credit Card Client Fraud detectionTaiwanese Credit Card Client Fraud detection
Taiwanese Credit Card Client Fraud detectionRavi Gupta
 
Detecting fraud with Python and machine learning
Detecting fraud with Python and machine learningDetecting fraud with Python and machine learning
Detecting fraud with Python and machine learningwgyn
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessSanjay Kar
 
Final capstone powerpoint
Final capstone powerpointFinal capstone powerpoint
Final capstone powerpointCaroline Nguma
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banksPankaj Baid
 
Credit default risk
Credit default riskCredit default risk
Credit default riskchs71
 

Tendances (20)

Case Study: Loan default prediction
Case Study: Loan default predictionCase Study: Loan default prediction
Case Study: Loan default prediction
 
Predicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning AlgorithmsPredicting Credit Card Defaults using Machine Learning Algorithms
Predicting Credit Card Defaults using Machine Learning Algorithms
 
Loan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningLoan Default Prediction with Machine Learning
Loan Default Prediction with Machine Learning
 
Default Probability Prediction using Artificial Neural Networks in R Programming
Default Probability Prediction using Artificial Neural Networks in R ProgrammingDefault Probability Prediction using Artificial Neural Networks in R Programming
Default Probability Prediction using Artificial Neural Networks in R Programming
 
Kaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overviewKaggle "Give me some credit" challenge overview
Kaggle "Give me some credit" challenge overview
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit Risk
 
Exploratory Data Analysis Bank Fraud Case Study
Exploratory  Data Analysis Bank Fraud Case StudyExploratory  Data Analysis Bank Fraud Case Study
Exploratory Data Analysis Bank Fraud Case Study
 
Credit scoring
Credit scoringCredit scoring
Credit scoring
 
Analytics in banking services
Analytics in banking servicesAnalytics in banking services
Analytics in banking services
 
Exploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk AssesmentExploratory Data Analysis For Credit Risk Assesment
Exploratory Data Analysis For Credit Risk Assesment
 
Loan default prediction with machine language
Loan  default  prediction with  machine  language Loan  default  prediction with  machine  language
Loan default prediction with machine language
 
Credit Scoring
Credit ScoringCredit Scoring
Credit Scoring
 
Taiwanese Credit Card Client Fraud detection
Taiwanese Credit Card Client Fraud detectionTaiwanese Credit Card Client Fraud detection
Taiwanese Credit Card Client Fraud detection
 
Detecting fraud with Python and machine learning
Detecting fraud with Python and machine learningDetecting fraud with Python and machine learning
Detecting fraud with Python and machine learning
 
Telecom Churn Prediction
Telecom Churn PredictionTelecom Churn Prediction
Telecom Churn Prediction
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending Business
 
Final capstone powerpoint
Final capstone powerpointFinal capstone powerpoint
Final capstone powerpoint
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
 
Credit eda case study
Credit eda case studyCredit eda case study
Credit eda case study
 
Credit default risk
Credit default riskCredit default risk
Credit default risk
 

En vedette

Report (istanbul stock exchange and resistance)
Report (istanbul stock exchange and resistance)Report (istanbul stock exchange and resistance)
Report (istanbul stock exchange and resistance)Taishi Torii
 
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 Presentation
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 PresentationBorsa Istanbul Derivatives Market (VIOP) Q1 2015 Presentation
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 PresentationBorsa İstanbul
 
Exploratory Data Analysis for Energy Efficiency
Exploratory Data Analysis for Energy EfficiencyExploratory Data Analysis for Energy Efficiency
Exploratory Data Analysis for Energy EfficiencyNitin Agarwal
 
Sas credit scorecards
Sas credit scorecardsSas credit scorecards
Sas credit scorecardsTEMPLA73
 
Delopment and testing of a credit scoring model
Delopment and testing of a credit scoring modelDelopment and testing of a credit scoring model
Delopment and testing of a credit scoring modelMattia Ciprian
 

En vedette (6)

Report (istanbul stock exchange and resistance)
Report (istanbul stock exchange and resistance)Report (istanbul stock exchange and resistance)
Report (istanbul stock exchange and resistance)
 
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 Presentation
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 PresentationBorsa Istanbul Derivatives Market (VIOP) Q1 2015 Presentation
Borsa Istanbul Derivatives Market (VIOP) Q1 2015 Presentation
 
Exploratory Data Analysis for Energy Efficiency
Exploratory Data Analysis for Energy EfficiencyExploratory Data Analysis for Energy Efficiency
Exploratory Data Analysis for Energy Efficiency
 
Energy Efficiency
Energy EfficiencyEnergy Efficiency
Energy Efficiency
 
Sas credit scorecards
Sas credit scorecardsSas credit scorecards
Sas credit scorecards
 
Delopment and testing of a credit scoring model
Delopment and testing of a credit scoring modelDelopment and testing of a credit scoring model
Delopment and testing of a credit scoring model
 

Similaire à Model building in credit card and loan approval

Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1Venkata Reddy Konasani
 
Credit Score Basics-04-17
Credit Score Basics-04-17Credit Score Basics-04-17
Credit Score Basics-04-17Barbara O'Neill
 
Credit Slideshow
Credit SlideshowCredit Slideshow
Credit SlideshowElie Gammal
 
Data Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingData Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingChristopher Conlan
 
Credit Scores-The Basics
Credit Scores-The BasicsCredit Scores-The Basics
Credit Scores-The BasicsBarbara O'Neill
 
Regulatory Hot Topics 2012
Regulatory Hot Topics 2012Regulatory Hot Topics 2012
Regulatory Hot Topics 2012Scott Sharp
 
Trends in Alternative Financial Services
Trends in Alternative Financial Services Trends in Alternative Financial Services
Trends in Alternative Financial Services Experian
 
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersHow Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersBrian Griffith
 
A New Approach to Consumer Credit
A New Approach to Consumer CreditA New Approach to Consumer Credit
A New Approach to Consumer CreditRabindran Abraham
 
Personal Loan Risk Assessment
Personal Loan Risk Assessment Personal Loan Risk Assessment
Personal Loan Risk Assessment Kunal Kashyap
 
Compressed Credit workshop SF final
Compressed Credit workshop SF  finalCompressed Credit workshop SF  final
Compressed Credit workshop SF finalUpendra Avadhani
 
ROLE OF credit score WHILE SanctionING LOAN .pptx
ROLE OF credit score  WHILE SanctionING LOAN .pptxROLE OF credit score  WHILE SanctionING LOAN .pptx
ROLE OF credit score WHILE SanctionING LOAN .pptxrekhabawa2
 
Data Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationData Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationVenkata Reddy Konasani
 
IRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET- Prediction of Credit Risks in Lending Bank LoansIRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET- Prediction of Credit Risks in Lending Bank LoansIRJET Journal
 
Estimating Supply and Demand for Microcredit
Estimating Supply and Demand for MicrocreditEstimating Supply and Demand for Microcredit
Estimating Supply and Demand for MicrocreditFriedman Associates
 
Business analytics in banking sector
Business analytics in banking sectorBusiness analytics in banking sector
Business analytics in banking sectorVikhilSonna
 
FactorTrust NABD Orlando2016
FactorTrust NABD Orlando2016FactorTrust NABD Orlando2016
FactorTrust NABD Orlando2016Scott Brackin
 

Similaire à Model building in credit card and loan approval (20)

Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1
 
Credit Score Basics-04-17
Credit Score Basics-04-17Credit Score Basics-04-17
Credit Score Basics-04-17
 
Credit Slideshow
Credit SlideshowCredit Slideshow
Credit Slideshow
 
Data Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingData Science Applications in Finance and Investing
Data Science Applications in Finance and Investing
 
Credit Scores-The Basics
Credit Scores-The BasicsCredit Scores-The Basics
Credit Scores-The Basics
 
Group 1 p53
Group 1 p53Group 1 p53
Group 1 p53
 
Regulatory Hot Topics 2012
Regulatory Hot Topics 2012Regulatory Hot Topics 2012
Regulatory Hot Topics 2012
 
Trends in Alternative Financial Services
Trends in Alternative Financial Services Trends in Alternative Financial Services
Trends in Alternative Financial Services
 
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its CustomersHow Eastern Bank Uses Big Data to Better Serve and Protect its Customers
How Eastern Bank Uses Big Data to Better Serve and Protect its Customers
 
Credit appraisal SYSTEM
Credit appraisal SYSTEMCredit appraisal SYSTEM
Credit appraisal SYSTEM
 
A New Approach to Consumer Credit
A New Approach to Consumer CreditA New Approach to Consumer Credit
A New Approach to Consumer Credit
 
Personal Loan Risk Assessment
Personal Loan Risk Assessment Personal Loan Risk Assessment
Personal Loan Risk Assessment
 
Compressed Credit workshop SF final
Compressed Credit workshop SF  finalCompressed Credit workshop SF  final
Compressed Credit workshop SF final
 
ROLE OF credit score WHILE SanctionING LOAN .pptx
ROLE OF credit score  WHILE SanctionING LOAN .pptxROLE OF credit score  WHILE SanctionING LOAN .pptx
ROLE OF credit score WHILE SanctionING LOAN .pptx
 
Data Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationData Exploration, Validation and Sanitization
Data Exploration, Validation and Sanitization
 
IRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET- Prediction of Credit Risks in Lending Bank LoansIRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET- Prediction of Credit Risks in Lending Bank Loans
 
Estimating Supply and Demand for Microcredit
Estimating Supply and Demand for MicrocreditEstimating Supply and Demand for Microcredit
Estimating Supply and Demand for Microcredit
 
Business analytics in banking sector
Business analytics in banking sectorBusiness analytics in banking sector
Business analytics in banking sector
 
AI for HRM
AI for HRMAI for HRM
AI for HRM
 
FactorTrust NABD Orlando2016
FactorTrust NABD Orlando2016FactorTrust NABD Orlando2016
FactorTrust NABD Orlando2016
 

Plus de Venkata Reddy Konasani

Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Venkata Reddy Konasani
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniquesVenkata Reddy Konasani
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Venkata Reddy Konasani
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced featuresVenkata Reddy Konasani
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitizationVenkata Reddy Konasani
 

Plus de Venkata Reddy Konasani (20)

Transformers 101
Transformers 101 Transformers 101
Transformers 101
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
 
Neural Network Part-2
Neural Network Part-2Neural Network Part-2
Neural Network Part-2
 
GBM theory code and parameters
GBM theory code and parametersGBM theory code and parameters
GBM theory code and parameters
 
Neural Networks made easy
Neural Networks made easyNeural Networks made easy
Neural Networks made easy
 
Decision tree
Decision treeDecision tree
Decision tree
 
Step By Step Guide to Learn R
Step By Step Guide to Learn RStep By Step Guide to Learn R
Step By Step Guide to Learn R
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
 
SAS basics Step by step learning
SAS basics Step by step learningSAS basics Step by step learning
SAS basics Step by step learning
 
Testing of hypothesis case study
Testing of hypothesis case study Testing of hypothesis case study
Testing of hypothesis case study
 
L101 predictive modeling case_study
L101 predictive modeling case_studyL101 predictive modeling case_study
L101 predictive modeling case_study
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Online data sources for analaysis
Online data sources for analaysis Online data sources for analaysis
Online data sources for analaysis
 
A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
R- Introduction
R- IntroductionR- Introduction
R- Introduction
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitization
 
ARIMA
ARIMA ARIMA
ARIMA
 

Dernier

Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 

Dernier (20)

Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 

Model building in credit card and loan approval

  • 1. Credit Risk Model Building Venkat Reddy
  • 2. Note • This presentation is just the lecture notes for the talk on Credit Risk Model building conducted for MBA students • The best way to treat this is as a high-level summary; the actual session went more in depth and contained other information. • Most of this material was written as informal notes, not intended for publication • Please send your questions/comments/corrections to venkat@trenwiseanalytics.com or 21.venkat@gmail.com • Please check my website for latest version of this document -Venkat Reddy
  • 3. Contents • Applications of Statistics in Business • The Model Building Problem • Credit Risk Model Building • Other Applications of Model Building
  • 4. Applications of Statistics in Business Increasingly, business rely on intelligent tools and techniques to analyze data systematically to improve decision-making.  Retail sales analytics  Financial services analytics  Telecommunications  Supply Chain analytics  Transportation analytics  Risk & Credit analytics  Talent analytics  Marketing analytics  Behavioral analytics  Collections analytics  Fraud analytics  Pricing analytics
  • 5. The Problem Who will run away with my money? • Citi Bank : Present in more than 90 countries. • More than 100,000 customers apply for credit cards/loans every month • All of them have different characteristics • Out of 10o,000 customers, who all have the higher probability of default/ Charge off? • Basically, who will run away with my money? • We need to predict the probability of “Running away” • Who all have ‘Gupta Bank’ credit card applications in this room?
  • 6. Bank builds a model that gives a score to each customer Applicants “Developing set of equations or mathematical formulation to forecast future behaviors based on current or historical data.”
  • 7. Predictive Modeling Lets try to understand predictive modeling Predictive Modeling – Fitting a model to the data to predict the future. • Predicting the future –and it is so easy some times • Who is going to score more runs in IPL-2013? • That’s it you predicted the future.. • BTW how did you predict? • Predicting the future based on historical data is nothing but Predictive modeling
  • 8. Predictive Modeling Lets try to understand predictive modeling Predictive Modeling – Fitting a model to the data to predict the future. • Who is going to score more runs in IPL 2013? • Predicting the future …well it is not that easy …
  • 9. Predictive Modeling Horse Race Analogy How to bet on best horse in a horse race
  • 10. The Historical Data Win vs. Loss record in past 2 years • Long legs: 75% (Horses with long legs won 75% of the times) • Breed A: 55%, Breed B: 15 % Others : 30% • T/L (Tummy to length) ratio <1/2 :75 % • Gender: Male -68% • Head size: Small 10%, Medium 15% Large 75% • Country: Africa -65%
  • 11. Given the historical data Which one of these two horses would you bet on? Kalyan Chethak Length of legs 150 cm 110 cm Breed A F T/L ratio 0.3 0.6 Gender Male Female Head size Large Small Country India India
  • 12. Given the historical data Which one would you bet on….now? Kalyan Chethak Length of legs 110 cm 150 cm Breed C A T/L ratio 0.45 0.60 Gender Male Female Head size Small Large Country Africa India
  • 13. Given the historical data What about best one in this lot? Horse-1 Horse-2 Horse-3 Horse-4 Horse-5 Horse-6 Horse-7 Horse-8 Horse-9 Horse-10 Length of 109 114 134 130 149 120 104 117 115 135 legs Breed C A B A F K L B C A T/L ratio 0.1 0.8 0.5 1.0 0.3 0.3 0.3 0.6 0.7 0.9 Gender Male Female Male Female Male Female Male Female Male Female Head size L S M M L L S M L M Country Africa India Aus NZ Africa Africa India India Aus Africa
  • 14. Citi has a similar problem? Who is going to run away with my money? • Given Historical of the customers we want to predict the probability of bad • We have the data of each customer on • Customer previous loans, customer previous payments, length of account credit history, other credit cards and loans, job type, income band etc., • We want to predict the probability of default
  • 15. Credit Risk Model Building Four main steps 1. Study historical data 3. Find the exact • What are the 2. Identify the most 4. Use these causes(Customer impact of each impacting factors coefficients for future Characteristics) factor(Quantify it) • What are the effects(Charge off)
  • 16. The Historical Data of Customers Attribute Value • Contains all the information about customers SSN 111259005 Age 27 • Contains information across more than 500 variables Number of dependents 2 • Portion of data is present in the application form Number of current loans 1 • Portion of the data is available with bank Number of credit cards 1 Number Installments 30days late in 4 • Lot of data is maintained in bureau last 2 years • Social Security number –in US Average utilization % in last 2 years 30% • PAN Number in India Time since accounts opened 60 months Number of previous applications for 2 credit card Bankrupt NO
  • 17. Identifying most impacting factors Out of 500 what are those 20 important attributes
  • 18. Model Building logistic regression model to predict the probability of default • Probability of bad = w1(Var1)+w2(Var2) +w3(Var1)……+w20(var20) • Logistic regression gives us those weights • Predicting the probability • Probability of bad=0.13(number of cards)+ 0.21 (utilization)+…….+0.06(number of loan applications) • That’s it ….we are done
  • 19. Credit Risk Model Building Real-time Example Attributes used on the model 1. MonthlyIncome 2. Number of loans 3. Number of times 30days late in last 2 years 4. Utilization in last 2 years 5. Age 6. DebtRatio – Monthly Debt / Monthly Income
  • 20. ‘Gupta Bank’ Credit Cards Approval • Lets use above model for gupta bank • Who are the applicants here? • Lets get the bureau information for the applicants
  • 21. Actual model Building Steps Objective & Customer score Observation Portfolio vs. account Exclusions point and/or identification score window Performance Validation plan Bad definition Segmentation window and samples Variable Scoring and Score Model Building Selection Validation Implementation
  • 22. Marketing Example Predicting the response probability to a marketing Campaign • Selling Mobile phones – Marketing campaign • Who should we target? • Consider historical data of mobile phone buyers • See their characteristics • Find top impacting characteristics • Find weight of each characteristic • Score new population • Decide on the cut off • Try to sell people who score more than cut off
  • 23. Other Applications of Model Building • Fraud transactions scorecard – Fraud identification based on attributes like transaction amount, place, time, frequency of transactions etc., • Attrition modeling – Predicting employee attrition based on their characteristics