SlideShare une entreprise Scribd logo
1  sur  15
Reduction in Customer Complaints –
Mortgage Servicing Industry
By prediction of customers likely to complaint & taking
proactive steps for prevention of the same.
Original Project Partners
Pranov Mishra
Aniket Chhabra
Vivek Chandel
Madhu Gollpudi
Codes and cleaned dataset can be
found in my github account.
Link
(https://github.com/Pranov1984/Great-Lakes-Capstone-Project)
Executive Summary
Project Overview:
The project aims at analysis of Customer Complaints/Inquiries received by a US based mortgage (loan)
servicing company . The scope of the project is limited to complaints received with respect to the part of the servicing
life cycle that is related to Escrow Analysis and other related or subsidiary activities.
Goal Statement
Identification of major contributors towards complaints/inquiries. Utilization of the identified significant
contributors and coming up with recommendations for changes/new implementations with the below goals
▫ Reducing Re-work
▫ Reducing Operational Cost
▫ Improve Customer Satisfaction
▫ Improve company preparedness to respond to customers
Data Considered
A few months of data of standard servicing loans of the organisation which comprises of circa 154,000
records was used for data exploration, visualization and hypothesis generation. The data had a lot of missing
values and the missing values were typically when the event corresponding to the variable concerned) did not
occur for that observation. The data was cleaned by creating dummy variables with no missing values.
The code and the dataset provide in the github link constitutes the cleaned data.
Executive Summary Continued ...
• Escalations typically lead to extra work, reputational damage, sometimes regulatory scrutiny and penalties.
Preventing customer complaints and escalations is in the best interest of the company.
• The data was highly imbalanced (Majority class: Minority Class = 96%:4%) and hence appropriate model evaluation
metric was required to be chosen. A combination of Harmonic mean (F1 score) and Area Under the Curve (AUC)
was used to finalize the best model.
• Models tried to arrive at the best are
 Simple Model like Logistic Regression with different thresholds for classification
 Random Forest after balancing the dataset using Synthetic Minority Oversampling Technique (SMOTE)
 Stochastic Gradient Boosting technique after balancing the dataset as was the case with random forest
• The key insights derived from the model with best results, indicate that the variables that significantly impact
customer behaviour can be broadly classified as below:
 Waiver of escrow payments which could arise due to incorrect escrow analysis conducted resulting in customer
requesting for waiver of extra charges levied.
 Presence of “Initials” which comes into play when a customer is escrowed for the first time. So the customer
could be escalating either because he is incorrectly escrowed or the initial payment calculation for the escrow
services is incorrect.
 Process of handling force escrowed loans have been inadequate leading to customer queries and complaints.
• A Gains chart was prepared which gave a cumulative lift of 133% in the first 4 deciles. The customers with highest
probability of making a complaint were identified. The company could use this information to proactively review the
operations performed on the customer’s account and correct any errors if found.
Brief Overview of Escrow Account
Escrow:
Money held by a third-party on behalf of transacting parties
Escrow account:
An escrow account is established with a lender to pay for recurring
expenses related to ones property, such as real estate taxes and
homeowner’s insurance.
It helps borrower to anticipate and manage payment of these expenses
by including these expenses as a portion of monthly mortgage payment.
How does an escrow account work?
• At the time one establishes an escrow account, the customer’s
annual real estate taxes and homeowner’s insurance are estimated,
based on the customer’s most recent bills and premiums.
• An incremental amount of these expenses is added to the
customers’ monthly mortgage payment, in order to cover these
expenses when they are due.
• Each year, this escrow account is reviewed to determine if the
amount being escrowed each month is sufficient to pay for any
change in your real estate taxes or homeowner’s insurance
premiums.
• Incase a non escrowed customer defaults on payment of taxes and
insurance, the lender advances payments to the respective agencies
to protect the rights on the property. The lender then force escrows
the delinquent customers to recover the money.
• Missing value
treatment
• Outlier treatment
• Removing
inconsistencies
Data Treatment
• Review of each
variable and
transforming the
appropriate
variables
Exploratory Data
Analysis
• Event rate is highly
skewed in favor of
“No Complaints.
• SMOTE* used to
balance the data.
Balancing Data set
• Data was split in
70:30 ratio.
• Models were
trained on 70%
data & validated on
30%.
Data Partition
Analytical framework used to prepare the model
* Synthetic Minority
oversampling Technique
Model Building
Logistic Regression
Random Forest
Stochastic Gradient Boosting
Validation – Evaluation Metrics
Accuracy
Sensitivity(TPR) & Specificity(TNR)
Area under the Curve (AUC)
Data Visualization & Data Preparation
Both the graphs give the impression that the distribution is that of a factor variable. They look like
variables which should be a factor variable with most of the data crowding around zero and the
remaining few crowding around one. No data points in between zero and one. Variables transformed to
factor.
Data Visualization & Data Preparation
Both the graphs give the impression that the distribution is that of a factor variable. They look like
variables which should be a factor variable with most of the data crowding around zero and the
remaining few crowding around one. No data points in between zero and one. Variables transformed to
factor. Additionally looks like presence of Waiver is associated with complaints received
Data Visualization & Data Preparation
Both the graphs give the impression that the distribution is that of a factor variable. They look like
variables which should be a factor variable with most of the data crowding around zero and the
remaining few crowding around one. No data points in between zero and one. Variables transformed to
factor. Additionally, presence of Reversed seems to be associated with complaints received in a bigger
way.
Data Visualization & Data Preparation
Looks like a variable which should be numeric but with a high number of outliers. Most of the values are small with
a few very high values. Close to 17% of observations are outliers. Upon further analysis by deciling it, Eight deciles
have min and max as zero which constitute 80% of the data. Complaints received only in the 9th and 10th deciles
when there are surpluses greater than 0. Essentially there is a very high chance of a complaint/query when there is
a surplus. This indicates that the customer thinks that the analysis is incorrect or the surplus is not being returned
on time by the company. Variable converted to factor.
summary(mydata$Surplus)
Min. 1st Qu. Median Mean 3rd Qu. Max. 0.0 0.0 0.0 75.7 0.0 452213.2
Data Visualization & Data Preparation
Shortage Spread looks like a continuous variable but had outliers. The outliers were treated by compressing the
extreme values to between 0 and 85 percentile of the actual values.
Anytime there is a shortage, there is a higher chance of a complaint.
Data Visualization & Data Preparation
With highly skewed numbers in favor of NonBK analysis a separate analysis only on NONBK analysis loans could be
contemplated.
Random Forest models give the best results. Tuning of the parameters i.e. Mtry (no. of variables used while
training models on bootstrapped samples) improved the results marginally. The gradient boosting results were
also nearly as good (marginally less) as the results from random forest. The important point to note though is
that the tree based models were trained on the data after they were balanced by using SMOTE (Synthetic
Minority Oversampling Technique). The logistic regression results were least impressive.
Lift Chart
A Cumulative lift of 133% is achieved by use of the lift chart in the first four deciles. This means by choosing
40% of the total customers, with the aid of the model and the associated gains chart, we can identify more
than 50% of the customers who are likely to complain. Without the model we would have probably identified
20% of the potential complaining customers.
Variable Importance using Random Forest model which gave the best results

Contenu connexe

Tendances

Data Science Interview Questions | Data Science Interview Questions And Answe...
Data Science Interview Questions | Data Science Interview Questions And Answe...Data Science Interview Questions | Data Science Interview Questions And Answe...
Data Science Interview Questions | Data Science Interview Questions And Answe...
Simplilearn
 
A case study on churn analysis1
A case study on churn analysis1A case study on churn analysis1
A case study on churn analysis1
Amit Kumar
 

Tendances (20)

Data Visualization1.pptx
Data Visualization1.pptxData Visualization1.pptx
Data Visualization1.pptx
 
Module 1
Module  1Module  1
Module 1
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data Visualization With Tableau | Edureka
Data Visualization With Tableau | EdurekaData Visualization With Tableau | Edureka
Data Visualization With Tableau | Edureka
 
Data visualization
Data visualizationData visualization
Data visualization
 
Data Analytics PowerPoint Presentation Slides
Data Analytics PowerPoint Presentation SlidesData Analytics PowerPoint Presentation Slides
Data Analytics PowerPoint Presentation Slides
 
Best Practices for Killer Data Visualization
Best Practices for Killer Data VisualizationBest Practices for Killer Data Visualization
Best Practices for Killer Data Visualization
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
 
DAX and Power BI Training - 001 Overview
DAX and Power BI Training -  001 OverviewDAX and Power BI Training -  001 Overview
DAX and Power BI Training - 001 Overview
 
Big data visualization
Big data visualizationBig data visualization
Big data visualization
 
Tableau Presentation
Tableau PresentationTableau Presentation
Tableau Presentation
 
Data Warehouse Concepts and Architecture
Data Warehouse Concepts and ArchitectureData Warehouse Concepts and Architecture
Data Warehouse Concepts and Architecture
 
The Data Science Process
The Data Science ProcessThe Data Science Process
The Data Science Process
 
Data Science Interview Questions | Data Science Interview Questions And Answe...
Data Science Interview Questions | Data Science Interview Questions And Answe...Data Science Interview Questions | Data Science Interview Questions And Answe...
Data Science Interview Questions | Data Science Interview Questions And Answe...
 
Data Visualization Techniques in Power BI
Data Visualization Techniques in Power BIData Visualization Techniques in Power BI
Data Visualization Techniques in Power BI
 
A case study on churn analysis1
A case study on churn analysis1A case study on churn analysis1
A case study on churn analysis1
 

Similaire à Reduction in customer complaints - Mortgage Industry

Creditscore
CreditscoreCreditscore
Creditscore
kevinlan
 
Predictive analytics-white-paper
Predictive analytics-white-paperPredictive analytics-white-paper
Predictive analytics-white-paper
Shubhashish Biswas
 
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptxModule_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
HarshitGoel87
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
Eric Esajian
 
CollectionOptimization
CollectionOptimizationCollectionOptimization
CollectionOptimization
Mike Nguyen
 

Similaire à Reduction in customer complaints - Mortgage Industry (20)

Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
Creditscore
CreditscoreCreditscore
Creditscore
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Case Study: It’s All About Data – And the Customer
Case Study: It’s All About Data – And the CustomerCase Study: It’s All About Data – And the Customer
Case Study: It’s All About Data – And the Customer
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Predictive analytics-white-paper
Predictive analytics-white-paperPredictive analytics-white-paper
Predictive analytics-white-paper
 
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
7. Plan, perform, and evaluate samples for substantive procedures IPPTChap009...
 
Leveraging Data Analysis for Sales
Leveraging Data Analysis for SalesLeveraging Data Analysis for Sales
Leveraging Data Analysis for Sales
 
statistical measurement project presentation
statistical measurement project presentationstatistical measurement project presentation
statistical measurement project presentation
 
Bank churn with Data Science
Bank churn with Data ScienceBank churn with Data Science
Bank churn with Data Science
 
Telecom customer churn prediction
Telecom customer churn predictionTelecom customer churn prediction
Telecom customer churn prediction
 
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptxModule_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
Module_6_-_Datamining_tasks_and_tools_uGuVaDv4iv-2.pptx
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
 
Final Report
Final ReportFinal Report
Final Report
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 
201406 IASA: Analytics Maturity - Unlocking The Business Impact
201406 IASA: Analytics Maturity - Unlocking The Business Impact201406 IASA: Analytics Maturity - Unlocking The Business Impact
201406 IASA: Analytics Maturity - Unlocking The Business Impact
 
Essay On Stamford International Inc
Essay On Stamford International IncEssay On Stamford International Inc
Essay On Stamford International Inc
 
CollectionOptimization
CollectionOptimizationCollectionOptimization
CollectionOptimization
 
Data mining and analysis of customer churn dataset
Data mining and analysis of customer churn datasetData mining and analysis of customer churn dataset
Data mining and analysis of customer churn dataset
 

Plus de Pranov Mishra

Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics SolutionSales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Pranov Mishra
 
Recommendations for Preventive Maintenance - A Machine Learning Project
Recommendations for Preventive Maintenance - A Machine Learning ProjectRecommendations for Preventive Maintenance - A Machine Learning Project
Recommendations for Preventive Maintenance - A Machine Learning Project
Pranov Mishra
 

Plus de Pranov Mishra (6)

Automation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep LearningAutomation of IT Ticket Automation using NLP and Deep Learning
Automation of IT Ticket Automation using NLP and Deep Learning
 
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics SolutionSales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
Sales Performance Deep Dive and Forecast: A ML Driven Analytics Solution
 
Prediction of potential customers for term deposit
Prediction of potential customers for term depositPrediction of potential customers for term deposit
Prediction of potential customers for term deposit
 
Prediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom IndustryPrediction of customer propensity to churn - Telecom Industry
Prediction of customer propensity to churn - Telecom Industry
 
Impact of Macro-Economic Factors on Customer Behaviour in the US Insurance In...
Impact of Macro-Economic Factors on Customer Behaviour in the US Insurance In...Impact of Macro-Economic Factors on Customer Behaviour in the US Insurance In...
Impact of Macro-Economic Factors on Customer Behaviour in the US Insurance In...
 
Recommendations for Preventive Maintenance - A Machine Learning Project
Recommendations for Preventive Maintenance - A Machine Learning ProjectRecommendations for Preventive Maintenance - A Machine Learning Project
Recommendations for Preventive Maintenance - A Machine Learning Project
 

Dernier

Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 

Dernier (20)

Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 

Reduction in customer complaints - Mortgage Industry

  • 1. Reduction in Customer Complaints – Mortgage Servicing Industry By prediction of customers likely to complaint & taking proactive steps for prevention of the same. Original Project Partners Pranov Mishra Aniket Chhabra Vivek Chandel Madhu Gollpudi Codes and cleaned dataset can be found in my github account. Link (https://github.com/Pranov1984/Great-Lakes-Capstone-Project)
  • 2. Executive Summary Project Overview: The project aims at analysis of Customer Complaints/Inquiries received by a US based mortgage (loan) servicing company . The scope of the project is limited to complaints received with respect to the part of the servicing life cycle that is related to Escrow Analysis and other related or subsidiary activities. Goal Statement Identification of major contributors towards complaints/inquiries. Utilization of the identified significant contributors and coming up with recommendations for changes/new implementations with the below goals ▫ Reducing Re-work ▫ Reducing Operational Cost ▫ Improve Customer Satisfaction ▫ Improve company preparedness to respond to customers Data Considered A few months of data of standard servicing loans of the organisation which comprises of circa 154,000 records was used for data exploration, visualization and hypothesis generation. The data had a lot of missing values and the missing values were typically when the event corresponding to the variable concerned) did not occur for that observation. The data was cleaned by creating dummy variables with no missing values. The code and the dataset provide in the github link constitutes the cleaned data.
  • 3. Executive Summary Continued ... • Escalations typically lead to extra work, reputational damage, sometimes regulatory scrutiny and penalties. Preventing customer complaints and escalations is in the best interest of the company. • The data was highly imbalanced (Majority class: Minority Class = 96%:4%) and hence appropriate model evaluation metric was required to be chosen. A combination of Harmonic mean (F1 score) and Area Under the Curve (AUC) was used to finalize the best model. • Models tried to arrive at the best are  Simple Model like Logistic Regression with different thresholds for classification  Random Forest after balancing the dataset using Synthetic Minority Oversampling Technique (SMOTE)  Stochastic Gradient Boosting technique after balancing the dataset as was the case with random forest • The key insights derived from the model with best results, indicate that the variables that significantly impact customer behaviour can be broadly classified as below:  Waiver of escrow payments which could arise due to incorrect escrow analysis conducted resulting in customer requesting for waiver of extra charges levied.  Presence of “Initials” which comes into play when a customer is escrowed for the first time. So the customer could be escalating either because he is incorrectly escrowed or the initial payment calculation for the escrow services is incorrect.  Process of handling force escrowed loans have been inadequate leading to customer queries and complaints. • A Gains chart was prepared which gave a cumulative lift of 133% in the first 4 deciles. The customers with highest probability of making a complaint were identified. The company could use this information to proactively review the operations performed on the customer’s account and correct any errors if found.
  • 4. Brief Overview of Escrow Account Escrow: Money held by a third-party on behalf of transacting parties Escrow account: An escrow account is established with a lender to pay for recurring expenses related to ones property, such as real estate taxes and homeowner’s insurance. It helps borrower to anticipate and manage payment of these expenses by including these expenses as a portion of monthly mortgage payment. How does an escrow account work? • At the time one establishes an escrow account, the customer’s annual real estate taxes and homeowner’s insurance are estimated, based on the customer’s most recent bills and premiums. • An incremental amount of these expenses is added to the customers’ monthly mortgage payment, in order to cover these expenses when they are due. • Each year, this escrow account is reviewed to determine if the amount being escrowed each month is sufficient to pay for any change in your real estate taxes or homeowner’s insurance premiums. • Incase a non escrowed customer defaults on payment of taxes and insurance, the lender advances payments to the respective agencies to protect the rights on the property. The lender then force escrows the delinquent customers to recover the money.
  • 5. • Missing value treatment • Outlier treatment • Removing inconsistencies Data Treatment • Review of each variable and transforming the appropriate variables Exploratory Data Analysis • Event rate is highly skewed in favor of “No Complaints. • SMOTE* used to balance the data. Balancing Data set • Data was split in 70:30 ratio. • Models were trained on 70% data & validated on 30%. Data Partition Analytical framework used to prepare the model * Synthetic Minority oversampling Technique Model Building Logistic Regression Random Forest Stochastic Gradient Boosting Validation – Evaluation Metrics Accuracy Sensitivity(TPR) & Specificity(TNR) Area under the Curve (AUC)
  • 6. Data Visualization & Data Preparation Both the graphs give the impression that the distribution is that of a factor variable. They look like variables which should be a factor variable with most of the data crowding around zero and the remaining few crowding around one. No data points in between zero and one. Variables transformed to factor.
  • 7. Data Visualization & Data Preparation Both the graphs give the impression that the distribution is that of a factor variable. They look like variables which should be a factor variable with most of the data crowding around zero and the remaining few crowding around one. No data points in between zero and one. Variables transformed to factor. Additionally looks like presence of Waiver is associated with complaints received
  • 8. Data Visualization & Data Preparation Both the graphs give the impression that the distribution is that of a factor variable. They look like variables which should be a factor variable with most of the data crowding around zero and the remaining few crowding around one. No data points in between zero and one. Variables transformed to factor. Additionally, presence of Reversed seems to be associated with complaints received in a bigger way.
  • 9. Data Visualization & Data Preparation Looks like a variable which should be numeric but with a high number of outliers. Most of the values are small with a few very high values. Close to 17% of observations are outliers. Upon further analysis by deciling it, Eight deciles have min and max as zero which constitute 80% of the data. Complaints received only in the 9th and 10th deciles when there are surpluses greater than 0. Essentially there is a very high chance of a complaint/query when there is a surplus. This indicates that the customer thinks that the analysis is incorrect or the surplus is not being returned on time by the company. Variable converted to factor. summary(mydata$Surplus) Min. 1st Qu. Median Mean 3rd Qu. Max. 0.0 0.0 0.0 75.7 0.0 452213.2
  • 10. Data Visualization & Data Preparation Shortage Spread looks like a continuous variable but had outliers. The outliers were treated by compressing the extreme values to between 0 and 85 percentile of the actual values. Anytime there is a shortage, there is a higher chance of a complaint.
  • 11. Data Visualization & Data Preparation With highly skewed numbers in favor of NonBK analysis a separate analysis only on NONBK analysis loans could be contemplated.
  • 12. Random Forest models give the best results. Tuning of the parameters i.e. Mtry (no. of variables used while training models on bootstrapped samples) improved the results marginally. The gradient boosting results were also nearly as good (marginally less) as the results from random forest. The important point to note though is that the tree based models were trained on the data after they were balanced by using SMOTE (Synthetic Minority Oversampling Technique). The logistic regression results were least impressive.
  • 13.
  • 14. Lift Chart A Cumulative lift of 133% is achieved by use of the lift chart in the first four deciles. This means by choosing 40% of the total customers, with the aid of the model and the associated gains chart, we can identify more than 50% of the customers who are likely to complain. Without the model we would have probably identified 20% of the potential complaining customers.
  • 15. Variable Importance using Random Forest model which gave the best results