SlideShare une entreprise Scribd logo
1  sur  49
Database	
  Marketing	
  and	
  CRM	
  –	
  
Analyzing	
  DONOR	
  data	
  set	
  	
  
Akanksha	
  Jain	
  
Project	
  Goals	
  
•  Goal:	
  Using	
  historical	
  data	
  set	
  DONOR_RAW,	
  develop	
  a	
  

model	
  which	
  can	
  predict	
  whether	
  the	
  prospect	
  will	
  
donate/	
  not	
  donate	
  

•  Scope:	
  DONOR_RAW	
  data	
  set	
  
•  50	
  Variables	
  
•  19,372	
  observaKons	
  
•  Dependent	
  Variable:	
  TARGET_B(Binary)	
  
•  Responder:	
  1	
  
•  Non-­‐Responder:	
  0	
  
TOOLS	
  
•  SAS	
  Enterprise	
  Miner	
  4.3	
  
•  SAS	
  9.3_M1	
  
Diagram 	
  	
  
Diagram	
  (con’t)	
  
Data	
  Source	
  
•  Reject	
  Variables:	
  
•  TARGET_D	
  (using	
  TARGET_B	
  as	
  target)	
  
•  ID	
  (an	
  id	
  number)	
  
•  WEALTH_RATING	
  (huge	
  no.	
  of	
  missing	
  values)	
  
•  Variable	
  TARGET_B	
  
•  Change	
  Role	
  to	
  TARGET	
  
•  Change	
  Order	
  to	
  DESCENDING	
  

•  Select	
  complete	
  data	
  set	
  as	
  Sample	
  
•  Set	
  Prior	
  ProbabiliKes	
  	
  
•  Responder:	
  0.05	
  
•  Non-­‐Responder:	
  0.95	
  
Data	
  Partition	
  
•  Train	
  –	
  60%	
  
•  Validate	
  –	
  25%	
  
•  Test	
  –	
  15%	
  
Variable	
  Transformation	
  
Taking	
  Log	
  TransformaKon	
  to	
  reduce	
  Skewness	
  
•  LIFETIME_GIFT_RANGE	
  
•  LIFETIME_MAX_GIFT_AMT	
  
•  LIFETIME_MIN_GIFT_AMT	
  
•  MOR_HIT_RATE	
  
•  FILE_AVG_GIFT	
  
•  LIFETIME_AVG_GIFT_AMT	
  
•  PCT_ATTRIBUTE1	
  
•  LAST_GIFT_AMT	
  
•  RECENT_AVG_GIFT_AMT	
  
	
  
Keep	
  all	
  variables,	
  original	
  and	
  log	
  transformaKons	
  
Model:	
  CHAID	
  
•  Nominal	
  Criterion:	
  Chi	
  Square	
  
•  Significance	
  Level:	
  0.1	
  
•  Minimum	
  number	
  of	
  observaKons	
  in	
  a	
  leaf	
  =	
  25	
  
•  ObservaKons	
  required	
  for	
  a	
  split	
  search	
  =	
  55	
  
•  Model	
  assessment	
  measure:	
  Total	
  Leaf	
  Impurity	
  (Gini	
  
Index)	
  
Model:	
  CHAID	
  (con’t)	
  
Model:	
  CHAID	
  (con’t)	
  
Inference:	
  
	
  FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  <	
  8.5	
  	
  
	
  1%	
  =	
  56%	
  

Less	
  MarkeKng	
  Effort	
  needed	
  as	
  most	
  likely	
  that	
  
they	
  will	
  donate	
  anyways	
  

FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  >=	
  8.5;	
  
NUMBER_PROM_12	
  <11.5	
  
1%	
  =	
  43%	
  
	
  

Will	
  also	
  donate	
  but	
  the	
  company	
  should	
  be	
  
careful	
  and	
  not	
  send	
  them	
  too	
  many	
  promoKons	
  

FREQUENCY_STATUS_97NK	
  =	
  3	
  or	
  4;	
  
MONTHS_SINCE_LAST_GIFT	
  >=	
  8.5;	
  
NUMBER_PROM_12	
  >=	
  11.5	
  
1%	
  =	
  30%	
  
	
  

Are	
  geong	
  too	
  many	
  promoKons;	
  and	
  hence	
  
company	
  should	
  cut	
  on	
  sending	
  them	
  
promoKons	
  

FREQUENCY_STATUS_97NK	
  =	
  1,	
  2	
  or	
  Missing	
  
1%	
  =	
  21%	
  

Study	
  them	
  more	
  closely	
  as	
  in	
  why	
  they	
  are	
  not	
  
donaKng,	
  what	
  other	
  factors	
  are	
  responsible	
  and	
  
then	
  decide	
  how	
  to	
  design	
  a	
  markeKng	
  
campaign	
  for	
  them.	
  
	
  
Variable	
  Selection	
  
•  Target	
  AssociaKons:	
  Select	
  Chi	
  Square	
  
Model:	
  Forward	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  FORWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Forward	
  Regression	
  
(con’t)	
  
Model:	
  Forward	
  Regression	
  
(con’t)	
  
Model:	
  Backward	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  BACKWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Stay:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Backward	
  Regression	
  
(con’t)	
  
Model:	
  Backward	
  Regression	
  
(con’t)	
  
Model:	
  Stepwise	
  Regression	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  STEPWISE	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.15	
  
•  SL	
  Stay:	
  0.05	
  
• 
• 
• 
• 
Model:	
  Stepwise	
  Regression	
  
(con’t)	
  
Model:	
  Stepwise	
  Regression	
  
(con’t)	
  
Variable	
  Comparison	
  
Forward	
  

Backward	
  

Stepwise	
  

FILE_CARD_GIFT	
  

FILE_CARD_GIFT	
  

FILE_CARD_GIFT	
  

FREQUENCY_STATUS_97NK	
  

FREQUENCY_STATUS_97NK	
  

FREQUENCY_STATUS_97NK	
  

INCOME_GROUP	
  

INCOME_GROUP	
  

INCOME_GROUP	
  

LIFE_AV9*	
  

LIFE_AV9*	
  

LIFE_AV9*	
  

MONTHS_SINCE_LAST_GIFT	
  

MONTHS_SINCE_LAST_GIFT	
  

MONTHS_SINCE_LAST_GIFT	
  

PEP_STAR	
  

PEP_STAR	
  

PEP_STAR	
  

LIFETIME_GIFT_AMOUNT	
  
MEDIAN_HOUSEHOLD_INCOME	
  
RECENT_RESPONSE_PROP	
  

*LIFE_AV9	
  is	
  the	
  log(LIFETIME_AVG_GIFT_AMOUNT)	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FORWARD	
  	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>BACKWARD	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FORWARD	
  	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
Forward	
  versus	
  Backward	
  	
  
•  Variables:	
  

•  LIFETIME_GIFT_AMOUNT	
  
•  MEDIAN_HOUSEHOLD_INCOME	
  
•  RECENT_RESPONSE_PROP	
  

•  CorrelaKons:	
  
•  MEDIAN_HOUSEHOLD_INCOME	
  and	
  INCOME_GROUP	
  =	
  43%	
  
•  LIFE_AV9	
  and	
  LIFETIME_AVG_GIFT_AMT	
  =	
  83%	
  	
  
•  FILE_CARD_GIFT	
  and	
  RECENT_RESPONSE_PROP	
  =	
  30%	
  
Model:	
  Forward	
  +	
  
RECENT_RESPONSE_PROP	
  	
  	
  
•  Variable	
  SelecKon	
  (call	
  it	
  Variable_1Extra):	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  

•  Reject	
  other	
  variables	
  manually	
  
•  Call	
  this	
  model	
  For_1Extra	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>For_1Extra	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  BACKWARD	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>For_1Extra	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
	
  
Model:	
  Decision	
  
Final	
  Model:	
  FOR_1EXTRA	
  
	
  
Variables:	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
Interaction	
  Terms	
  
•  FREQ_PEP	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  PEP_STAR	
  
•  FREQ_MONTH	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  
MONTHS_SINCE_LAST_GIFT	
  	
  
•  FREQ_INCOME	
  =	
  FREQUENCY_STATUS_97NK	
  *	
  
INCOME_GROUP	
  
Model:	
  Forward	
  Regression	
  
with	
  Interaction	
  Terms	
  
Rename	
  model	
  as	
  FOR_1E_INT	
  
MODEL	
  OPTIONS	
  -­‐>	
  INPUT	
  CODING	
  -­‐>DEVIATION	
  	
  
SELECTION	
  METHOD	
  -­‐>	
  FORWARD	
  
CRITERIA	
  -­‐>	
  CROSS	
  VALIDATION	
  MISCLASSIFICATION	
  
ADVANCED	
  -­‐>	
  OPTIMIZATION	
  METHOD	
  -­‐>	
  NEWTON-­‐RAPHSON	
  
w/	
  LINE	
  SEARCH	
  
•  SL	
  Entry:	
  0.05	
  
• 
• 
• 
• 
• 
Model	
  FOR_1E_INT:	
  
Cumulative	
  LIFT	
  
Model	
  FOR_1E_INT:	
  Variable	
  
List	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>FOR_1E_INT	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_1EXTRA	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (TEST):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>ANY	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_1EXTRA	
  
Model:	
  For_1EXTRA	
  +	
  
Interaction	
  terms	
  	
  	
  
•  Variable	
  SelecKon	
  (call	
  it	
  Variable_UNION):	
  
• 
• 
• 
• 
• 
• 
• 
• 
• 
• 
	
  

FILE_CARD_GIFT	
  
FREQENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
FREQ_PEP	
  	
  
FREQ_MONTH	
  	
  
FREQ_INCOME	
  

•  Reject	
  other	
  variables	
  manually	
  
•  Call	
  this	
  model	
  For_Union	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Model	
  Comparison	
  
(Validation):	
  Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>	
  FOR_1E_INT	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_UNION	
  
Model	
  Comparison	
  (Test):	
  
Cumulative	
  LIFT	
  
Model	
  Comparison	
  (Test):	
  
Cumulative	
  LIFT	
  
Inference:	
  
•  Capture	
  top	
  20%	
  of	
  the	
  market	
  -­‐>	
  ANY	
  
•  Capture	
  top	
  30%	
  of	
  the	
  market	
  -­‐>	
  FOR_UNION	
  
Model:	
  Decision	
  
Final	
  Model:	
  FOR_1EXTRA	
  because	
  	
  
•  No	
  significant	
  improvement	
  with	
  other	
  models	
  
•  InteracKon	
  terms	
  bring	
  along	
  complexity	
  
	
  
Variables:	
  
• 
• 
• 
• 
• 
• 
• 

FILE_CARD_GIFT	
  
FREQUENCY_STATUS_97NK	
  
INCOME_GROUP	
  
LIFE_AV9	
  
MONTHS_SINCE_LAST_GIFT	
  
PEP_STAR	
  
RECENT_RESPONSE_PROP	
  
Score:	
  On	
  Donor_Raw_Data	
  
THANK	
  YOU	
  

Contenu connexe

Similaire à Analyzing donor data to predict donations

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql executionxKinAnx
 
Vertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative FilteringVertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative Filteringgorass
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Company segmentation - an approach with R
Company segmentation - an approach with RCompany segmentation - an approach with R
Company segmentation - an approach with RCasper Crause
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Wayyingfeng
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxAniket Patil
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptxpatilaniket2418
 
Maximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learningMaximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learningAlejandro Correa Bahnsen, PhD
 
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...Big-Data-Summit
 
Lead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdfLead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdfKrishP2
 
Monte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBrokerMonte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBrokerThaiQuants
 
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMIPhase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMIVikas Virani
 
Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMinitab, LLC
 
The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...Mary Chan
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment briefAhmed Shaaban
 
Wooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersWooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersLucinda Linde
 

Similaire à Analyzing donor data to predict donations (20)

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql execution
 
Vertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative FilteringVertical Recommendation Using Collaborative Filtering
Vertical Recommendation Using Collaborative Filtering
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Company segmentation - an approach with R
Company segmentation - an approach with RCompany segmentation - an approach with R
Company segmentation - an approach with R
 
Deepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn WayDeepak-Computational Advertising-The LinkedIn Way
Deepak-Computational Advertising-The LinkedIn Way
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptxCustomer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
 
Maximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learningMaximizing a churn campaigns profitability with cost sensitive machine learning
Maximizing a churn campaigns profitability with cost sensitive machine learning
 
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
BDAS-2017 | Maximizing a churn campaign’s profitability with cost sensitive m...
 
Lead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdfLead Scoring Group Case Study Presentation.pdf
Lead Scoring Group Case Study Presentation.pdf
 
Monte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBrokerMonte Carlo Simulation for Trading System in AmiBroker
Monte Carlo Simulation for Trading System in AmiBroker
 
Phase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMIPhase 2 of Predicting Payment default on Vehicle Loan EMI
Phase 2 of Predicting Payment default on Vehicle Loan EMI
 
Supply Chain Basic
Supply Chain BasicSupply Chain Basic
Supply Chain Basic
 
Machine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APACMachine Learning with Binary Logistic Regression - APAC
Machine Learning with Binary Logistic Regression - APAC
 
1000 track2 boire
1000 track2 boire1000 track2 boire
1000 track2 boire
 
The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...The metrics that matter using scalability metrics for project planning of a d...
The metrics that matter using scalability metrics for project planning of a d...
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment brief
 
1030 track1 heiler
1030 track1 heiler1030 track1 heiler
1030 track1 heiler
 
Telecom Churn Analysis
Telecom Churn AnalysisTelecom Churn Analysis
Telecom Churn Analysis
 
Wooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit CustomersWooing the Best Bank Deposit Customers
Wooing the Best Bank Deposit Customers
 

Dernier

BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.DanielaQuiroz63
 
The Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckThe Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckToluwanimi Balogun
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...ChesterYang6
 
Local SEO Domination: Put your business at the forefront of local searches!
Local SEO Domination:  Put your business at the forefront of local searches!Local SEO Domination:  Put your business at the forefront of local searches!
Local SEO Domination: Put your business at the forefront of local searches!dstvtechnician
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceDamien ROBERT
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldScalenut
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessVarn
 
BLOOM_April2024. Balmer Lawrie Online Monthly Bulletin
BLOOM_April2024. Balmer Lawrie Online Monthly BulletinBLOOM_April2024. Balmer Lawrie Online Monthly Bulletin
BLOOM_April2024. Balmer Lawrie Online Monthly BulletinBalmerLawrie
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsVWO
 
How To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot SetupHow To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot Setupssuser4571da
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessAggregage
 
9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking MenSapana Sha
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Richard Ingilby
 
Aryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxAryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxtegevi9289
 

Dernier (20)

BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.
 
The Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship DeckThe Skin Games 2024 25 - Sponsorship Deck
The Skin Games 2024 25 - Sponsorship Deck
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
 
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
Generative AI Master Class - Generative AI, Unleash Creative Opportunity - Pe...
 
Local SEO Domination: Put your business at the forefront of local searches!
Local SEO Domination:  Put your business at the forefront of local searches!Local SEO Domination:  Put your business at the forefront of local searches!
Local SEO Domination: Put your business at the forefront of local searches!
 
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose GuirgisCreator Influencer Strategy Master Class - Corinne Rose Guirgis
Creator Influencer Strategy Master Class - Corinne Rose Guirgis
 
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan ScheltgenHow to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven World
 
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO SuccessBrighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
Brighton SEO April 2024 - The Good, the Bad & the Ugly of SEO Success
 
BLOOM_April2024. Balmer Lawrie Online Monthly Bulletin
BLOOM_April2024. Balmer Lawrie Online Monthly BulletinBLOOM_April2024. Balmer Lawrie Online Monthly Bulletin
BLOOM_April2024. Balmer Lawrie Online Monthly Bulletin
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 Reports
 
How To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot SetupHow To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot Setup
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail Success
 
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
 
9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men9654467111 Call Girls In Mahipalpur Women Seeking Men
9654467111 Call Girls In Mahipalpur Women Seeking Men
 
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAILBUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
BUY GMAIL ACCOUNTS PVA USA IP INDIAN IP GMAIL
 
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
Moving beyond multi-touch attribution - DigiMarCon CanWest 2024
 
Aryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptxAryabhata I, II of mathematics of both.pptx
Aryabhata I, II of mathematics of both.pptx
 

Analyzing donor data to predict donations

  • 1. Database  Marketing  and  CRM  –   Analyzing  DONOR  data  set     Akanksha  Jain  
  • 2. Project  Goals   •  Goal:  Using  historical  data  set  DONOR_RAW,  develop  a   model  which  can  predict  whether  the  prospect  will   donate/  not  donate   •  Scope:  DONOR_RAW  data  set   •  50  Variables   •  19,372  observaKons   •  Dependent  Variable:  TARGET_B(Binary)   •  Responder:  1   •  Non-­‐Responder:  0  
  • 3. TOOLS   •  SAS  Enterprise  Miner  4.3   •  SAS  9.3_M1  
  • 6. Data  Source   •  Reject  Variables:   •  TARGET_D  (using  TARGET_B  as  target)   •  ID  (an  id  number)   •  WEALTH_RATING  (huge  no.  of  missing  values)   •  Variable  TARGET_B   •  Change  Role  to  TARGET   •  Change  Order  to  DESCENDING   •  Select  complete  data  set  as  Sample   •  Set  Prior  ProbabiliKes     •  Responder:  0.05   •  Non-­‐Responder:  0.95  
  • 7. Data  Partition   •  Train  –  60%   •  Validate  –  25%   •  Test  –  15%  
  • 8. Variable  Transformation   Taking  Log  TransformaKon  to  reduce  Skewness   •  LIFETIME_GIFT_RANGE   •  LIFETIME_MAX_GIFT_AMT   •  LIFETIME_MIN_GIFT_AMT   •  MOR_HIT_RATE   •  FILE_AVG_GIFT   •  LIFETIME_AVG_GIFT_AMT   •  PCT_ATTRIBUTE1   •  LAST_GIFT_AMT   •  RECENT_AVG_GIFT_AMT     Keep  all  variables,  original  and  log  transformaKons  
  • 9. Model:  CHAID   •  Nominal  Criterion:  Chi  Square   •  Significance  Level:  0.1   •  Minimum  number  of  observaKons  in  a  leaf  =  25   •  ObservaKons  required  for  a  split  search  =  55   •  Model  assessment  measure:  Total  Leaf  Impurity  (Gini   Index)  
  • 11. Model:  CHAID  (con’t)   Inference:    FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  <  8.5      1%  =  56%   Less  MarkeKng  Effort  needed  as  most  likely  that   they  will  donate  anyways   FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  >=  8.5;   NUMBER_PROM_12  <11.5   1%  =  43%     Will  also  donate  but  the  company  should  be   careful  and  not  send  them  too  many  promoKons   FREQUENCY_STATUS_97NK  =  3  or  4;   MONTHS_SINCE_LAST_GIFT  >=  8.5;   NUMBER_PROM_12  >=  11.5   1%  =  30%     Are  geong  too  many  promoKons;  and  hence   company  should  cut  on  sending  them   promoKons   FREQUENCY_STATUS_97NK  =  1,  2  or  Missing   1%  =  21%   Study  them  more  closely  as  in  why  they  are  not   donaKng,  what  other  factors  are  responsible  and   then  decide  how  to  design  a  markeKng   campaign  for  them.    
  • 12. Variable  Selection   •  Target  AssociaKons:  Select  Chi  Square  
  • 13. Model:  Forward  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  FORWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.05   •  •  •  • 
  • 14. Model:  Forward  Regression   (con’t)  
  • 15. Model:  Forward  Regression   (con’t)  
  • 16. Model:  Backward  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  BACKWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Stay:  0.05   •  •  •  • 
  • 19. Model:  Stepwise  Regression   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  STEPWISE   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.15   •  SL  Stay:  0.05   •  •  •  • 
  • 22. Variable  Comparison   Forward   Backward   Stepwise   FILE_CARD_GIFT   FILE_CARD_GIFT   FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   FREQUENCY_STATUS_97NK   FREQUENCY_STATUS_97NK   INCOME_GROUP   INCOME_GROUP   INCOME_GROUP   LIFE_AV9*   LIFE_AV9*   LIFE_AV9*   MONTHS_SINCE_LAST_GIFT   MONTHS_SINCE_LAST_GIFT   MONTHS_SINCE_LAST_GIFT   PEP_STAR   PEP_STAR   PEP_STAR   LIFETIME_GIFT_AMOUNT   MEDIAN_HOUSEHOLD_INCOME   RECENT_RESPONSE_PROP   *LIFE_AV9  is  the  log(LIFETIME_AVG_GIFT_AMOUNT)  
  • 23. Model  Comparison   (Validation):  Cumulative  LIFT    
  • 24. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FORWARD     •  Capture  top  30%  of  the  market  -­‐>BACKWARD  
  • 25. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 26. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FORWARD     •  Capture  top  30%  of  the  market  -­‐>  ANY  
  • 27. Forward  versus  Backward     •  Variables:   •  LIFETIME_GIFT_AMOUNT   •  MEDIAN_HOUSEHOLD_INCOME   •  RECENT_RESPONSE_PROP   •  CorrelaKons:   •  MEDIAN_HOUSEHOLD_INCOME  and  INCOME_GROUP  =  43%   •  LIFE_AV9  and  LIFETIME_AVG_GIFT_AMT  =  83%     •  FILE_CARD_GIFT  and  RECENT_RESPONSE_PROP  =  30%  
  • 28. Model:  Forward  +   RECENT_RESPONSE_PROP       •  Variable  SelecKon  (call  it  Variable_1Extra):   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP   •  Reject  other  variables  manually   •  Call  this  model  For_1Extra  
  • 29. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 30. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>For_1Extra   •  Capture  top  30%  of  the  market  -­‐>  BACKWARD  
  • 31. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 32. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>For_1Extra   •  Capture  top  30%  of  the  market  -­‐>  ANY    
  • 33. Model:  Decision   Final  Model:  FOR_1EXTRA     Variables:   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP  
  • 34. Interaction  Terms   •  FREQ_PEP  =  FREQUENCY_STATUS_97NK  *  PEP_STAR   •  FREQ_MONTH  =  FREQUENCY_STATUS_97NK  *   MONTHS_SINCE_LAST_GIFT     •  FREQ_INCOME  =  FREQUENCY_STATUS_97NK  *   INCOME_GROUP  
  • 35. Model:  Forward  Regression   with  Interaction  Terms   Rename  model  as  FOR_1E_INT   MODEL  OPTIONS  -­‐>  INPUT  CODING  -­‐>DEVIATION     SELECTION  METHOD  -­‐>  FORWARD   CRITERIA  -­‐>  CROSS  VALIDATION  MISCLASSIFICATION   ADVANCED  -­‐>  OPTIMIZATION  METHOD  -­‐>  NEWTON-­‐RAPHSON   w/  LINE  SEARCH   •  SL  Entry:  0.05   •  •  •  •  • 
  • 38. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 39. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>FOR_1E_INT   •  Capture  top  30%  of  the  market  -­‐>  FOR_1EXTRA  
  • 40. Model  Comparison  (TEST):   Cumulative  LIFT  
  • 41. Model  Comparison  (TEST):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>ANY   •  Capture  top  30%  of  the  market  -­‐>  FOR_1EXTRA  
  • 42. Model:  For_1EXTRA  +   Interaction  terms       •  Variable  SelecKon  (call  it  Variable_UNION):   •  •  •  •  •  •  •  •  •  •    FILE_CARD_GIFT   FREQENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP   FREQ_PEP     FREQ_MONTH     FREQ_INCOME   •  Reject  other  variables  manually   •  Call  this  model  For_Union  
  • 43. Model  Comparison   (Validation):  Cumulative  LIFT  
  • 44. Model  Comparison   (Validation):  Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>  FOR_1E_INT   •  Capture  top  30%  of  the  market  -­‐>  FOR_UNION  
  • 45. Model  Comparison  (Test):   Cumulative  LIFT  
  • 46. Model  Comparison  (Test):   Cumulative  LIFT   Inference:   •  Capture  top  20%  of  the  market  -­‐>  ANY   •  Capture  top  30%  of  the  market  -­‐>  FOR_UNION  
  • 47. Model:  Decision   Final  Model:  FOR_1EXTRA  because     •  No  significant  improvement  with  other  models   •  InteracKon  terms  bring  along  complexity     Variables:   •  •  •  •  •  •  •  FILE_CARD_GIFT   FREQUENCY_STATUS_97NK   INCOME_GROUP   LIFE_AV9   MONTHS_SINCE_LAST_GIFT   PEP_STAR   RECENT_RESPONSE_PROP