MSBA 6420 Rapid Winners
Kaushik Nuvvula, Pankaj Singhal, Wenqiuli Zhang, John Tong, Rohith D
Rapid Approaches
Agenda
• Approach
• Different techniques used
• Techniques that worked
• Techniques that did not work
• Best Model
• Future Scope and Learnings
Approach: Top 5 Models
Model
• Neural Network
• Voting (SVM, Neural)
• Boosting (Neural)
• Bagging (Neural)
• Stacking (k-NN, Neural)
Techniques Used
• Sampling, Normalization
• Data Pre-processing, Weights, Bagging, Voting, Sampling
• Data Pre-processing, Sampling, Generate Attributes
• Select by weights, Data Pre-processing, Sampling
• Data Pre-processing, Sampling
F-measure and cost: Top 5 Models
Techniques that worked
Data Processing
Data Preprocessing
Generate Attributes
Attribute Selection
Techniques
Attributes Selection
Optimize Parameters
Techniques
PCA, SMOTE
Voting
Normalization
Bagging, Boosting, Stacking
Filter Examples, Sampling
SMOTE: Resampling Approach
• SMOTE (Synthetic Minority Over-sampling Technique) combines informed oversampling of the minority class with random under-sampling of the majority class.
• For each minority sample:
– Find its k nearest minority neighbors
– Randomly select j of these neighbors
– Randomly generate synthetic samples along the lines joining the minority sample and its j selected neighbors
*Among re-sampling and probabilistic-estimate-modifying techniques, SMOTE currently yields the best results (Chawla, 2003).
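The three steps above can be sketched in a few lines of Python. This is a minimal illustration of the oversampling step only, not the operator the team actually ran; the function name `smote`, its parameters `k`, `j`, and `seed`, and the toy points are all assumptions made for the example. Random under-sampling of the majority class is not shown.

```python
import math
import random


def smote(minority, k=3, j=2, seed=0):
    """Generate synthetic points from a list of minority samples.

    Hypothetical helper for illustration: `minority` is a list of
    equal-length tuples of floats.
    """
    rng = random.Random(seed)
    synthetic = []
    for i, x in enumerate(minority):
        # Step 1: find the k nearest minority neighbours of x (excluding x).
        others = [p for idx, p in enumerate(minority) if idx != i]
        neighbours = sorted(others, key=lambda p: math.dist(x, p))[:k]
        # Step 2: randomly select j of these neighbours.
        chosen = rng.sample(neighbours, min(j, len(neighbours)))
        # Step 3: place one synthetic point at a random position on the
        # line segment joining x and each selected neighbour.
        for n in chosen:
            gap = rng.random()
            synthetic.append(tuple(a + gap * (b - a) for a, b in zip(x, n)))
    return synthetic


# Toy minority class (made-up values).
minority = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (3.0, 3.0)]
new_points = smote(minority, k=3, j=2)
print(len(new_points))  # 4 samples x 2 selected neighbours each
```

Because every synthetic point is an interpolation between two minority samples, it always falls inside the region the minority class already spans; the sketch never looks at majority samples.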
Deep Dive: SMOTE Sampling
[Figure: scatter plot. Legend: minority sample, synthetic sample, majority sample]
What happens if there is a nearby majority sample?
Techniques that did not work
• Meta-cost
• Forward Selection
• Logistic Regression
Best Model
Neural Network
Class 0: Above 0.1
Class 0: Between 0.03 – 0.1
F-Measure and Misclassification Cost Improvement
Scope - Improvements
Filter Examples
Metric               Change           Improvement
Average Friend Age   17 to 31         Positive
Tenure               > 4              Positive
Songs Listened       > 1              Negative
Age                  > 8 and < 70     Negative
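One reading of this table is that each row restricts the training examples to a value range and records whether the result improved. Under that assumption, a range filter can be sketched as follows; the helper `filter_examples`, the field names, and the sample records are hypothetical and do not come from the team's data.

```python
def filter_examples(rows, column, low, high):
    """Keep rows whose value in `column` lies in the closed range [low, high]."""
    return [r for r in rows if low <= r[column] <= high]


# Made-up records with hypothetical field names.
records = [
    {"avg_friend_age": 24.0, "tenure": 6},
    {"avg_friend_age": 45.0, "tenure": 2},
    {"avg_friend_age": 19.5, "tenure": 10},
]

# Mirrors the "Average Friend Age, 17 to 31" row of the table.
kept = filter_examples(records, "avg_friend_age", 17, 31)
print(len(kept))
```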
Key Learnings: Warnings
• Remove oversampling: it introduces bias in the data
• Generate calculated attributes
• A more complex model does not guarantee a better F-measure
• Try to train your models on records that capture relatively higher variability, using Filter Examples
Appendix
          True 0   True 1
Pred. 0    24259      335
Pred. 1     1302      109

          True 0   True 1
Pred. 0    23442      320
Pred. 1     1289      400

F-Measure
Misclassification Cost
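For reference, both quantities can be recomputed from the two confusion matrices above. The sketch below assumes class 1 is the positive class. The slides do not state the cost matrix, so `cost_fp` and `cost_fn` are placeholder parameters (set to 1 here) rather than the costs used in the project.

```python
def f_measure(tp, fp, fn):
    """F1 score: harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def misclassification_cost(fp, fn, cost_fp=1.0, cost_fn=1.0):
    """Total cost of errors; the unit costs are placeholders, not from the slides."""
    return fp * cost_fp + fn * cost_fn


# Counts read from the two appendix tables (class 1 assumed positive).
first = {"tn": 24259, "fn": 335, "fp": 1302, "tp": 109}
second = {"tn": 23442, "fn": 320, "fp": 1289, "tp": 400}

f1_first = f_measure(first["tp"], first["fp"], first["fn"])
f1_second = f_measure(second["tp"], second["fp"], second["fn"])
print(round(f1_first, 4))   # ≈ 0.1175
print(round(f1_second, 4))  # ≈ 0.3321

cost_first = misclassification_cost(first["fp"], first["fn"])
cost_second = misclassification_cost(second["fp"], second["fn"])
```

With unit costs, the cost is simply the number of misclassified records; substituting the project's real cost matrix would change the totals but not the calculation.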

Predictive Modeling: Predict Premium Subscriber for a Leading International Music Website
