Predictive Analytics
Advanced Techniques in Data Mining

Sara Venturina



                      Copyright © 2011, SAS Institute Inc. All rights reserved.
Agenda
• What is predictive analytics?

• Predictive Analytics Process

• Data Preparation techniques

• Modeling Techniques

• Model Monitoring techniques




What is Predictive Analytics?
Different levels of analytics


Figure: staircase of analytics levels, from lowest to highest:
• Standard reports
• Ad hoc reports
• Query drilldown (or OLAP)
• Alerts
• Statistical analysis
• Forecasting
• Predictive modeling
• Optimization




What is Predictive Analytics?
Unfortunately, there is no “magic” involved!

• Uses data from different source tables
• Applies various data transformation techniques
• Rests on statistical theory as its foundation
• Needs software to manage all of this



Business/commercial analytics (as opposed to research
 analytics) is trickier, because you need to balance
 theory with realistic application


Predictive Analytics Process


Figure: cycle with “Predictive Analytics Process” at the center: Defining Objectives, Data Preparation, Modeling, Deployment, Model Monitoring, then back to Defining Objectives.




Data Preparation Techniques
• Possible data sources
• Data transformation techniques
• Deriving “behavioral” information
• Data quality check before modeling




Data Preparation Techniques
Possible data sources
• Data warehouse/ data marts
• Operational systems, e.g. transaction systems, billing,
  call center data, etc.
• External data, e.g. survey data, campaign data, data from
  external agencies, etc.

For external data, make sure the information is consistently available




Data Preparation Techniques
Data transformation techniques
• Entity-level information
• Indicator variables
   • Are values skewed towards one level?

• Categorization/grouping of values
   • Are there too many levels of values?
   • Are there values that rarely occur?

• Binning of continuous variables
• Benchmarking information, e.g. industry benchmarking
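
The transformations listed above can be sketched in a few lines. This is a minimal Python illustration under stated assumptions, not the presenter's method: the plan and spend values, the 5% rarity threshold, the `OTHER` label, and the bin edges are all hypothetical.

```python
from bisect import bisect_right
from collections import Counter


def indicator(values, level):
    """Indicator variable: 1 where the value equals `level`, else 0."""
    return [1 if v == level else 0 for v in values]


def group_rare_levels(values, min_share=0.05, other="OTHER"):
    """Collapse levels whose share of rows is below `min_share` into one group."""
    counts = Counter(values)
    n = len(values)
    return [v if counts[v] / n >= min_share else other for v in values]


def bin_continuous(values, edges):
    """Bin index per value: 0 is below edges[0]; a value equal to an edge goes to the upper bin."""
    return [bisect_right(edges, v) for v in values]


# Hypothetical plan types: "prepaid" dominates, "corporate" and "trial" are rare.
plans = ["prepaid"] * 90 + ["postpaid"] * 8 + ["corporate", "trial"]
grouped = group_rare_levels(plans)  # threshold of 5% is an assumption
is_prepaid = indicator(plans, "prepaid")

# The share of 1s shows whether the indicator is skewed towards one level.
prepaid_share = sum(is_prepaid) / len(is_prepaid)

# Hypothetical monthly spend, binned at the assumed edges 20 and 50.
spend_bins = bin_continuous([5.0, 20.0, 49.9, 120.0], edges=[20, 50])
```

An indicator that is almost always 1 (or almost always 0) carries little information, which is why the skew check comes before modeling.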

Data Preparation Techniques
Deriving “behavioral” information using several time periods
• Average behavior over the last X time periods
• Measures of variation
   • Standard deviation
   • Coefficient of Variation
   • Deviation from the Mean

• Measures of trend information
   • Ratio of 1 vs 3, and 3 vs 6, time periods
   • Proportion of current vs average of last X time periods
   • Slope of regression line
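
The measures above can be derived from one entity's history as follows. This is a sketch with assumed interpretations: "deviation from the mean" is taken as the latest value minus the average, "1 vs 3" as the latest period against the average of the last three, and the slope as a least-squares fit against the period index. It assumes at least six periods and nonzero averages; the function name and the sample data are hypothetical.

```python
from statistics import mean, pstdev


def behavioral_features(history):
    """Derive behavioral measures from one entity's values, ordered oldest to newest."""
    avg = mean(history)
    sd = pstdev(history)
    n = len(history)
    # Least-squares slope of value against period index 0..n-1.
    x_bar = (n - 1) / 2
    sxx = sum((x - x_bar) ** 2 for x in range(n))
    sxy = sum((x - x_bar) * (y - avg) for x, y in enumerate(history))
    return {
        "average": avg,
        "std_dev": sd,
        "coef_variation": sd / avg,
        "deviation_from_mean": history[-1] - avg,  # assumed definition
        "ratio_1_vs_3": history[-1] / mean(history[-3:]),  # assumed definition
        "ratio_3_vs_6": mean(history[-3:]) / mean(history[-6:]),
        "current_vs_average": history[-1] / avg,
        "slope": sxy / sxx,
    }


# Hypothetical six months of usage for one customer, oldest first.
features = behavioral_features([10, 12, 14, 16, 18, 20])
```

For this steadily growing series the slope is 2 units per period, and both ratios are above 1, which is the pattern a rising trend should produce.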



Data Preparation Techniques
Data quality check before modeling
• Generation of summary statistics of derived variables
• Random checking
• Correct imputation of missing values
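
A quality check of this kind might look like the sketch below. The slides do not name an imputation method, so median imputation is an assumption here, as are the function names and the sample variable.

```python
from statistics import mean, median


def summarize(values):
    """Summary statistics for one derived variable; None marks a missing value."""
    present = [v for v in values if v is not None]
    return {
        "n": len(values),
        "n_missing": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "mean": mean(present),
        "median": median(present),
    }


def impute_median(values):
    """Replace missing values with the median of the observed values (assumed method)."""
    fill = median(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


# Hypothetical derived variable with two missing entries and one suspicious value.
avg_spend = [12.0, None, 15.0, 14.0, None, 900.0, 13.0]
stats = summarize(avg_spend)
filled = impute_median(avg_spend)
```

The gap between the mean and the median in `stats` flags the 900.0 entry for a manual look, which is the kind of issue random checking is meant to catch.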




Modeling Techniques
• Use of SAS Enterprise Miner
• Ensemble modeling outside of SAS
• Base SAS modeling, e.g. for categorical targets, survival
 analysis, etc.




Modeling Techniques
Use of SAS Enterprise Miner




For initial/basic modeling, use a Decision Tree or Regression.
Neural networks can be used to provide diagnostic insights.
Modeling Techniques
Ensemble modeling in and out of SAS EM
Ensemble model based on the following models:

Model      Type          Weightage
Model 1    Decision      0.4
Model 2    Regression    0.6
Model 3    Regression    0.4
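
One way to apply such weights outside of SAS is a weighted average of the individual model scores. The slide does not say how the weights are combined; because the listed weights sum to 1.4 rather than 1, this sketch divides by their sum, which is an assumption. The per-model scores are made up.

```python
def weighted_ensemble(scores, weights):
    """Combine per-model scores into one score using a normalized weighted average."""
    total = sum(weights)  # normalizing by the weight sum is an assumption
    return sum(s * w for s, w in zip(scores, weights)) / total


# Weightage column from the slide; the scores are hypothetical for one customer.
weights = [0.4, 0.6, 0.4]
model_scores = [0.80, 0.50, 0.30]
ensemble_score = weighted_ensemble(model_scores, weights)
```

Normalizing keeps the ensemble score on the same scale as the individual scores, so it always lies between the lowest and highest model score.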




Modeling Techniques
Base SAS modeling
• Categorical data modeling, e.g.
    • PROC CATMOD/GENMOD
    • PROC SURVEYLOGISTIC
• Survival analysis:
    • PROC LIFEREG
    • PROC LIFETEST
    • PROC PHREG

Base SAS modeling requires more familiarity with the underlying statistical concepts
Model Monitoring Techniques
• Comparing actual vs predicted
• Scored base analysis:
   • Variable distribution analysis
   • Predicted Score distribution




Model Monitoring
Monitoring of model assessment charts, e.g.
• A chart that compares the effectiveness of running a model versus selecting randomly
• A chart that measures what percentage of all churners are in the scoring list (e.g. the top 10% of scores captured 40% of actual churners)




Other model assessment statistics, such as hit rate and the Gini coefficient,
 can also be computed.
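
The chart quantities described above can be computed directly from a scored list. This is a minimal sketch: the function names and the 20-customer data set are hypothetical, and hit rate is assumed to mean the share of actual churners within the selected top group.

```python
def _top_group(scores, actuals, top_share):
    """Actual outcomes of the highest-scored `top_share` of the base."""
    ranked = sorted(zip(scores, actuals), key=lambda pair: pair[0], reverse=True)
    cutoff = max(1, round(len(ranked) * top_share))
    return [actual for _, actual in ranked[:cutoff]]


def capture_rate(scores, actuals, top_share):
    """Share of all actual churners found in the top `top_share` of scores."""
    return sum(_top_group(scores, actuals, top_share)) / sum(actuals)


def lift(scores, actuals, top_share):
    """Capture rate relative to random selection, which would capture `top_share`."""
    return capture_rate(scores, actuals, top_share) / top_share


def hit_rate(scores, actuals, top_share):
    """Share of the top group that actually churned (assumed definition)."""
    group = _top_group(scores, actuals, top_share)
    return sum(group) / len(group)


# Hypothetical scored base of 20 customers, 5 of them actual churners (1 = churned).
scores = [0.95, 0.90, 0.85, 0.80, 0.70, 0.65, 0.60, 0.55, 0.50, 0.45,
          0.40, 0.35, 0.30, 0.25, 0.20, 0.15, 0.12, 0.10, 0.05, 0.01]
actuals = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0,
           0, 0, 0, 1, 0, 0, 0, 0, 0, 0]
top_10_capture = capture_rate(scores, actuals, 0.10)
top_10_lift = lift(scores, actuals, 0.10)
top_10_hit = hit_rate(scores, actuals, 0.10)
```

In this made-up data the top 10% of scores holds 2 of the 5 churners, so the capture rate is 40% and the model does about four times better than random selection at that depth.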
Model Monitoring (cont’d)
Scored base analysis i.e.
• Variable distribution analysis




Model Monitoring (cont’d)
Scored base analysis i.e.
• Predicted Score distribution
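
One common way to compare a predicted score distribution over time is the population stability index (PSI). The slides do not name this statistic, so treat it as a swapped-in technique for illustration. The sketch assumes scores lie in [0, 1]; the bin count, the floor for empty bins, and the sample scores are all hypothetical.

```python
from math import log


def bin_shares(scores, n_bins=10):
    """Share of scores in each of `n_bins` equal-width bins over [0, 1]."""
    counts = [0] * n_bins
    for s in scores:
        counts[min(int(s * n_bins), n_bins - 1)] += 1
    return [c / len(scores) for c in counts]


def population_stability_index(baseline, current, n_bins=10, floor=1e-6):
    """Sum of (current - baseline) * ln(current / baseline) over score bins."""
    psi = 0.0
    for b, c in zip(bin_shares(baseline, n_bins), bin_shares(current, n_bins)):
        b, c = max(b, floor), max(c, floor)  # assumed floor avoids log(0) for empty bins
        psi += (c - b) * log(c / b)
    return psi


# Hypothetical predicted scores at model build time and in a later scoring run.
baseline_scores = [0.05, 0.15, 0.25, 0.35, 0.45, 0.55, 0.65, 0.75, 0.85, 0.95]
shifted_scores = [0.55, 0.65, 0.65, 0.75, 0.75, 0.85, 0.85, 0.95, 0.95, 0.95]

psi_same = population_stability_index(baseline_scores, baseline_scores)
psi_shifted = population_stability_index(baseline_scores, shifted_scores)
```

Identical distributions give a PSI of zero, and the value grows as the later scores drift away from the baseline. The same function can be applied to a binned input variable for the variable distribution analysis.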




Predictive Analytics as an Iterative Process


Figure: cycle with “Predictive Analytics Process” at the center: Defining Objectives, Data Preparation, Modeling, Deployment, Model Monitoring, then back to Defining Objectives.




Questions?




Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Predictive Analytics: Advanced techniques in data mining

  • 1. Predictive Analytics: Advanced Techniques in Data Mining. Sara Venturina. Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 2. Agenda • What is predictive analytics? • Predictive Analytics Process • Data Preparation techniques • Modeling Techniques • Model Monitoring techniques
  • 3. What is Predictive Analytics? Different levels of analytics, from simplest to most advanced: Standard reports, Ad hoc reports, Query drilldown (or OLAP), Alerts, Statistical analysis, Forecasting, Predictive modeling, Optimization
  • 4. What is Predictive Analytics? Unfortunately, there is no “magic” involved! • Use of data from different source tables • Utilizing various data transformation techniques • Employing statistical theories as foundation • Will need software to manage this. Focus on business/commercial (as opposed to research) analytics is trickier as you need to balance the theories with realistic application
  • 5. Predictive Analytics Process (cycle diagram): Defining Objectives → Data Preparation → Modeling → Deployment → Model Monitoring
  • 6. Data Preparation Techniques • Possible data sources • Data transformation techniques • Deriving “behavioral” information • Data quality check before modeling
  • 7. Data Preparation Techniques: Possible data sources • Data warehouse/data marts • Operational systems, e.g. transaction systems, billing, call center data, etc. • External data, e.g. survey data, campaign, data from external agencies, etc. For external data, make sure the information is consistently available
  • 8. Data Preparation Techniques: Data transformation techniques • Entity-level information • Indicator variables (are values skewed towards one level?) • Categorization/grouping of values (are there too many levels of values? are there values that rarely occur?) • Binning of continuous variables • Benchmarking information, e.g. industry benchmarking
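The transformations on this slide can be sketched in a few lines of plain Python. This is only an illustration, not the presenter's method: the function names, the 10% rarity threshold, the `OTHER` label, and the age bin edges are all assumptions chosen for the example.

```python
from collections import Counter

def group_rare_levels(values, min_share=0.05, other_label="OTHER"):
    """Collapse category levels whose share of rows is below min_share."""
    counts = Counter(values)
    total = len(values)
    keep = {level for level, n in counts.items() if n / total >= min_share}
    return [v if v in keep else other_label for v in values]

def bin_continuous(value, edges):
    """Return the index of the bin a value falls into.

    edges are ascending upper bounds; anything above the last edge
    goes into one final bin.
    """
    for i, upper in enumerate(edges):
        if value <= upper:
            return i
    return len(edges)

def indicator(value, level):
    """1 if value equals the given level, else 0."""
    return 1 if value == level else 0

# Hypothetical customer data: one plan type occurs rarely.
plans = ["prepaid"] * 12 + ["postpaid"] * 7 + ["corporate"] * 1
grouped = group_rare_levels(plans, min_share=0.10)

# Hypothetical age bands: <=24, 25-39, 40-59, 60+.
ages = [19, 25, 34, 47, 68]
age_bins = [bin_continuous(a, edges=[24, 39, 59]) for a in ages]

is_prepaid = [indicator(p, "prepaid") for p in grouped]
```

Checking the mean of an indicator such as `is_prepaid` is a quick way to answer the slide's question of whether values are skewed towards one level.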
  • 9. Data Preparation Techniques: Deriving “behavioral” information using several time periods • Average behavior over the last X time periods • Measures of variation: standard deviation, coefficient of variation, deviation from the mean • Measures of trend information: ratio of 1 vs 3 and 3 vs 6 time periods, proportion of current vs average of last X time periods, slope of regression line
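A minimal sketch of these behavioral measures for one customer, using only the Python standard library. Several interpretations here are assumptions, not taken from the slides: "ratio of 1 vs 3" is read as the latest period divided by the average of the last three, "deviation from the mean" as the latest value minus the overall average, and the standard deviation is the population form.

```python
from statistics import mean, pstdev

def slope(ys):
    """Least-squares slope of ys against the period index 0..n-1."""
    n = len(ys)
    x_bar = (n - 1) / 2
    y_bar = mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in enumerate(ys))
    den = sum((x - x_bar) ** 2 for x in range(n))
    return num / den

def behavioral_features(history):
    """history: values ordered oldest to newest, at least 6 periods."""
    avg = mean(history)
    sd = pstdev(history)
    last1 = history[-1]
    avg3 = mean(history[-3:])
    avg6 = mean(history[-6:])
    return {
        "avg": avg,
        "std": sd,
        "cv": sd / avg if avg else None,             # coefficient of variation
        "dev_from_mean": last1 - avg,                # latest vs overall average
        "ratio_1_vs_3": last1 / avg3 if avg3 else None,
        "ratio_3_vs_6": avg3 / avg6 if avg6 else None,
        "current_vs_avg": last1 / avg if avg else None,
        "slope": slope(history),                     # trend per period
    }

# Hypothetical monthly usage that declines steadily by 10 per month.
usage = [120, 110, 100, 90, 80, 70]
features = behavioral_features(usage)
```

For this declining series the slope is -10 per period and both ratios fall below 1, which is the kind of signal these derived variables are meant to expose.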
  • 10. Data Preparation Techniques: Data quality check before modeling • Generation of summary statistics of derived variables • Random checking • Correct imputation of missing values
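One way to run such a check is to summarize each derived variable before and after imputation. The sketch below assumes missing values are represented as `None` and uses median imputation with a missing-value flag. The slides do not say which imputation method is "correct", so that choice, like the variable name `avg_bill`, is purely illustrative.

```python
from statistics import mean, median

def summarize(values):
    """Basic summary statistics, ignoring missing (None) entries."""
    present = [v for v in values if v is not None]
    return {
        "n": len(values),
        "n_missing": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "mean": mean(present),
        "median": median(present),
    }

def impute_median(values):
    """Replace None with the median and return a was-missing flag per row."""
    present = [v for v in values if v is not None]
    fill = median(present)
    imputed = [fill if v is None else v for v in values]
    was_missing = [1 if v is None else 0 for v in values]
    return imputed, was_missing

# Hypothetical derived variable with gaps and one suspicious extreme value.
avg_bill = [40.0, None, 55.0, 38.0, None, 900.0]

before = summarize(avg_bill)
imputed, was_missing = impute_median(avg_bill)
after = summarize(imputed)
```

Comparing `before` and `after` shows how many rows were filled, and a maximum far above the median (900.0 here) is the kind of value worth a manual random check.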
  • 11. Modeling Techniques • Use of SAS Enterprise Miner • Ensemble modeling outside of SAS • Base SAS modeling, e.g. for categorical target, survival analysis, etc.
  • 12. Modeling Techniques: Use of SAS Enterprise Miner. For initial/basic modeling, use Decision Tree or Regression. Neural networks can be used to provide diagnostic insights
  • 13. Modeling Techniques: Ensemble modeling in and out of SAS EM. Ensemble models based on the following models (with weightage): Model 1, Decision, 0.4; Model 2, Regression, 0.6; Model 3, Regression, 0.4
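A weighted ensemble of this kind can be sketched as a weighted average of the individual model scores. The weights below are the ones shown on the slide; since they sum to 1.4, this sketch divides by the sum of the weights so the combined score stays on the same 0 to 1 scale, which is an assumption about how the weights are meant to be applied. The customer scores are made up.

```python
def ensemble_score(scores, weights):
    """Weighted average of model scores, normalized by the weight total."""
    if len(scores) != len(weights):
        raise ValueError("need exactly one weight per model score")
    total = sum(weights)
    return sum(s * w for s, w in zip(scores, weights)) / total

# Weightage from the slide: Model 1 = 0.4, Model 2 = 0.6, Model 3 = 0.4.
weights = [0.4, 0.6, 0.4]

# Hypothetical predicted probabilities for one customer from the three models.
customer_scores = [0.80, 0.50, 0.30]

combined = ensemble_score(customer_scores, weights)
```

Because the calculation is a plain weighted average, it can be applied to scored output from any tool, which is what makes ensembling outside of SAS EM feasible.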
  • 14. Modeling Techniques: Base SAS modeling • Categorical data modeling, e.g. PROC CATMOD/GENMOD, PROC SURVEYLOGISTIC • Survival analysis: PROC LIFEREG, PROC LIFETEST, PROC PHREG. Base SAS modeling requires more familiarity with underlying statistical concepts
  • 15. Model Monitoring Techniques • Comparing actual vs predicted • Scored base analysis: variable distribution analysis, predicted score distribution
  • 16. Model Monitoring: Monitoring of model assessment charts. One chart measures what percentage of all churners are in the scoring list (e.g. top 10% of scores captured 40% of actual churners). Another compares the effectiveness of running a model versus selecting randomly. Other model assessment statistics can be computed, such as hit rate, Gini coefficient, etc.
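The two chart quantities and the hit rate can be computed directly from scored customers and their actual outcomes. The sketch below is an illustration under assumed names and made-up data: 20 customers, 5 actual churners, and a scoring list made of the top 10% of scores. It assumes at least one actual churner exists, otherwise the captured share is undefined.

```python
def top_fraction_stats(scores, actuals, fraction=0.10):
    """Captured share, hit rate and lift for the top-scoring fraction.

    actuals holds 1 for a churner and 0 otherwise.
    """
    ranked = sorted(zip(scores, actuals), key=lambda p: p[0], reverse=True)
    k = max(1, round(len(ranked) * fraction))
    hits = sum(a for _, a in ranked[:k])
    total_events = sum(actuals)
    base_rate = total_events / len(actuals)
    hit_rate = hits / k
    return {
        "list_size": k,
        "captured": hits / total_events,  # share of all churners in the list
        "hit_rate": hit_rate,             # share of the list that churned
        "lift": hit_rate / base_rate,     # model vs random selection
    }

# Hypothetical scored base: scores 1.00, 0.95, ..., 0.05.
scores = [i / 20 for i in range(20, 0, -1)]
actuals = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0, 1] + [0] * 9

stats = top_fraction_stats(scores, actuals, fraction=0.10)
```

With this data the top 10% of scores contains 2 of the 5 churners, so the captured share is 40%, and the lift of 4.0 means the list finds churners four times as often as random selection would.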
  • 17. Model Monitoring (cont’d): Scored base analysis, e.g. variable distribution analysis
  • 18. Model Monitoring (cont’d): Scored base analysis, e.g. predicted score distribution
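A simple way to monitor a distribution is to compare the share of the scored base falling into each score band at development time against the current scoring run. The sketch below also summarizes the shift with a population stability index (PSI). PSI is a commonly used summary that the slides do not mention, so treat it, the band edges, and the sample scores as assumptions. The same comparison can be applied to the bands of an input variable.

```python
import math

def band_shares(scores, edges):
    """Share of scores in each band; edges are ascending upper bounds."""
    counts = [0] * (len(edges) + 1)
    for s in scores:
        idx = sum(1 for e in edges if s > e)  # number of edges below the score
        counts[idx] += 1
    return [c / len(scores) for c in counts]

def psi(ref_shares, cur_shares, floor=1e-6):
    """Population stability index between two sets of band shares."""
    total = 0.0
    for r, c in zip(ref_shares, cur_shares):
        r, c = max(r, floor), max(c, floor)  # avoid log(0) for empty bands
        total += (c - r) * math.log(c / r)
    return total

edges = [0.25, 0.5, 0.75]

# Hypothetical predicted scores at model development and in the latest run.
reference = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
current = [0.1, 0.3, 0.4, 0.45, 0.6, 0.7, 0.8, 0.9]

ref_shares = band_shares(reference, edges)
cur_shares = band_shares(current, edges)
shift = psi(ref_shares, cur_shares)
```

Identical distributions give a PSI of 0, and larger values indicate that the scored population has moved away from the one the model was built on.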
  • 19. Predictive Analytics as an Iterative Process (cycle diagram): Defining Objectives → Data Preparation → Modeling → Deployment → Model Monitoring → back to Defining Objectives
  • 20. Questions?