SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Data Mining beyond
Adventure Works

Mark Tabladillo Ph.D.
http://marktab.net
October 3, 2009
Approach of this Presentation
• Emphasize
       – Conceptual value of data mining
       – Relationship of data mining to the real
         world
• Reserve
       – Specific procedures and mechanics
       – Specific mathematics
       – Production implementation


© 2009 Mark Tabladillo Ph.D.                       2
Outline
• Data Mining Fundamentals
• Interactive Demos
• Conclusion




© 2009 Mark Tabladillo Ph.D.   3
Interactive Demos
• Sports
• Government Forecasting




© 2009 Mark Tabladillo Ph.D.   4
Data Mining Definitions
• Data mining is the automatic or semi-
  automatic process of exploring data for
  meaningful or useful patterns.
• Data mining algorithms typically use
  estimation or optimization to achieve
  results (as opposed to only calculations).




© 2009 Mark Tabladillo Ph.D.                   5
Microsoft Data Mining
• Microsoft Data Mining refers to
  Microsoft’s specific implementation of
  certain common data mining algorithms for
  the DMX (Data Mining Extensions)
  language.
• Also called SQL Server Data Mining, the
  technology is integrated into SQL Server
  rather than presented as an independent
  application.

© 2009 Mark Tabladillo Ph.D.              6
Data Mining Tasks
• Supervised
       – Answer known, what is correlated?
• Unsupervised
       – Answer unknown (unspecified), what are the
         groups?
• Forecasting
       – Given a trend, what is next?        Value
                                             Slide




© 2009 Mark Tabladillo Ph.D.                          7
List the Data Mining Algorithms
• Ten Answers
• Each one is a field of academic focus




© 2009 Mark Tabladillo Ph.D.              8
The Data Mining Algorithms
•    Microsoft Naive Bayes
•    Microsoft Linear Regression
•    Microsoft Decision Trees
•    Microsoft Time Series
•    Microsoft Clustering
•    Microsoft Sequence Clustering
•    Microsoft Association Rules
•    Microsoft Neural Networks
•    Microsoft Logistic Regression
•    Text Mining
© 2009 Mark Tabladillo Ph.D.         9
The Analyze Tab


            Menu Option                     Data Mining Algorithm
            Analyze Key Influencers         Naïve Bayes
            Detect Categories               Clustering
            Fill from Example               Logistic Regression
            Forecast                        Time Series
            Highlight Exceptions            Clustering
            Scenario Analysis (Goal Seek)   Logistic Regression
            Scenario Analysis (What If)     Logistic Regression
            Prediction Calculator           Logistic Regression
            Shopping Basket Analysis        Association Rules
© 2009 Mark Tabladillo Ph.D.                                        10
Demo One:
National League Baseball
• Directions:
  You are on the management team for the
  Atlanta Braves. To better serve the team,
  you have been instructed by the owner to
  group the players by considering both their
  position and their salary.




© 2009 Mark Tabladillo Ph.D.                11
Demo One:
National League Baseball
• The following rules apply:
       – You must make more than one group
       – Each group must have at least two players
       – Players of different position may be in the
         same group




© 2009 Mark Tabladillo Ph.D.                           12
Demo One:
National League Baseball
• Individual attributes can be used to make
  groups
• Historical statistics can be used to group
  new players
• Both supervised and unsupervised
  algorithms can be applied to the same
  data



© 2009 Mark Tabladillo Ph.D.                   13
Demo Two:
Government Forecasting
• Directions:
  The President is asking your opinion on
  how the following numbers will increase
  over the next few months. Because this
  project is sensitive, you do not know what
  these numbers measure. However, based
  on the available history, make your best
  projection for the next six periods.


© 2009 Mark Tabladillo Ph.D.               14
Demo Two:
Government Forecasting
8



7



6



5



4



3



2



1



0
    Jan Feb Mar Apr May Jun       Jul   Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun           Jul   Aug Sep Oct Nov Dec
    2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008

© 2009 Mark Tabladillo Ph.D.                                                                                             15
Demo Two:
Government Forecasting
12




10




 8




 6




 4




 2




 0
     Sep Oct Nov Dec Jan Feb Mar Apr May Jun           Jul   Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun           Jul   Aug
     2007 2007 2007 2007 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2009 2009 2009 2009 2009 2009 2009 2009

© 2009 Mark Tabladillo Ph.D.                                                                                               16
Demo Two:
Government Forecasting
• Rapid response is as useful as prediction
• Seek intelligent correlations among related
  metrics
• Projections depend on time frame –
  modeling is continual




© 2009 Mark Tabladillo Ph.D.                17
Forecasting Algorithms
• Microsoft Time Series




                               Value
                               Slide




© 2009 Mark Tabladillo Ph.D.           18
Supervised Algorithms
•    Microsoft Naive Bayes
•    Microsoft Linear Regression
•    Microsoft Decision Trees
•    Microsoft Neural Networks
•    Microsoft Logistic Regression


                                     Value
                                     Slide




© 2009 Mark Tabladillo Ph.D.                 19
Unsupervised Algorithms
•    Microsoft Clustering
•    Microsoft Sequence Clustering
•    Microsoft Association Rules
•    Text Mining



                                     Value
                                     Slide




© 2009 Mark Tabladillo Ph.D.                 20
Resources
• MarkTab.NET
     Links, video resources and information for data mining

•    Data Mining with Microsoft SQL Server 2008
     by Jamie MacLennan (Author), ZhaoHui Tang (Author), Bogdan Crivat (Author)

•    Smart Business Intelligence Solutions with Microsoft® SQL Server® 2008
     (PRO-Developer)
     by Lynn Langit (Author), Matthew Roche (Author)




© 2009 Mark Tabladillo Ph.D.                                                  21
Regroup and Conclusion
• Main Points from this Presentation




© 2009 Mark Tabladillo Ph.D.           22
Contact Information
• Mark Tabladillo
  Twitter @marktabnet

• Also on:
  Linked In
  Facebook




© 2009 Mark Tabladillo Ph.D.   23
Bonus:
Sequence Clustering Ideas
•    Trading players in professional sports
•    Assigning players to certain positions
•    Moving from city to city
•    Store path at the mall
•    Cancer treatment path
•    Taking up a musical instrument
•    Taking up sports
•    Blogging
•    Viral news

© 2009 Mark Tabladillo Ph.D.                  24

Contenu connexe

Plus de Mark Tabladillo

Plus de Mark Tabladillo (20)

201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusML201906 03 Introduction to NimbusML
201906 03 Introduction to NimbusML
 
201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0201906 02 Introduction to AutoML with ML.NET 1.0
201906 02 Introduction to AutoML with ML.NET 1.0
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning
 
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
201905 Azure Certification DP-100: Designing and Implementing a Data Science ...
 
Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904Big Data Advanced Analytics on Microsoft Azure 201904
Big Data Advanced Analytics on Microsoft Azure 201904
 
Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904Managing Enterprise Data Science 201904
Managing Enterprise Data Science 201904
 
Training of Python scikit-learn models on Azure
Training of Python scikit-learn models on AzureTraining of Python scikit-learn models on Azure
Training of Python scikit-learn models on Azure
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft Azure
 
Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808Advanced Analytics with Power BI 201808
Advanced Analytics with Power BI 201808
 
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
Microsoft Cognitive Toolkit (Atlanta Code Camp 2017)
 
Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017
 
Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612Microsoft Technologies for Data Science 201612
Microsoft Technologies for Data Science 201612
 
How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610How Big Companies plan to use Our Big Data 201610
How Big Companies plan to use Our Big Data 201610
 
Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016Georgia Tech Data Science Hackathon September 2016
Georgia Tech Data Science Hackathon September 2016
 
Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608Microsoft Data Science Technologies 201608
Microsoft Data Science Technologies 201608
 
Insider's guide to azure machine learning 201606
Insider's guide to azure machine learning 201606Insider's guide to azure machine learning 201606
Insider's guide to azure machine learning 201606
 
Window functions for Data Science
Window functions for Data ScienceWindow functions for Data Science
Window functions for Data Science
 
Microsoft Technologies for Data Science 201601
Microsoft Technologies for Data Science 201601Microsoft Technologies for Data Science 201601
Microsoft Technologies for Data Science 201601
 
Microsoft Data Science Technologies: Back Office Edition
Microsoft Data Science Technologies: Back Office EditionMicrosoft Data Science Technologies: Back Office Edition
Microsoft Data Science Technologies: Back Office Edition
 
Microsoft Data Science Technologies 201510
Microsoft Data Science Technologies 201510Microsoft Data Science Technologies 201510
Microsoft Data Science Technologies 201510
 

Dernier

The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
daisycvs
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
dlhescort
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
dlhescort
 
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
dollysharma2066
 
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
lizamodels9
 
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Anamikakaur10
 
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
dlhescort
 
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
Sheetaleventcompany
 
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
lizamodels9
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
amitlee9823
 

Dernier (20)

RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors Data
 
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
 
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Falcon's Invoice Discounting: Your Path to Prosperity
Falcon's Invoice Discounting: Your Path to ProsperityFalcon's Invoice Discounting: Your Path to Prosperity
Falcon's Invoice Discounting: Your Path to Prosperity
 
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
Call Girls In Majnu Ka Tilla 959961~3876 Shot 2000 Night 8000
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
 
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60% in 6 Months
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60%  in 6 MonthsSEO Case Study: How I Increased SEO Traffic & Ranking by 50-60%  in 6 Months
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60% in 6 Months
 
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Majnu Ka Tilla, Delhi Contact Us 8377877756
 
Business Model Canvas (BMC)- A new venture concept
Business Model Canvas (BMC)-  A new venture conceptBusiness Model Canvas (BMC)-  A new venture concept
Business Model Canvas (BMC)- A new venture concept
 
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Nelamangala Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
 
PHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation FinalPHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation Final
 
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
Call Now ☎️🔝 9332606886🔝 Call Girls ❤ Service In Bhilwara Female Escorts Serv...
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 
Organizational Transformation Lead with Culture
Organizational Transformation Lead with CultureOrganizational Transformation Lead with Culture
Organizational Transformation Lead with Culture
 
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
 
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
Chandigarh Escorts Service 📞8868886958📞 Just📲 Call Nihal Chandigarh Call Girl...
 
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
Russian Call Girls In Rajiv Chowk Gurgaon ❤️8448577510 ⊹Best Escorts Service ...
 
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
Call Girls Jp Nagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bang...
 

Data Mining Beyond Adventure Works (Redmond WA 10/3/2009)

  • 1. Data Mining beyond Adventure Works Mark Tabladillo Ph.D. http://marktab.net October 3, 2009
  • 2. Approach of this Presentation • Emphasize – Conceptual value of data mining – Relationship of data mining to the real world • Reserve – Specific procedures and mechanics – Specific mathematics – Production implementation © 2009 Mark Tabladillo Ph.D. 2
  • 3. Outline • Data Mining Fundamentals • Interactive Demos • Conclusion © 2009 Mark Tabladillo Ph.D. 3
  • 4. Interactive Demos • Sports • Government Forecasting © 2009 Mark Tabladillo Ph.D. 4
  • 5. Data Mining Definitions • Data mining is the automatic or semi- automatic process of exploring data for meaningful or useful patterns. • Data mining algorithms typically use estimation or optimization to achieve results (as opposed to only calculations). © 2009 Mark Tabladillo Ph.D. 5
  • 6. Microsoft Data Mining • Microsoft Data Mining refers to Microsoft’s specific implementation of certain common data mining algorithms for the DMX (Data Mining Extensions) language. • Also called SQL Server Data Mining, the technology is integrated into SQL Server rather than presented as an independent application. © 2009 Mark Tabladillo Ph.D. 6
  • 7. Data Mining Tasks • Supervised – Answer known, what is correlated? • Unsupervised – Answer unknown (unspecified), what are the groups? • Forecasting – Given a trend, what is next? Value Slide © 2009 Mark Tabladillo Ph.D. 7
  • 8. List the Data Mining Algorithms • Ten Answers • Each one is a field of academic focus © 2009 Mark Tabladillo Ph.D. 8
  • 9. The Data Mining Algorithms • Microsoft Naive Bayes • Microsoft Linear Regression • Microsoft Decision Trees • Microsoft Time Series • Microsoft Clustering • Microsoft Sequence Clustering • Microsoft Association Rules • Microsoft Neural Networks • Microsoft Logistic Regression • Text Mining © 2009 Mark Tabladillo Ph.D. 9
  • 10. The Analyze Tab Menu Option Data Mining Algorithm Analyze Key Influencers Naïve Bayes Detect Categories Clustering Fill from Example Logistic Regression Forecast Time Series Highlight Exceptions Clustering Scenario Analysis (Goal Seek) Logistic Regression Scenario Analysis (What If) Logistic Regression Prediction Calculator Logistic Regression Shopping Basket Analysis Association Rules © 2009 Mark Tabladillo Ph.D. 10
  • 11. Demo One: National League Baseball • Directions: You are on the management team for the Atlanta Braves. To better serve the team, you have been instructed by the owner to group the players by considering both their position and their salary. © 2009 Mark Tabladillo Ph.D. 11
  • 12. Demo One: National League Baseball • The following rules apply: – You must make more than one group – Each group must have at least two players – Players of different position may be in the same group © 2009 Mark Tabladillo Ph.D. 12
  • 13. Demo One: National League Baseball • Individual attributes can be used to make groups • Historical statistics can be used to group new players • Both supervised and unsupervised algorithms can be applied to the same data © 2009 Mark Tabladillo Ph.D. 13
  • 14. Demo Two: Government Forecasting • Directions: The President is asking your opinion on how the following numbers will increase over the next few months. Because this project is sensitive, you do not know what these numbers measure. However, based on the available history, make your best projection for the next six periods. © 2009 Mark Tabladillo Ph.D. 14
  • 15. Demo Two: Government Forecasting 8 7 6 5 4 3 2 1 0 Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 © 2009 Mark Tabladillo Ph.D. 15
  • 16. Demo Two: Government Forecasting 12 10 8 6 4 2 0 Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug 2007 2007 2007 2007 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2008 2009 2009 2009 2009 2009 2009 2009 2009 © 2009 Mark Tabladillo Ph.D. 16
  • 17. Demo Two: Government Forecasting • Rapid response is as useful as prediction • Seek intelligent correlations among related metrics • Projections depend on time frame – modeling is continual © 2009 Mark Tabladillo Ph.D. 17
  • 18. Forecasting Algorithms • Microsoft Time Series Value Slide © 2009 Mark Tabladillo Ph.D. 18
  • 19. Supervised Algorithms • Microsoft Naive Bayes • Microsoft Linear Regression • Microsoft Decision Trees • Microsoft Neural Networks • Microsoft Logistic Regression Value Slide © 2009 Mark Tabladillo Ph.D. 19
  • 20. Unsupervised Algorithms • Microsoft Clustering • Microsoft Sequence Clustering • Microsoft Association Rules • Text Mining Value Slide © 2009 Mark Tabladillo Ph.D. 20
  • 21. Resources • MarkTab.NET Links, video resources and information for data mining • Data Mining with Microsoft SQL Server 2008 by Jamie MacLennan (Author), ZhaoHui Tang (Author), Bogdan Crivat (Author) • Smart Business Intelligence Solutions with Microsoft® SQL Server® 2008 (PRO-Developer) by Lynn Langit (Author), Matthew Roche (Author) © 2009 Mark Tabladillo Ph.D. 21
  • 22. Regroup and Conclusion • Main Points from this Presentation © 2009 Mark Tabladillo Ph.D. 22
  • 23. Contact Information • Mark Tabladillo Twitter @marktabnet • Also on: Linked In Facebook © 2009 Mark Tabladillo Ph.D. 23
  • 24. Bonus: Sequence Clustering Ideas • Trading players in professional sports • Assigning players to certain positions • Moving from city to city • Store path at the mall • Cancer treatment path • Taking up a musical instrument • Taking up sports • Blogging • Viral news © 2009 Mark Tabladillo Ph.D. 24