Enterprise Data Mining with SQL Server

        Mark Tabladillo Ph.D.
        Microsoft MVP
        MarkTab Consulting


        March 21, 2012
About Mark Tabladillo

    • 20 Years in Atlanta, Georgia
    • Consulting since 1998; Incorporated 2003
      – Part-Time Faculty at University of Phoenix
    • SAS and Microsoft Expert
      – Presenter since 1998 at conferences like Microsoft
        TechEd and SAS Global Forum
    • Taught statistics at undergraduate and graduate level
    • Blog: http://marktab.net    @MarkTabNet



Enterprise:
Leaders of Leaders of Leaders
Enterprise Challenge
“Data Mining”
Definitions

Phrase              Goal
“Data Mining”       Inform actionable decisions
“Machine Learning”  Determine best performing algorithm
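
The machine learning goal above, determining the best performing algorithm, can be sketched as a comparison of candidate models on held-out data. This is a minimal illustration only: the holdout rows, the three candidate rules, and the accuracy metric are all assumptions made for the example and do not come from the presentation or from Analysis Services.

```python
# Hypothetical holdout rows as (feature, label) pairs, invented for illustration.
holdout = [(0, 0), (1, 0), (2, 0), (3, 1), (4, 0), (5, 1), (6, 1), (7, 1)]


def accuracy(model, rows):
    """Fraction of rows where the model's prediction matches the label."""
    hits = sum(1 for x, label in rows if model(x) == label)
    return hits / len(rows)


# Hypothetical candidate "algorithms": simple rules standing in for real models.
candidates = {
    "always_one": lambda x: 1,
    "threshold_at_4": lambda x: int(x >= 4),
    "threshold_at_3": lambda x: int(x >= 3),
}

# Score every candidate on the same holdout rows, then keep the top scorer.
scores = {name: accuracy(model, holdout) for name, model in candidates.items()}
best = max(scores, key=scores.get)

print(scores)
print("best:", best)
```

Data mining, by the definition above, would go one step further and ask whether the winning model's output informs a decision someone can act on.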
Data Mining > Just Drilldown

Query     Typical Result
T-SQL     Exact values and calculations
MDX       Exact values and calculations
DAX       Exact values and calculations
DMX       Values plus probabilities
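
The contrast in the table above can be sketched in a few lines. The example below is written in Python rather than in any of the listed query languages, and the purchase rows and function names are hypothetical. The first function returns an exact calculation, the kind of result the first three languages produce. The second returns a value together with a probability, which is the shape of result the table attributes to DMX. The probability here is a plain frequency estimate, assumed for illustration, and not the output of an Analysis Services mining algorithm.

```python
from collections import Counter

# Hypothetical purchase history, invented for illustration.
purchases = [
    {"customer": "A", "category": "Audio", "amount": 120.0},
    {"customer": "B", "category": "Audio", "amount": 80.0},
    {"customer": "C", "category": "Cameras", "amount": 300.0},
    {"customer": "D", "category": "Audio", "amount": 50.0},
    {"customer": "E", "category": "Computers", "amount": 900.0},
]


def exact_total(rows, category):
    """Drilldown-style answer: an exact sum over the matching rows."""
    return sum(r["amount"] for r in rows if r["category"] == category)


def predict_category(rows):
    """Mining-style answer: the most likely value plus its estimated probability."""
    counts = Counter(r["category"] for r in rows)
    value, hits = counts.most_common(1)[0]
    return value, hits / len(rows)


print(exact_total(purchases, "Audio"))
print(predict_category(purchases))
```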
SQL Server 2008 R2: Physical and Logical
OLAP Engine Physical Architecture
• http://msdn.microsoft.com/en-us/library/ms174776.aspx
Analysis Services Logical Architecture
• http://msdn.microsoft.com/en-us/library/ms174587.aspx
Outline

• Contoso Retail and Fundamentals
• Enterprise-Level Data Mining Demo for
  SQL Server
• What is my next step?
What is Contoso Retail?

• Demonstration dataset for SQL Server
  Database Engine and Analysis Services
• http://www.microsoft.com/downloads/en/details.aspx?displaylang=en&FamilyID=868662dc-187a-4a85-b611-b7df7dc909fc
What are the fundamentals?

• Reading (‘Readin’)
• Writing (‘Ritin’)
• Arithmetic (‘Rithmetic)
What Enterprise Tools support Data
Mining?

• SQL Server Management Studio (SSMS)
• Business Intelligence Development Studio
  (BIDS)
  – SQL Server Integration Services (SSIS)
• PowerShell version 2
[Diagram: Data Mining, supported by SSMS, SSIS, and PowerShell]
[Figure sequence: a variable on a scale from 0 to 7, shown as Discrete, Continuous, and Discretized (two variants)]
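
The difference between a continuous variable and a discretized one can be sketched with equal-width bucketing over the 0 to 7 scale used in the slides. This is an assumption-laden illustration: the readings, the function name, and the choice of equal-width buckets are invented here, and Analysis Services provides its own discretization methods that may behave differently. Running the same values through two bucket counts is one possible reading of the two "Discretized" rows.

```python
def equal_width_bucket(value, low, high, buckets):
    """Return the 0-based index of the equal-width bucket containing value."""
    if not low <= value <= high:
        raise ValueError("value outside the [low, high] range")
    width = (high - low) / buckets
    index = int((value - low) / width)
    # The upper bound itself belongs to the last bucket.
    return min(index, buckets - 1)


# Hypothetical continuous readings on the 0-7 scale.
readings = [0.0, 1.5, 3.2, 4.9, 7.0]

two_buckets = [equal_width_bucket(v, 0, 7, 2) for v in readings]
four_buckets = [equal_width_bucket(v, 0, 7, 4) for v in readings]

print(two_buckets)   # coarse discretization
print(four_buckets)  # finer discretization
```

Fewer buckets give a coarser view of the same data; more buckets keep more of the original ordering at the cost of fewer cases per bucket.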
Documentation

• Data Mining Structures
 – http://msdn.microsoft.com/en-us/library/cc645741.aspx
 – http://msdn.microsoft.com/en-us/library/ms174757.aspx
• Data Mining Models
 – http://msdn.microsoft.com/en-us/library/cc645779.aspx
Contoso Retail:
Enterprise Data Mining

   Demonstration
What is my next step?

• SQL Server 2008 R2 Enterprise
  (includes database engine, Analysis Services,
  SSMS and BIDS)
 – http://www.microsoft.com/sqlserver/2008/en/us/trial-software.aspx
• Microsoft Office 2010 Professional
 – http://office.microsoft.com/en-us/try
• PowerShell 2.0
 – http://support.microsoft.com/kb/968929
• Data Mining Portal and Blog
 – http://www.marktab.net
Conclusion

  • Data mining leaders can tackle enterprise
    data mining challenges with
    – SQL Server Management Studio
    – Business Intelligence Development Studio
    – PowerShell version 2
  • Become leaders of leaders of leaders
Where Can I Find More Information?

•   http://marktab.net Data Mining Resource
•   http://marktab.net/datamining Data Mining Blog
•   http://sqlserverdatamining.com SQL Server Data Mining
•   http://technet.microsoft.com Microsoft’s TechNet
Graphics

• Ship graphics Copyright © 1995-2006 Nova Development
  and its licensors. All rights reserved. Used with
  permission.
Abstract

     This presentation introduces SQL Server Data Mining (SSDM) for SQL
     Server Professionals based on the speaker's past presentation for
     Microsoft TechEd. Starting with SQL Server Management Studio
     (SSMS), the demo includes the interfaces important for professional
     development, including Business Intelligence Development Studio
     (BIDS), highlighting Integration Services, and PowerShell. The
     interactive demos are based on Microsoft's Contoso Retail sample
     data. Finally, we will evaluate where Microsoft data mining can help you
     in a practical business environment, which may include Oracle and
     SAS.

     Online Video:
     http://channel9.msdn.com/Events/TechEd/NorthAmerica/2011/DBI326


Thank You to our Sponsors

24 Hours of PASS -- Enterprise Data Mining with SQL Server
