Machine Learning Smackdown
@LynnLangit
Agenda
• Definitions
• On premise solutions
  ▪ 3rd party Excel Machine Learning Add-ins
  ▪ Microsoft SQL Server Data Mining Add-ins
  ▪ R Studio
• Cloud solutions
  ▪ Predixion Software
  ▪ Azure Machine Learning
Analytics Defined
• Business Analytics - deterministic
  ▪ Query
  ▪ Aggregation
• Predictive Analytics - probabilistic
  ▪ Machine Learning
  ▪ Statistics
  ▪ Unsupervised Data Mining
  ▪ Supervised Data Mining
  ▪ Other
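To make the deterministic/probabilistic split concrete, here is a minimal sketch in base R. The `sales` data frame and its columns are invented for illustration and are not from the deck; the logistic regression is just one possible predictive model.

```r
# Hypothetical data, invented for illustration only
sales <- data.frame(
  region = c("East", "East", "West", "West", "East", "West", "East", "West"),
  amount = c(50, 80, 100, 120, 150, 200, 90, 160),
  repeat_buyer = c(0, 0, 1, 0, 1, 1, 1, 0)
)

# Business analytics: query + aggregation gives one exact answer per group
totals <- aggregate(amount ~ region, data = sales, FUN = sum)
print(totals)

# Predictive analytics: a fitted model returns an estimated probability
fit <- glm(repeat_buyer ~ amount, data = sales, family = binomial)
p <- predict(fit, newdata = data.frame(amount = 130), type = "response")
print(p)  # a value strictly between 0 and 1, not a certainty
```

Rerunning the aggregation on the same rows always returns the same totals, while the model's output is an estimate that changes when the training data or the chosen model changes.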
Machine Learning Roles Defined
• Data Scientist: Store, Clean, Aggregate
• ML Engineer: Selects Libraries, Applies Algorithms, Creates Solutions
• ML Researcher: Creates Algorithms
Algorithms by Example
• Segment – Cluster
  ▪ Example: Marketing
  ▪ Best Customer Traits
• Forecast – Time Series
  ▪ Example: Logistics
  ▪ Product movement over time
• Classify/Estimate – Predict
  ▪ Example: Medical
  ▪ Predict condition likelihood
• Associate – Market Basket
  ▪ Example: Retail
  ▪ Show these items nearby
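Three of the four algorithm families above can be sketched with base R alone. All data below is invented for illustration. The specific techniques (k-means, simple exponential smoothing, pairwise co-occurrence counts) are assumptions chosen for brevity, not necessarily what the tools in this deck use. Classification is left out to keep the sketch short.

```r
set.seed(42)  # kmeans picks random starting centers

# Segment (cluster): group customers by spend and visit frequency
customers <- data.frame(
  spend  = c(20, 25, 22, 210, 190, 205),
  visits = c(1, 2, 1, 12, 10, 11)
)
segments <- kmeans(customers, centers = 2, nstart = 5)
print(segments$cluster)

# Forecast (time series): exponential smoothing of monthly units moved
units <- ts(c(100, 104, 109, 113, 118, 121, 127, 131), frequency = 12)
smoothed <- HoltWinters(units, beta = FALSE, gamma = FALSE)
forecast <- predict(smoothed, n.ahead = 2)
print(forecast)

# Associate (market basket): count how often item pairs share a basket
baskets <- list(
  c("bread", "butter"),
  c("bread", "butter", "jam"),
  c("bread", "jam"),
  c("milk", "butter")
)
items <- sort(unique(unlist(baskets)))
# One row per basket, one column per item, 1 if the item is present
incidence <- t(sapply(baskets, function(b) as.integer(items %in% b)))
colnames(incidence) <- items
together <- crossprod(incidence)  # items x items co-occurrence counts
print(together)
```

In `together`, the diagonal holds how many baskets contain each item and the off-diagonal cells hold pair counts, which is the raw material for "show these items nearby" decisions.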
ML Developer Learning Path Defined
• Pick your IDE
• Learn an ML language
• Pick a problem space
• Get Data
• Visualize results
• Process and ITERATE
What is the R Language?
R Language Semantics
▪ search() and ls() # list attached packages and objects in scope
▪ ?mean # shows the help page for mean()
▪ Vectors (numeric, logical, character), lists, NULLs
▪ Data Frame, Matrix (same types), Factors (Categorical)
▪ meanx <- mean(x) or meanx = mean(x) # assignment
▪ x[1] # extracts a piece; x[1] <- 9 # changes it
▪ print(x) or x # prints x
▪ plot(x) # graphs x
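The items on this slide can be tried in a short session. This is a minimal sketch using base R only; the variable names and values are made up for illustration.

```r
x <- c(4, 8, 15, 16, 23, 42)   # numeric vector
meanx <- mean(x)               # assignment with <-
meany = mean(x)                # = also assigns at the top level
first <- x[1]                  # extract the first element (indexing starts at 1)
x[1] <- 9                      # replace the first element
flags <- x > 10                # logical vector
words <- c("a", "b")           # character vector
sizes <- factor(c("small", "large", "small"))  # categorical data
df <- data.frame(id = 1:3, size = sizes)       # columns may differ in type
m <- matrix(1:6, nrow = 2)     # every cell of a matrix shares one type
stuff <- list(x, words, NULL)  # lists can mix types, including NULL

print(x)    # same output as typing x at the prompt
str(df)     # compact view of a data frame's structure
ls()        # objects defined in the current environment
search()    # attached packages and environments
plot(x)     # basic scatter plot of x against its index
```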
3rd party Excel Machine Learning Add-ins
• XLMiner
• StatsMiner
• XLStat
• RExcel
Important: All of these tools assume expert statistical knowledge
Add-in Example: XLMiner
Data Mining Add-ins For Excel
Table Analysis Tools for Excel
• Use mining models with Excel data or external data
Data Mining Client for Excel
• Create/test/explore/manage Mining Models
Data Mining Templates for Visio
• Render/share mining models as Visio Drawings
Important: Use requires connection to SQL Server 2012 SSAS
Data Mining Add-ins for Excel
Data Mining Structures 
Containers 
• Cleansed source data 
One+ SSAS Algorithm(s) 
• Clustering 
• Time Series Prediction 
• Market-Basket Analysis 
• Text Mining 
• Neural Networks 
Models 
• Query 
• Model processing
Predixion Software
Predixion Software
Suite of tools for predictive analytics
Insight Now
• Use mining models with Excel data or external data
Insight Analytics
• Create/test/explore/manage Mining Models
Insight Workbench
• Prepare data for model creation
Web-based Viewers and Tools
• HTML 5
Important: Runs EITHER connected to SSAS on premise OR connected to Predixion’s cloud-based servers
Azure ML
Azure Machine Learning
• Cloud-based SaaS service
• Create ML Experiments using Datasets
• Can publish results as Web Services
• Expects knowledge of statistics and data mining
Understanding options…
Add-in | Server Required | Complexity of install | Other | Cost of Add-in | Cost of Solution
XLMiner | none | easy | Assumes stats expertise | $$ | $$
RExcel | none | easy | Assumes R expertise | $ | $
Data Mining Add-ins | SQL Server SSAS | medium | Designed for single user | 0 | $$$
Predixion on premise | SQL Express | easy | Requires local R install | 0 | $$-$$$
Predixion on premise | SQL Server SSAS | medium | Your data is stored locally | 0 | $$$$
Predixion cloud | none | easy | Supports SSAS Data Mining AND R Language | 0 | $$-$$$
Azure Machine Learning | none | easy | Rich set of algorithms and supports R | n/a | unknown
@LynnLangit

Machine Learning on the Microsoft Stack


Editor's notes

  1. Use the VM named – Predixion Win 7x64 playground
  2. Image - http://www.masternewmedia.org/the-google-panda-guide-part-2-machine-learning-and-the-new-mindset/
  3. Concept from -- http://lh3.ggpht.com/-hBUE4diPT28/UvYcixQpDSI/AAAAAAAAC8s/-zf_IPyvDcs/ML%252520Skills%252520Pyramid%252520v1.0.png
  4. http://www.predixionsoftware.com/Product/Supported-Algorithms#class
  5. http://www.r-project.org/ --- use demo(persp) & demo(smooth)
  6. http://www.r-project.org/ --- use demo(persp) & demo(smooth). A bit more: to see all installed packages, run library(); to download a package, run install.packages("fortunes"); to use a package, attach it to a session with require(fortunes)
  7. Rexcel - http://rcom.univie.ac.at/
  8. http://www.microsoft.com/en-us/download/details.aspx?id=35578
  9. http://blogs.msdn.com/b/data__knowledge__intelligence/archive/2013/05/21/data-mining-plugin-for-excel-2013.aspx
  10. http://technet.microsoft.com/en-us/library/ms174757.aspx
  11. http://www.microsoft.com/en-us/download/details.aspx?id=35578
  12. R-Excel -- http://rcom.univie.ac.at/
  13. Lynn