SlideShare a Scribd company logo
1 of 37
11
Shawn Scully - VP of Customer Success & Applications
scully@dato.com @backwoodsbrains
Intelligent Applications with
Machine Learning Toolkits
Within 5 years, every innovative application
will be intelligent.
33
Intelligent applications create tremendous value
…but take a lot of time & specialized skills to build.
Recommenders
Lead Scoring
Churn Prediction
Multi-channel Targeting
Auto-Summarization
Fraud detection
Intrusion Detection
Demand Forecasting
Data Matching
Failure Prediction
Our mission is to
Accelerate innovators to create intelligent
applications with agile machine learning.
Needs of an Agile ML Platform
5
Dato
Predictive Services
GraphLab Create
rapid development
deploy as microservice
live serving,
monitoring,
& model management
iterate
w/feedback
A toolkit view of the world
77
Algorithms vs. toolkits
SVD++ w/SGD
vs.SVD
Recommender
• item similarity
• SVD++
• iALS
• factorization machine
• many more!
• PhD students care a lot about these!
• many papers focused on “my curve is better
than your curve”
• Not always the most practical…
• Grouped by a common task
• Focused on meaningful differences in data &
problem
• Practical implementations
8
import graphlab as gl
data = gl.SFrame.read_csv('my_data.csv')
model = gl.recommender.create(
data,
user_id='user',
item_id='movie’,
target='rating')
recommendations = model.recommend(k=5)
cluster = gl.deploy.load(‘s3://path’)
cluster.add(‘servicename’, model)
Easily create a live machine learning service
Create a Recommender
5 lines of code
Toolkit w/auto selection
Deploy in minutes
99
Dato Machine Learning Toolkits
Applications
• recommender
• sentiment_analysis
• similarity_search
• churn_predictor
• data_matching
• lead_scoring
• clickthrough_predictor
Fundamentals
• regression
• classifier
• nearest_neighbors
• clustering
• deeplearning
• anomaly_detection
• pattern_mining
• text_analytics
• graph_analytics
Utilities
• model_parameter_search
• cross_validation
• evaluation
• comparison
• feature_engineering
https://dato.com/products/create/docs/graphlab.toolkits.html
50+ models including factorization machines,
convolutional neural nets, label propagation, &
topic models all in one framework!
Toolkit: Recommender
10
1111
Examples of Recommenders
Recommend
12
Value:
• Increase user engagement
• Sell more/increase clickthrough
• Create better user experiences
Goal: Find or recommend similar or related items.
1313
Recommend - Data + Toolkit
user_id item_id item_name
103 1 ‘Empire Strikes Back’
102 2 ‘Wrath of Khan’
104 3 ‘Sleepless in Seattle’
102 4 ‘Rambo’
104 5 ‘Chocolate’
103 6 ‘The Avengers’
102 1 ‘Empire Strikes Back’
104 1 ‘Empire Strikes Back’
103 4 ‘Rambo’
104 7 ‘When Harry Met Sally’
102 2 ‘Wrath of Khan’
104 8 ‘Up’
recommender
graphlab.recommender.create
Toolkit: Sentiment Analysis & Product Sentiment
1515
Examples of sentiment scoring & summarization
Sentiment Analysis & Product Sentiment
16
Value:
• Quantitative measures from unstructured text
• Eliminate the need to read everything
• Summarize on aspects you care about
Goal: Score sentiment of a sentence, document, or aspect.
1717
Sentiment scoring- Data + Toolkit
sentiment_analysis
graphlab.sentiment_analysis.create
graphlab.product_sentiment.create
Toolkit: Similarity Search
1919
Examples of image search & tagging
Image Search & Tagging
20
Value:
• create more intuitive user experiences
• learn interesting things like style
• reduce manual processes (like tagging)
Goal: Find visually similar images.
2121
Image search - Data + Toolkit
similarity_search
graphlab.data_matching.similarity_search.create
Toolkit: Churn Predictor
Churn Prediction
23
Value:
• Keep your customers
• Optimize marketing/customer success spend
• Identify issues with product or business
Goal: Identify users that are likely to stop doing something
(e.g. paying for your service, using a product feature, etc.)
Confidential - GraphLab internal use only
Problem setup
Period 1
Period 2
Period 3
Features Target
Hold out set
Goal: model that predicts if a user does not appear in Period 2
Evaluation: score for (app, user) pairs absent in Period 3 Machine
learning
model
Evaluation
Data Transformations
25
Time Unique
pairs
app user time etc app user feature
1
feature
2
Features:
● time since last use
● time since first use
● # unique days user has used app
● # times user used app in last delta days
● Rolling aggregates
● etc
Aggregate to generate predictive featuresopens
2626
Predict Churn - Data + Toolkit
user_id event datetimestamp
103 play ‘01-01-15’
102 click ’02-05-15’
102 visit ‘03-06-15’
102 visit ’03-09-15’
103 purchase ’03-21-15’
103 click ’03-22-15’
102 click ’03-23-15’
103 click ’04-02-15’
103 play ‘04-01-15’
103 purchase ’05-02-15’
103 play ‘05-01-15’
103 play ’05-15-15’
churn_predictor
graphlab.churn_predictor.create
Toolkit: Data Matching
27
2828
Examples of data matching
record= {‘SSN’:None,
‘Name’:’Smith, Will’
‘Sex’:’Male’,
‘ZIP;:94701}
Data Matching
29
Value:
• Deduplicate contacts/records
• “360 view” of customer across multiple properties
• Improve data quality
Goal: Identify entities & appropriately link records.
3030
Data matching – Data + Toolkit
data_matching
graphlab.deduplication.create
graphlab.record_linker.create
More than 50,000 developers are using Dato
31
3232
Tools built for innovators
The Agile Machine Learning Platform
Dato Confidential - Do not Distribute
34
Agility to create machine learning services
GraphLab Create
Application Toolkits:
• Auto-select the best algorithm
• Auto-prepare the data for ML
• Task-oriented methods
Data Layer for ML
• Manipulate all-relevant data types
• Out-of-core design eliminates scale pains
Robust Enterprise-Grade Algorithms
• 50+ of best-practice & novel algorithms
• Robust to real-world data
3535
Dato Predictive ServicesReal-time Recommendations
Online Ad Scoring & Serving
Transactional Fraud detection
Agility to deploy – Microservices on AWS, premises, Yarn
How will you make your enterprise intelligent?
37
Thanks!
get the software!: https://www.dato.com/download/
platform overview: https://dato.com/products/
talk about ML at your company: scully@dato.com
Toolkits:
overview:https://dato.com/products/create/docs/graphlab.toolkits.html
recommender: https://dato.com/products/create/docs/graphlab.toolkits.recommender.html
churn_predictor: https://dato.com/products/create/docs/graphlab.toolkits.churn_predictor.html
similarity_search: https://dato.com/products/create/docs/graphlab.toolkits.data_matching.html#similarity-search-model
sentiment_analysis: https://dato.com/products/create/docs/graphlab.toolkits.sentiment_analysis.html
data_matching: https://dato.com/products/create/docs/graphlab.toolkits.data_matching.html

More Related Content

What's hot

Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Turi, Inc.
 
Promoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices EnvironmentPromoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices EnvironmentPyData
 
Question Answering and Virtual Assistants with Deep Learning
Question Answering and Virtual Assistants with Deep LearningQuestion Answering and Virtual Assistants with Deep Learning
Question Answering and Virtual Assistants with Deep LearningLucidworks
 
The More the Merrier: Scaling Model Building Infrastructure at Zendesk
The More the Merrier: Scaling Model Building Infrastructure at ZendeskThe More the Merrier: Scaling Model Building Infrastructure at Zendesk
The More the Merrier: Scaling Model Building Infrastructure at ZendeskDatabricks
 
How Graph Technology is Changing AI
How Graph Technology is Changing AIHow Graph Technology is Changing AI
How Graph Technology is Changing AIDatabricks
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTKAshish Jaiman
 
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital OneUsing H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital OneSri Ambati
 
Intro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSIntro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSSri Ambati
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsTuri, Inc.
 
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...Databricks
 
SoC Keynote:The State of the Art in Integration Technology
SoC Keynote:The State of the Art in Integration TechnologySoC Keynote:The State of the Art in Integration Technology
SoC Keynote:The State of the Art in Integration TechnologySrinath Perera
 
Data Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OData Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OSri Ambati
 
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...Databricks
 
Fast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkFast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkBas Geerdink
 
Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing VideoTuri, Inc.
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jTransforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jFred Madrid
 
Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Faisal Siddiqi
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersDatabricks
 
Role of Analytics in Digital Business
Role of Analytics in Digital BusinessRole of Analytics in Digital Business
Role of Analytics in Digital BusinessSrinath Perera
 
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSpark Summit
 

What's hot (20)

Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Promoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices EnvironmentPromoting a Data Driven Culture in a Microservices Environment
Promoting a Data Driven Culture in a Microservices Environment
 
Question Answering and Virtual Assistants with Deep Learning
Question Answering and Virtual Assistants with Deep LearningQuestion Answering and Virtual Assistants with Deep Learning
Question Answering and Virtual Assistants with Deep Learning
 
The More the Merrier: Scaling Model Building Infrastructure at Zendesk
The More the Merrier: Scaling Model Building Infrastructure at ZendeskThe More the Merrier: Scaling Model Building Infrastructure at Zendesk
The More the Merrier: Scaling Model Building Infrastructure at Zendesk
 
How Graph Technology is Changing AI
How Graph Technology is Changing AIHow Graph Technology is Changing AI
How Graph Technology is Changing AI
 
Deep Learning with CNTK
Deep Learning with CNTKDeep Learning with CNTK
Deep Learning with CNTK
 
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital OneUsing H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One
Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One
 
Intro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSIntro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWS
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
 
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...
 
SoC Keynote:The State of the Art in Integration Technology
SoC Keynote:The State of the Art in Integration TechnologySoC Keynote:The State of the Art in Integration Technology
SoC Keynote:The State of the Art in Integration Technology
 
Data Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2OData Science, Machine Learning, and H2O
Data Science, Machine Learning, and H2O
 
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
 
Fast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with SparkFast Data Intelligence in the IoT - real-time data analytics with Spark
Fast Data Intelligence in the IoT - real-time data analytics with Spark
 
Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing Video
 
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4jTransforming AI with Graphs: Real World Examples using Spark and Neo4j
Transforming AI with Graphs: Real World Examples using Spark and Neo4j
 
Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data Clusters
 
Role of Analytics in Digital Business
Role of Analytics in Digital BusinessRole of Analytics in Digital Business
Role of Analytics in Digital Business
 
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
 

Viewers also liked

Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine LearningTuri, Inc.
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Turi, Inc.
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesTuri, Inc.
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in productionTuri, Inc.
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligenceu053675
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataTuri, Inc.
 
Cassandra synergy
Cassandra synergyCassandra synergy
Cassandra synergyniallmilton
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Turi, Inc.
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinTuri, Inc.
 
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesThe Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesAlice Zheng
 
Big data analytics and building intelligent applications
Big data analytics and building intelligent applicationsBig data analytics and building intelligent applications
Big data analytics and building intelligent applicationsFlytxt
 
Applications of Machine Learning
Applications of Machine LearningApplications of Machine Learning
Applications of Machine LearningHayim Makabee
 
Machine Learning and Applications
Machine Learning and ApplicationsMachine Learning and Applications
Machine Learning and ApplicationsGeeta Arora
 
An Introduction to Blockchain
An Introduction to BlockchainAn Introduction to Blockchain
An Introduction to BlockchainThomvest Ventures
 
Blockchain technology powerpoint
Blockchain technology powerpointBlockchain technology powerpoint
Blockchain technology powerpointRadius Anesthesia
 
Applications of Machine Learning at USC
Applications of Machine Learning at USCApplications of Machine Learning at USC
Applications of Machine Learning at USCSri Ambati
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep LearningDavid Rostcheck
 

Viewers also liked (20)

Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine Learning
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive Services
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log Data
 
Cassandra synergy
Cassandra synergyCassandra synergy
Cassandra synergy
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos Guestrin
 
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesThe Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
 
Big data analytics and building intelligent applications
Big data analytics and building intelligent applicationsBig data analytics and building intelligent applications
Big data analytics and building intelligent applications
 
Applications of Machine Learning
Applications of Machine LearningApplications of Machine Learning
Applications of Machine Learning
 
Intelligent web applications
Intelligent web applicationsIntelligent web applications
Intelligent web applications
 
Deep Learning Lightning Talk
Deep Learning Lightning TalkDeep Learning Lightning Talk
Deep Learning Lightning Talk
 
Introduction to Blockchain
Introduction to BlockchainIntroduction to Blockchain
Introduction to Blockchain
 
Machine Learning and Applications
Machine Learning and ApplicationsMachine Learning and Applications
Machine Learning and Applications
 
An Introduction to Blockchain
An Introduction to BlockchainAn Introduction to Blockchain
An Introduction to Blockchain
 
Blockchain technology powerpoint
Blockchain technology powerpointBlockchain technology powerpoint
Blockchain technology powerpoint
 
Applications of Machine Learning at USC
Applications of Machine Learning at USCApplications of Machine Learning at USC
Applications of Machine Learning at USC
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 

Similar to Intelligent Applications with Machine Learning Toolkits

LoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus
 
How to Uncover Big Growth Opportunities with Data
How to Uncover Big Growth Opportunities with DataHow to Uncover Big Growth Opportunities with Data
How to Uncover Big Growth Opportunities with DataLooker
 
Growth Hacking with Data: How to Find Big Growth with Deep Data Dives
Growth Hacking with Data: How to Find Big Growth with Deep Data DivesGrowth Hacking with Data: How to Find Big Growth with Deep Data Dives
Growth Hacking with Data: How to Find Big Growth with Deep Data DivesSean Ellis
 
Creating a Single Source of Truth: Leverage all of your data with powerful an...
Creating a Single Source of Truth: Leverage all of your data with powerful an...Creating a Single Source of Truth: Leverage all of your data with powerful an...
Creating a Single Source of Truth: Leverage all of your data with powerful an...Looker
 
GAIBT Cleveland - Personalizer.pptx
GAIBT Cleveland - Personalizer.pptxGAIBT Cleveland - Personalizer.pptx
GAIBT Cleveland - Personalizer.pptxLuis Beltran
 
Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemVMware Tanzu
 
Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)ShehryarSH1
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubCloudera, Inc.
 
Salesforce - Overview & Getting Started
Salesforce - Overview & Getting StartedSalesforce - Overview & Getting Started
Salesforce - Overview & Getting StartedKatia Bonella
 
How Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryHow Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryNeo4j
 
Snowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWSSnowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWSGiuseppe Gaviani
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discoverymarkgrover
 
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee InsightsDeep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee InsightsApigee | Google Cloud
 
Azure ml and dynamics 365
Azure ml and dynamics 365Azure ml and dynamics 365
Azure ml and dynamics 365Jivtesh Singh
 
Vlad Micu “20 things succesful game developers do beyond making games”
Vlad Micu “20 things succesful game developers do beyond making games”Vlad Micu “20 things succesful game developers do beyond making games”
Vlad Micu “20 things succesful game developers do beyond making games”Lviv Startup Club
 
WebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design WorkshopWebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design WorkshopLeo Frishberg
 

Similar to Intelligent Applications with Machine Learning Toolkits (20)

BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
BIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECTBIGDATA & HADOOP PROJECT
BIGDATA & HADOOP PROJECT
 
LoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BI
 
How to Uncover Big Growth Opportunities with Data
How to Uncover Big Growth Opportunities with DataHow to Uncover Big Growth Opportunities with Data
How to Uncover Big Growth Opportunities with Data
 
Growth Hacking with Data: How to Find Big Growth with Deep Data Dives
Growth Hacking with Data: How to Find Big Growth with Deep Data DivesGrowth Hacking with Data: How to Find Big Growth with Deep Data Dives
Growth Hacking with Data: How to Find Big Growth with Deep Data Dives
 
Creating a Single Source of Truth: Leverage all of your data with powerful an...
Creating a Single Source of Truth: Leverage all of your data with powerful an...Creating a Single Source of Truth: Leverage all of your data with powerful an...
Creating a Single Source of Truth: Leverage all of your data with powerful an...
 
GAIBT Cleveland - Personalizer.pptx
GAIBT Cleveland - Personalizer.pptxGAIBT Cleveland - Personalizer.pptx
GAIBT Cleveland - Personalizer.pptx
 
Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation System
 
Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)Artificial Intelligence (ML - DL)
Artificial Intelligence (ML - DL)
 
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data HubEnable Advanced Analytics with Hadoop and an Enterprise Data Hub
Enable Advanced Analytics with Hadoop and an Enterprise Data Hub
 
Salesforce - Overview & Getting Started
Salesforce - Overview & Getting StartedSalesforce - Overview & Getting Started
Salesforce - Overview & Getting Started
 
How Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryHow Lyft Drives Data Discovery
How Lyft Drives Data Discovery
 
Snowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWSSnowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWS
 
Disrupting Data Discovery
Disrupting Data DiscoveryDisrupting Data Discovery
Disrupting Data Discovery
 
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee InsightsDeep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee Insights
 
Azure ml and dynamics 365
Azure ml and dynamics 365Azure ml and dynamics 365
Azure ml and dynamics 365
 
Vlad Micu “20 things succesful game developers do beyond making games”
Vlad Micu “20 things succesful game developers do beyond making games”Vlad Micu “20 things succesful game developers do beyond making games”
Vlad Micu “20 things succesful game developers do beyond making games”
 
WebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design WorkshopWebVisions2016 Presumptive Design Workshop
WebVisions2016 Presumptive Design Workshop
 
AI & AWS DeepComposer
AI & AWS DeepComposerAI & AWS DeepComposer
AI & AWS DeepComposer
 

More from Turi, Inc.

Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission RiskTuri, Inc.
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Turi, Inc.
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data scienceTuri, Inc.
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Turi, Inc.
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender SystemsTuri, Inc.
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringTuri, Inc.
 
Towards a Comprehensive Machine Learning Benchmark
Towards a Comprehensive Machine Learning BenchmarkTowards a Comprehensive Machine Learning Benchmark
Towards a Comprehensive Machine Learning BenchmarkTuri, Inc.
 
New Capabilities in the PyData Ecosystem
New Capabilities in the PyData EcosystemNew Capabilities in the PyData Ecosystem
New Capabilities in the PyData EcosystemTuri, Inc.
 
Anomaly Detection Using Isolation Forests
Anomaly Detection Using Isolation ForestsAnomaly Detection Using Isolation Forests
Anomaly Detection Using Isolation ForestsTuri, Inc.
 
Data! Data! Data! I Can't Make Bricks Without Clay!
Data! Data! Data! I Can't Make Bricks Without Clay!Data! Data! Data! I Can't Make Bricks Without Clay!
Data! Data! Data! I Can't Make Bricks Without Clay!Turi, Inc.
 
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...Turi, Inc.
 
Pandas & Cloudera: Scaling the Python Data Experience
Pandas & Cloudera: Scaling the Python Data ExperiencePandas & Cloudera: Scaling the Python Data Experience
Pandas & Cloudera: Scaling the Python Data ExperienceTuri, Inc.
 
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Turi, Inc.
 
Deep Learning in a Dumpster
Deep Learning in a DumpsterDeep Learning in a Dumpster
Deep Learning in a DumpsterTuri, Inc.
 
Visualization for Discovery
Visualization for DiscoveryVisualization for Discovery
Visualization for DiscoveryTuri, Inc.
 

More from Turi, Inc. (17)

Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission Risk
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data science
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender Systems
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature Engineering
 
SFrame
SFrameSFrame
SFrame
 
Towards a Comprehensive Machine Learning Benchmark
Towards a Comprehensive Machine Learning BenchmarkTowards a Comprehensive Machine Learning Benchmark
Towards a Comprehensive Machine Learning Benchmark
 
Dato Keynote
Dato KeynoteDato Keynote
Dato Keynote
 
New Capabilities in the PyData Ecosystem
New Capabilities in the PyData EcosystemNew Capabilities in the PyData Ecosystem
New Capabilities in the PyData Ecosystem
 
Anomaly Detection Using Isolation Forests
Anomaly Detection Using Isolation ForestsAnomaly Detection Using Isolation Forests
Anomaly Detection Using Isolation Forests
 
Data! Data! Data! I Can't Make Bricks Without Clay!
Data! Data! Data! I Can't Make Bricks Without Clay!Data! Data! Data! I Can't Make Bricks Without Clay!
Data! Data! Data! I Can't Make Bricks Without Clay!
 
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...
Declarative Machine Learning: Bring your own Syntax, Algorithm, Data and Infr...
 
Pandas & Cloudera: Scaling the Python Data Experience
Pandas & Cloudera: Scaling the Python Data ExperiencePandas & Cloudera: Scaling the Python Data Experience
Pandas & Cloudera: Scaling the Python Data Experience
 
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark
 
Deep Learning in a Dumpster
Deep Learning in a DumpsterDeep Learning in a Dumpster
Deep Learning in a Dumpster
 
Visualization for Discovery
Visualization for DiscoveryVisualization for Discovery
Visualization for Discovery
 

Recently uploaded

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 

Recently uploaded (20)

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 

Intelligent Applications with Machine Learning Toolkits

  • 1. 11 Shawn Scully - VP of Customer Success & Applications scully@dato.com @backwoodsbrains Intelligent Applications with Machine Learning Toolkits
  • 2. Within 5 years, every innovative application will be intelligent.
  • 3. 33 Intelligent applications create tremendous value …but take a lot of time & specialized skills to build. Recommenders Lead Scoring Churn Prediction Multi-channel Targeting Auto-Summarization Fraud detection Intrusion Detection Demand Forecasting Data Matching Failure Prediction
  • 4. Our mission is to Accelerate innovators to create intelligent applications with agile machine learning.
  • 5. Needs of an Agile ML Platform 5 Dato Predictive Services GraphLab Create rapid development deploy as microservice live serving, monitoring, & model management iterate w/feedback
  • 6. A toolkit view of the world
  • 7. 77 Algorithms vs. toolkits SVD++ w/SGD vs.SVD Recommender • item similarity • SVD++ • iALS • factorization machine • many more! • PhD students care a lot about these! • many papers focused on “my curve is better than your curve” • Not always the most practical… • Grouped by a common task • Focused on meaningful differences in data & problem • Practical implementations
  • 8. 8 import graphlab as gl data = gl.SFrame.read_csv('my_data.csv') model = gl.recommender.create( data, user_id='user', item_id='movie’, target='rating') recommendations = model.recommend(k=5) cluster = gl.deploy.load(‘s3://path’) cluster.add(‘servicename’, model) Easily create a live machine learning service Create a Recommender 5 lines of code Toolkit w/auto selection Deploy in minutes
  • 9. 99 Dato Machine Learning Toolkits Applications • recommender • sentiment_analysis • similarity_search • churn_predictor • data_matching • lead_scoring • clickthrough_predictor Fundamentals • regression • classifier • nearest_neighbors • clustering • deeplearning • anomaly_detection • pattern_mining • text_analytics • graph_analytics Utilities • model_parameter_search • cross_validation • evaluation • comparison • feature_engineering https://dato.com/products/create/docs/graphlab.toolkits.html 50+ models including factorization machines, convolutional neural nets, label propagation, & topic models all in one framework!
  • 12. Recommend 12 Value: • Increase user engagement • Sell more/increase clickthrough • Create better user experiences Goal: Find or recommend similar or related items.
  • 13. 1313 Recommend - Data + Toolkit user_id item_id item_name 103 1 ‘Empire Strikes Back’ 102 2 ‘Wrath of Khan’ 104 3 ‘Sleepless in Seattle’ 102 4 ‘Rambo’ 104 5 ‘Chocolate’ 103 6 ‘The Avengers’ 102 1 ‘Empire Strikes Back’ 104 1 ‘Empire Strikes Back’ 103 4 ‘Rambo’ 104 7 ‘When Harry Met Sally’ 102 2 ‘Wrath of Khan’ 104 8 ‘Up’ recommender graphlab.recommender.create
  • 14. Toolkit: Sentiment Analysis & Product Sentiment
  • 15. 1515 Examples of sentiment scoring & summarization
  • 16. Sentiment Analysis & Product Sentiment 16 Value: • Quantitative measures from unstructured text • Eliminate the need to read everything • Summarize on aspects you care about Goal: Score sentiment of a sentence, document, or aspect.
  • 17. 1717 Sentiment scoring- Data + Toolkit sentiment_analysis graphlab.sentiment_analysis.create graphlab.product_sentiment.create
  • 19. 1919 Examples of image search & tagging
  • 20. Image Search & Tagging 20 Value: • create more intuitive user experiences • learn interesting things like style • reduce manual processes (like tagging) Goal: Find visually similar images.
  • 21. 2121 Image search - Data + Toolkit similarity_search graphlab.data_matching.similarity_search.create
  • 23. Churn Prediction 23 Value: • Keep your customers • Optimize marketing/customer success spend • Identify issues with product or business Goal: Identify users that are likely to stop doing something (e.g. paying for your service, using a product feature, etc.)
  • 24. Confidential - GraphLab internal use only Problem setup Period 1 Period 2 Period 3 Features Target Hold out set Goal: model that predicts if a user does not appear in Period 2 Evaluation: score for (app, user) pairs absent in Period 3 Machine learning model Evaluation
  • 25. Data Transformations 25 Time Unique pairs app user time etc app user feature 1 feature 2 Features: ● time since last use ● time since first use ● # unique days user has used app ● # times user used app in last delta days ● Rolling aggregates ● etc Aggregate to generate predictive featuresopens
  • 26. 2626 Predict Churn - Data + Toolkit user_id event datetimestamp 103 play ‘01-01-15’ 102 click ’02-05-15’ 102 visit ‘03-06-15’ 102 visit ’03-09-15’ 103 purchase ’03-21-15’ 103 click ’03-22-15’ 102 click ’03-23-15’ 103 click ’04-02-15’ 103 play ‘04-01-15’ 103 purchase ’05-02-15’ 103 play ‘05-01-15’ 103 play ’05-15-15’ churn_predictor graphlab.churn_predictor.create
  • 28. 2828 Examples of data matching record= {‘SSN’:None, ‘Name’:’Smith, Will’ ‘Sex’:’Male’, ‘ZIP;:94701}
  • 29. Data Matching 29 Value: • Deduplicate contacts/records • “360 view” of customer across multiple properties • Improve data quality Goal: Identify entities & appropriately link records.
  • 30. 3030 Data matching – Data + Toolkit data_matching graphlab.deduplication.create graphlab.record_linker.create
  • 31. More than 50,000 developers are using Dato 31
  • 32. 3232
  • 33. Tools built for innovators The Agile Machine Learning Platform Dato Confidential - Do not Distribute
  • 34. 34 Agility to create machine learning services GraphLab Create Application Toolkits: • Auto-select the best algorithm • Auto-prepare the data for ML • Task-oriented methods Data Layer for ML • Manipulate all-relevant data types • Out-of-core design eliminates scale pains Robust Enterprise-Grade Algorithms • 50+ of best-practice & novel algorithms • Robust to real-world data
  • 35. 3535 Dato Predictive ServicesReal-time Recommendations Online Ad Scoring & Serving Transactional Fraud detection Agility to deploy – Microservices on AWS, premises, Yarn
  • 36. How will you make your enterprise intelligent?
  • 37. 37 Thanks! get the software!: https://www.dato.com/download/ platform overview: https://dato.com/products/ talk about ML at your company: scully@dato.com Toolkits: overview:https://dato.com/products/create/docs/graphlab.toolkits.html recommender: https://dato.com/products/create/docs/graphlab.toolkits.recommender.html churn_predictor: https://dato.com/products/create/docs/graphlab.toolkits.churn_predictor.html similarity_search: https://dato.com/products/create/docs/graphlab.toolkits.data_matching.html#similarity-search-model sentiment_analysis: https://dato.com/products/create/docs/graphlab.toolkits.sentiment_analysis.html data_matching: https://dato.com/products/create/docs/graphlab.toolkits.data_matching.html

Editor's Notes

  1. Empower businesses not about create, stay competitive, destroy,
  2. Empower businesses not about create, stay competitive, destroy,
  3. Innovators want… Have I convinced you that we are right for you? Why not?