SlideShare une entreprise Scribd logo
1  sur  43
Télécharger pour lire hors ligne
1st edition | July 8-11, 2019
BigML, Inc #DutchMLSchool
Anatomy of an ML Application
Machine Learning End-to-End
Poul Petersen
CIO, BigML
2
BigML, Inc #DutchMLSchool
Examples of ML Applications
3
BigML, Inc #DutchMLSchool
Real-world ML Applications
4
• Should you sign that NDA?
• Upload the NDA to the website
• The service uses Machine Learning to decide if the terms are fair
https://ndalynn.com/
BigML, Inc #DutchMLSchool
Real-world ML Applications
5
• Gathers over 500 features about companies:
• Crunchbase / Tweets / Patents / LinkedIn / etc.
• Creates a label for success/failure:
• IPO or acquisition = success
• Bankruptcy or irrelevance = failure
• Uses Machine Learning to build a model that predicts the success
or failure of startups
• And puts all of the information together into an investor dashboard
https://preseries.com
BigML, Inc #DutchMLSchool
ML Adoption
6
"The gap for most
companies isn’t that
machine learning
doesn’t work, but that
they struggle to actually
use it”
• Why?
• Too much focus on algorithms
• Not enough focus on applying Machine
BigML, Inc #DutchMLSchool
Real-world ML Applications
7
https://thepointsguy.com/news/this-is-the-reason-you-arent-feeling-as-much-turbulence-on-delta-flights/
…collecting and
analyzing “hundreds
of thousands of data
points,” with a plan
to boost that to
“millions,” creating a
model that forecasts
turbulence with a
level of confidence
heretofore unseen.
Not Important: the algorithm!
BigML, Inc #DutchMLSchool
Machine Learning Evolution
8
Genesis
Custom built
Product Service
Utility
Academics &
Researchers
Scientists
Developers
Analysts
Everyone
1950s
2000s 2011
2030
Commodity
2020
Ubiquity
CertaintyUnknown Defined
NovelCommon
Weka, Scikit
BigML, Azure
ML, Amazon
ML, Google
Cloud ML1st
Workshop on
Machine Learning
1980
1980
• Machine Learning algorithms are fun to talk about: GPUs, NNs, etc
• But the algorithms are largely a commodity already
• Difficulty is knowing how to apply ML
BigML, Inc #DutchMLSchool
What is an ML Application
9
AIRLINE ORIGIN DESTINATION
DEPARTURE
DELAY
DISTANCE
ARRIVAL
DELAY
AS ANC SEA -11 1448,0 -22
AA LAX PBI -8 2330,0 -9
US SFO CLT -2 2296,0 5
AA LAX MIA -5 2342,0 -9
AS SEA ANC -1 1448,0 -21
DL SFO MSP -5 1589 8
NK LAS MSP -6 1299 -17
US LAX CLT 14 2125,0 -10
AA SFO DFW -11 1464,0 -13
DL LAS ATL 3 1747,0 -15
Finding patterns in data that can be used to
make inferences…
Predictive Models
Consider: ML Definition
BigML, Inc #DutchMLSchool
What is an ML Application
10
AIRLINE ORIGIN DESTINATION
DEPARTURE
DELAY
DISTANCE
ARRIVAL
DELAY
AS ANC SEA -11 1448,0 -22
AA LAX PBI -8 2330,0 -9
US SFO CLT -2 2296,0 5
AA LAX MIA -5 2342,0 -9
AS SEA ANC -1 1448,0 -21
DL SFO MSP -5 1589 8
NK LAS MSP -6 1299 -17
US LAX CLT 14 2125,0 -10
AA SFO DFW -11 1464,0 -13
DL LAS ATL 3 1747,0 -15
Predictive Models
• Where does this data come from?
• How do you know what data?
• Is the data formatted correctly?
• What do you do with these models?
• How do you combine them?
• Will it work?
BigML, Inc #DutchMLSchool
Reality of a ML Application
11
Data
Transformations
Feature
Engineering
Data
Collection
Evaluation
& Retraining
Seen
Unseen
Predictive App
BigML, Inc #DutchMLSchool
Where to Start?
12
Step
1
Finish
Step
2
- - - - - - - -
???
“Let’s predict 

customer churn!”
“Here are the
customers we predict
will leave our service”
BigML, Inc #DutchMLSchool
Where to Start?
13
Step
1
Finish
Step
2
- - - - - - - -
???
“Let’s detect 

fraud!
“Here are the
transactions we should
stop immediately.
BigML, Inc #DutchMLSchool
ML Application Guide
14
• Remember: ML finds patterns in data enabling predictions about
future events
• This means you need data
• What data depends on what you want to predict
• And the data you have or can collect
• Data needs to have patterns related to what you want to predict
• Not magic: still can’t predict random events, lotteries, etc
• Your problem statement needs to be specific
• Not “Let’s predict churn”
• But “Let’s predict churn by looking at the profile data of all
previous customers of our service who have/have not
churned”
• This can be tricky…
State the problem as an ML Task
BigML, Inc #DutchMLSchool
Where to Start?
15
Step
1
Finish
“Let’s predict 

the Oscars!”
“Here are the 

predicted winners”
Step
2
- - - - - - - -
???
• Statement is not specific enough!!!
• What data can we collect that predicts Oscar wins?
BigML, Inc #DutchMLSchool
Predicting the Oscars
16
• 6 out of 6 right!
• 8 out of 8 actually, but
probability of the predictions
was “too low”
• Adapted Screenplay
• Original Screenplay
BigML Scoresheet
2018
• 4 our of 8 major awards
correctly predicted
• Probabilities were lower this
year
• This is still significantly
better than guessing
2019
How is this possible? Isn't the winner random?
BigML, Inc #DutchMLSchool
How an Oscar is Won
17
voting

intention?
7,000+ members
Insight: winning awards is not a random event!
BigML, Inc #DutchMLSchool
Let’s Predict Best Picture
18
Win
London
Critics
Lose
Writers
Guild
Win
Directors
Guild
Win
Golden
Win
Bafta
• These events are *not* independent
• Similar, but not identical, factors contribute to
each win…
• We can expect a higher probability for Shape of
Water to win
Oscar
?Win?
BigML, Inc #DutchMLSchool
The Features
19
MOVIES AWARDS OBJECTIVE
• year
• movie
• movie_id
• certificate
• duration
• genre
• rate
• metascore
• synopsis
• votes
• gross
• release_date
• user_reviews
• critic_reviews
• popularity
• awards_wins
• awards_nomination
s
• release_date.year
• release_date.mont
h
• release_date.day-
of-month
• release_date.day-
of-week
• Oscar_Best_Picture_nominated
• Oscar_Best_Director_nominated
• Oscar_Best_Actor_nominated
• Oscar_Best_Actress_nominated
• Oscar_Best_Supporting_Actor_nominated
• Oscar_Best_Supporting_Actress_nominated
• Oscar_Best_AdaScreen_nominated
• Oscar_Best_OriScreen_nominated
• Oscar_nominated
• Oscar_nominated_categories
• Golden_Globes_won
• Golden_Globes_won_categories
• Golden_Globes_nominated
• Golden_Globes_nominated_categories
• BAFTA_won
• BAFTA_won_categories
• BAFTA_nominated
• BAFTA_nominated_categories
• Screen_Actors_Guild_won
• Screen_Actors_Guild_won_categories
• Screen_Actors_Guild_nominated
• Screen_Actors_Guild_nominated_categories
• Critics_Choice_won
• Critics_Choice_won_categories
• Critics_Choice_nominated
• Critics_Choice_nominated_categories
• Directors_Guild_won
• Directors_Guild_won_categories
• Directors_Guild_nominated
• Directors_Guild_nominated_categories
• Producers_Guild_won
• Producers_Guild_won_categories
• Producers_Guild_nominated
• Producers_Guild_nominated_categories
• Art_Directors_Guild_won
• Art_Directors_Guild_won_categories
• Art_Directors_Guild_nominated
• Art_Directors_Guild_nominated_categories
• Writers_Guild_won
• Writers_Guild_won_categories
• Writers_Guild_nominated
• Writers_Guild_nominated_categories
• Costume_Designers_Guild_won
• Costume_Designers_Guild_won_categories
• Costume_Designers_Guild_nominated
• Costume_Designers_Guild_nominated_categories
• Online_Film_Television_Association_won
• Online_Film_Television_Association_won_categories
• Online_Film_Television_Association_nominated
• Online_Film_Television_Association_nominated_catego
ries
• Online_Film_Critics_Society_won
• Online_Film_Critics_Society_won_categories
• Online_Film_Critics_Society_nominated
• Online_Film_Critics_Society_nominated_categories
• People_Choice_won
• People_Choice_won_categories
• People_Choice_nominated
• People_Choice_nominated_categories
• London_Critics_Circle_Film_won
• London_Critics_Circle_Film_won_categories
• London_Critics_Circle_Film_nominated
• London_Critics_Circle_Film_nominated_categories
• American_Cinema_Editors_won
• American_Cinema_Editors_won_categories
• American_Cinema_Editors_nominated
• American_Cinema_Editors_nominated_categories
• Hollywood_Film_won
• Hollywood_Film_won_categories
• Hollywood_Film_nominated
• Hollywood_Film_nominated_categories
• Austin_Film_Critics_Association_won
• Austin_Film_Critics_Association_won_categories
• Austin_Film_Critics_Association_nominated
• Austin_Film_Critics_Association_nominated_categories
• Denver_Film_Critics_Society_won
• Denver_Film_Critics_Society_won_categories
• Denver_Film_Critics_Society_nominated
• Denver_Film_Critics_Society_nominated_categories
• Boston_Society_of_Film_Critics_won
• Boston_Society_of_Film_Critics_won_categories
• Boston_Society_of_Film_Critics_nominated
• Boston_Society_of_Film_Critics_nominated_categories
• New_York_Film_Critics_Circle_won
• Oscar_Best_Picture_wo
n
• Oscar_Best_Director_w
on
• Oscar_Best_Actor_won
• Oscar_Best_Actress_wo
n
• Oscar_Best_Supporting
_Actor_won
• Oscar_Best_Supporting
_Actress_won
Data pulled from IMDB…
Engineered Features:
Award items field

Nomination Counts

Awards Counts
BigML, Inc #DutchMLSchool
Oscars Dataset
20
DATASET is publicly available: 

https://bigml.com/user/academy_awards/gallery/dataset/
5a94302592fb565ed400103b
BigML, Inc #DutchMLSchool
Oscars Example
21
• When specifying the problem, be as specific as possible
• Not: “Let’s predict the Oscars”
• Instead: “Let’s Predict the Oscars by correlating a series
of award wins with the final Oscar win.”
• The statement of the problem will guide the data required
• Be aware of the cost of collecting the data versus the ROI:
Tidbits and Lessons Learned….
BigML, Inc #DutchMLSchool
Ranking ML Applications
22
FEASIBILITY
(incdataavailability/deccomplexity)
ROI
(impact and cost)
-
+
+
NO-BRAINERS

START HERE
NO-GO
POSTPONABLE
BRAINERS
Thinking about an ML Application?
BigML, Inc #DutchMLSchool
Oscars Example
23
• When specifying the problem, be as specific as possible
• Not: “Let’s predict the Oscars”
• Instead: “Let’s Predict the Oscars by correlating a series
of award wins with the final Oscar win.”
• The statement of the problem will guide the data required
• Be aware of the cost of collecting the data versus the ROI:
• IMDB data is readily availble
• We’re done right?
• Nope. You can’t escape Feature Engineering
• Items: BAFTA_won_categories = list of nominations
• Aggregations: Nomination and Award counts
• You can’t escape Feature Selection
• Full user reviews costly to collect and not useful
Tidbits and Lessons Learned….
Wait: How were you confident in the predictions?
BigML, Inc #DutchMLSchool
2013

2016
119 variables
Evaluating the Model
24
119 variables
2000

2016 119 variables
2000

2012Original Dataset
Test Dataset
Train Dataset
• Ultimately, we want to use all the history to predict the winner
for the current year
• In order to evaluate success, we use a model built from
2000-2012 data to predict the winners for 2013-2016
• Built a separate Deepnet for each award category
• Evaluation obtained a ROC AUC over 0.98 across all award
categories
Great: The model seems OK, what next?
BigML, Inc #DutchMLSchool
Effort of a ML Application
25
State the problem as an ML task
Data wrangling
Feature engineering
Modeling and Evaluations
Predictions
Measure Results
Data transformations ~80% effort
~5% effort
~5% effort
This is only such low
effort because of
platforms like
This is an area where
is currently
innovating
Task
~10% effort
Effort
BigML, Inc #DutchMLSchool
Reality Check
26
• All Machine Learned models are wrong
• Real-world Machine Learning is iterative
• End-to-end Machine Learning is compositional
Three Important Concepts in Applying ML…
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
27
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
BigML, Inc #DutchMLSchool
Basic Workflow
28
SOURCE DATASET MODEL PREDICTION
BigML, Inc #DutchMLSchool
Feature Engineering
29
MODEL
FILTERSOLD HOMES
BATCH
PREDICTION
NEW FEATURES
DATASET DEALS
DATASET
FILTERFORSALE HOMES NEW FEATURES
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
30
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
• Each “step” is often multi-stage as well
• Filtering/Cleaning data
BigML, Inc #DutchMLSchool
Anomaly Filter and Evaluate
31
DIABETES
SOURCE
DIABETES
DATASET
TRAIN SET
TEST SET
ALL
MODEL
CLEAN
DATASET
FILTER
ALL
MODEL
ALL
EVALUATION
CLEAN
EVALUATION
COMPARE
EVALUATIONS
ANAOMALY
DETECTOR
BigML, Inc #DutchMLSchool
Fixing Missing Values
32
Fix Missing Values in a “Meaningful” Way
Filter Zeros
Model 

insulin
Predict 

insulin
Select 

insulin
Fixed

Dataset
Amended

Dataset
Original

Dataset
Clean

Dataset
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
33
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
• Each “step” is often multi-stage as well
• Filtering/Cleaning data
• Tuning a model for optimum performance
BigML, Inc #DutchMLSchool
Ensemble Tuning
34
ENSEMBLE
N=20
EVALUATION
SOURCE DATASET
TRAINING
TEST
EVALUATIONEVALUATION
ENSEMBLE
N=10
ENSEMBLE
N=1000
CHOOSE
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
35
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
• Each “step” is often multi-stage as well
• Filtering/Cleaning data
• Tuning a model for optimum performance
• Finding the best features
BigML, Inc #DutchMLSchool
Best-first Feature Selection
36
{F1}
CHOOSE BEST
S = {Fa}
{F2} {F3} {F4} Fn
S+{F1} S+{F2} S+{F3} S+{F4} S+{Fn-1}
CHOOSE BEST
S = {Fa, Fb}
S+{F1} S+{F2} S+{F3} S+{F4} S+{Fn-1}
CHOOSE BEST
S = {Fa, Fb, Fc}
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
37
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
• Each “step” is often multi-stage as well
• Filtering/Cleaning data
• Tuning a model for optimum performance
• Finding the best features
• May require models for several domains of knowledge
• Multiple Training / Scoring
BigML, Inc #DutchMLSchool
AGGREGATED
BY CARD
AGGREGATED
BY USER
AGGREGATED
BY PROFILE
Multiple Domains
38
TRANSACTIONS
ANOMALY

BY CARD
ANOMALY

BY USER
ANOMALY

BY PROFILE
ANOMALY

SCORE
ANOMALY

SCORE
ANOMALY

SCORE
NEW TRANSACTION
APPROVED?
BigML, Inc #DutchMLSchool
End-to-end ML is Compositional
39
• Real-world problems
• Solved by applying a combination of algorithms
• Very rarely is it one-and-done
• Each “step” is often multi-stage as well
• Filtering/Cleaning data
• Tuning a model for optimum performance
• Finding the best features
• May require models for several domains of knowledge
• Multiple Training / Scoring
• Even after deploying a model
• Workflow to monitor performance, know when to retrain
BigML, Inc #DutchMLSchool
Model Retraining
40
TRAINING
INPUT DATA
PREDICTIONS
ANOMALY

SCORES
OUTCOMES
RETRAIN DATA
BigML, Inc #DutchMLSchool
Reality Check
41
• All Machine Learned models are wrong
Three Important Concepts in Applying ML…
• Real-world Machine Learning is iterative
• End-to-end Machine Learning is compositional
BigML, Inc #DutchMLSchool
• Better features always beat better algorithms
• Good algorithms already exist and are good enough
• Tools like OptiML exist which can help optimize performance
• The data is never good enough
Tenets of Machine Learning
42
• All Machine Learned models are wrong
• Real-world Machine Learning is iterative
• End-to-end Machine Learning is compositional
• Automation is better than hand tuning - you need an API!
• When data changes quickly, training speed is more
important than accuracy
• Repeatability is superior to a single strong result
• Problems are solved with workflows of algorithms
• A ML solution is not real until it is in production
• ML is here: Now we need 100,000x people applying ML
, but some are useful
Co-organized by: Sponsor:
Business Partners:

Contenu connexe

Tendances

DutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveDutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveBigML, Inc
 
DutchMLSchool. Your first BigML Project
DutchMLSchool. Your first BigML ProjectDutchMLSchool. Your first BigML Project
DutchMLSchool. Your first BigML ProjectBigML, Inc
 
DutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML PlatformDutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML PlatformBigML, Inc
 
MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles BigML, Inc
 
DutchMLSchool. Associations and Topic Models
DutchMLSchool. Associations and Topic ModelsDutchMLSchool. Associations and Topic Models
DutchMLSchool. Associations and Topic ModelsBigML, Inc
 
DutchMLSchool. Opening Remarks
DutchMLSchool. Opening RemarksDutchMLSchool. Opening Remarks
DutchMLSchool. Opening RemarksBigML, Inc
 
MLSEV. Cluster Analysis and Anomaly Detection
MLSEV. Cluster Analysis and Anomaly DetectionMLSEV. Cluster Analysis and Anomaly Detection
MLSEV. Cluster Analysis and Anomaly DetectionBigML, Inc
 
DutchMLSchool. ML for Logistics
DutchMLSchool. ML for LogisticsDutchMLSchool. ML for Logistics
DutchMLSchool. ML for LogisticsBigML, Inc
 
Square's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong YanSquare's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong YanHakka Labs
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLPaco Nathan
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AIBill Liu
 
Interpretable Machine Learning
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine LearningSri Ambati
 
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016MLconf
 
Building Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemMLBuilding Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemMLsparktc
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AIGary Allemann
 
Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective Saurabh Kaushik
 
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...Raheel Ahmad
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1Sara Hooker
 
Explainable AI
Explainable AIExplainable AI
Explainable AIDinesh V
 

Tendances (20)

DutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveDutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical Perspective
 
DutchMLSchool. Your first BigML Project
DutchMLSchool. Your first BigML ProjectDutchMLSchool. Your first BigML Project
DutchMLSchool. Your first BigML Project
 
DutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML PlatformDutchMLSchool. Introduction to Machine Learning with the BigML Platform
DutchMLSchool. Introduction to Machine Learning with the BigML Platform
 
MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles MLSEV. Models, Evaluations and Ensembles
MLSEV. Models, Evaluations and Ensembles
 
DutchMLSchool. Associations and Topic Models
DutchMLSchool. Associations and Topic ModelsDutchMLSchool. Associations and Topic Models
DutchMLSchool. Associations and Topic Models
 
DutchMLSchool. Opening Remarks
DutchMLSchool. Opening RemarksDutchMLSchool. Opening Remarks
DutchMLSchool. Opening Remarks
 
MLSEV. Cluster Analysis and Anomaly Detection
MLSEV. Cluster Analysis and Anomaly DetectionMLSEV. Cluster Analysis and Anomaly Detection
MLSEV. Cluster Analysis and Anomaly Detection
 
DutchMLSchool. ML for Logistics
DutchMLSchool. ML for LogisticsDutchMLSchool. ML for Logistics
DutchMLSchool. ML for Logistics
 
Square's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong YanSquare's Machine Learning Infrastructure and Applications - Rong Yan
Square's Machine Learning Infrastructure and Applications - Rong Yan
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
Explainability and bias in AI
Explainability and bias in AIExplainability and bias in AI
Explainability and bias in AI
 
Interpretable Machine Learning
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine Learning
 
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
Elena Grewal, Data Science Manager, Airbnb at MLconf SF 2016
 
Building Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemMLBuilding Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemML
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective
 
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
Practical Explainable AI: How to build trustworthy, transparent and unbiased ...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1
 
Explainable AI
Explainable AIExplainable AI
Explainable AI
 

Similaire à DutchMLSchool. Machine Learning End-to-End

MLSEV. Anatomy of an ML Application
MLSEV. Anatomy of an ML ApplicationMLSEV. Anatomy of an ML Application
MLSEV. Anatomy of an ML ApplicationBigML, Inc
 
BSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBigML, Inc
 
MLSD18. End-to-End Machine Learning
MLSD18. End-to-End Machine LearningMLSD18. End-to-End Machine Learning
MLSD18. End-to-End Machine LearningBigML, Inc
 
Growth Hacking Strategies For Rapid Growth
Growth Hacking Strategies For Rapid GrowthGrowth Hacking Strategies For Rapid Growth
Growth Hacking Strategies For Rapid GrowthRoland Frasier
 
Data Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingData Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingChristopher Conlan
 
VSSML18 Introduction to Supervised Learning
VSSML18 Introduction to Supervised LearningVSSML18 Introduction to Supervised Learning
VSSML18 Introduction to Supervised LearningBigML, Inc
 
Competitive Analysis for SEO - SEMNE
Competitive Analysis for SEO - SEMNE Competitive Analysis for SEO - SEMNE
Competitive Analysis for SEO - SEMNE Casie Gillette
 
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.Andreas Klinger
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninInside Analysis
 
BigMLSchool: Bankruptcy Prediction
BigMLSchool: Bankruptcy PredictionBigMLSchool: Bankruptcy Prediction
BigMLSchool: Bankruptcy PredictionBigML, Inc
 
Chris Silversmith: Structure Local SEO
Chris Silversmith: Structure Local SEOChris Silversmith: Structure Local SEO
Chris Silversmith: Structure Local SEODFWSEM
 
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015Walaa Maher
 
20 top AB testing mistakes and how to avoid them
20 top AB testing mistakes and how to avoid them20 top AB testing mistakes and how to avoid them
20 top AB testing mistakes and how to avoid themCraig Sullivan
 
Vsm Voc Brownbag Webinar 0610009
Vsm Voc Brownbag Webinar 0610009Vsm Voc Brownbag Webinar 0610009
Vsm Voc Brownbag Webinar 0610009Daniel Walker
 
Myths and Illusions of Cross Device Testing - Elite Camp June 2015
Myths and Illusions of Cross Device Testing - Elite Camp June 2015Myths and Illusions of Cross Device Testing - Elite Camp June 2015
Myths and Illusions of Cross Device Testing - Elite Camp June 2015Craig Sullivan
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinks
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary JinksFREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinks
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinksideatoipo
 

Similaire à DutchMLSchool. Machine Learning End-to-End (20)

MLSEV. Anatomy of an ML Application
MLSEV. Anatomy of an ML ApplicationMLSEV. Anatomy of an ML Application
MLSEV. Anatomy of an ML Application
 
BSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, EvaluationsBSSML17 - Introduction, Models, Evaluations
BSSML17 - Introduction, Models, Evaluations
 
MLSD18. End-to-End Machine Learning
MLSD18. End-to-End Machine LearningMLSD18. End-to-End Machine Learning
MLSD18. End-to-End Machine Learning
 
Big Data? Big Deal, Barclaycard
Big Data? Big Deal, Barclaycard Big Data? Big Deal, Barclaycard
Big Data? Big Deal, Barclaycard
 
Growth Hacking Strategies For Rapid Growth
Growth Hacking Strategies For Rapid GrowthGrowth Hacking Strategies For Rapid Growth
Growth Hacking Strategies For Rapid Growth
 
Data Science Applications in Finance and Investing
Data Science Applications in Finance and InvestingData Science Applications in Finance and Investing
Data Science Applications in Finance and Investing
 
VSSML18 Introduction to Supervised Learning
VSSML18 Introduction to Supervised LearningVSSML18 Introduction to Supervised Learning
VSSML18 Introduction to Supervised Learning
 
Competitive Analysis for SEO - SEMNE
Competitive Analysis for SEO - SEMNE Competitive Analysis for SEO - SEMNE
Competitive Analysis for SEO - SEMNE
 
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.
Startup Metrics, a love story. All slides of an 6h Lean Analytics workshop.
 
The Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine LearninThe Sky’s the Limit – The Rise of Machine Learnin
The Sky’s the Limit – The Rise of Machine Learnin
 
AI & AWS DeepComposer
AI & AWS DeepComposerAI & AWS DeepComposer
AI & AWS DeepComposer
 
BigMLSchool: Bankruptcy Prediction
BigMLSchool: Bankruptcy PredictionBigMLSchool: Bankruptcy Prediction
BigMLSchool: Bankruptcy Prediction
 
Chris Silversmith: Structure Local SEO
Chris Silversmith: Structure Local SEOChris Silversmith: Structure Local SEO
Chris Silversmith: Structure Local SEO
 
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015
Return to Basics: Supply Chain Re-design ..'Isc' turkey 2015
 
20 top AB testing mistakes and how to avoid them
20 top AB testing mistakes and how to avoid them20 top AB testing mistakes and how to avoid them
20 top AB testing mistakes and how to avoid them
 
Vsm Voc Brownbag Webinar 0610009
Vsm Voc Brownbag Webinar 0610009Vsm Voc Brownbag Webinar 0610009
Vsm Voc Brownbag Webinar 0610009
 
A6 big data_in_the_cloud
A6 big data_in_the_cloudA6 big data_in_the_cloud
A6 big data_in_the_cloud
 
Myths and Illusions of Cross Device Testing - Elite Camp June 2015
Myths and Illusions of Cross Device Testing - Elite Camp June 2015Myths and Illusions of Cross Device Testing - Elite Camp June 2015
Myths and Illusions of Cross Device Testing - Elite Camp June 2015
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinks
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary JinksFREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinks
FREE! How to Conceptualize Your Idea into a Fundable Startup/Investor Gary Jinks
 

Plus de BigML, Inc

Digital Transformation and Process Optimization in Manufacturing
Digital Transformation and Process Optimization in ManufacturingDigital Transformation and Process Optimization in Manufacturing
Digital Transformation and Process Optimization in ManufacturingBigML, Inc
 
DutchMLSchool 2022 - Automation
DutchMLSchool 2022 - AutomationDutchMLSchool 2022 - Automation
DutchMLSchool 2022 - AutomationBigML, Inc
 
DutchMLSchool 2022 - ML for AML Compliance
DutchMLSchool 2022 - ML for AML ComplianceDutchMLSchool 2022 - ML for AML Compliance
DutchMLSchool 2022 - ML for AML ComplianceBigML, Inc
 
DutchMLSchool 2022 - Multi Perspective Anomalies
DutchMLSchool 2022 - Multi Perspective AnomaliesDutchMLSchool 2022 - Multi Perspective Anomalies
DutchMLSchool 2022 - Multi Perspective AnomaliesBigML, Inc
 
DutchMLSchool 2022 - My First Anomaly Detector
DutchMLSchool 2022 - My First Anomaly Detector DutchMLSchool 2022 - My First Anomaly Detector
DutchMLSchool 2022 - My First Anomaly Detector BigML, Inc
 
DutchMLSchool 2022 - Anomaly Detection
DutchMLSchool 2022 - Anomaly DetectionDutchMLSchool 2022 - Anomaly Detection
DutchMLSchool 2022 - Anomaly DetectionBigML, Inc
 
DutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in MLDutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in MLBigML, Inc
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLBigML, Inc
 
DutchMLSchool 2022 - A Data-Driven Company
DutchMLSchool 2022 - A Data-Driven CompanyDutchMLSchool 2022 - A Data-Driven Company
DutchMLSchool 2022 - A Data-Driven CompanyBigML, Inc
 
DutchMLSchool 2022 - ML in the Legal Sector
DutchMLSchool 2022 - ML in the Legal SectorDutchMLSchool 2022 - ML in the Legal Sector
DutchMLSchool 2022 - ML in the Legal SectorBigML, Inc
 
DutchMLSchool 2022 - Smart Safe Stadiums
DutchMLSchool 2022 - Smart Safe StadiumsDutchMLSchool 2022 - Smart Safe Stadiums
DutchMLSchool 2022 - Smart Safe StadiumsBigML, Inc
 
DutchMLSchool 2022 - Process Optimization in Manufacturing Plants
DutchMLSchool 2022 - Process Optimization in Manufacturing PlantsDutchMLSchool 2022 - Process Optimization in Manufacturing Plants
DutchMLSchool 2022 - Process Optimization in Manufacturing PlantsBigML, Inc
 
DutchMLSchool 2022 - Anomaly Detection at Scale
DutchMLSchool 2022 - Anomaly Detection at ScaleDutchMLSchool 2022 - Anomaly Detection at Scale
DutchMLSchool 2022 - Anomaly Detection at ScaleBigML, Inc
 
DutchMLSchool 2022 - Citizen Development in AI
DutchMLSchool 2022 - Citizen Development in AIDutchMLSchool 2022 - Citizen Development in AI
DutchMLSchool 2022 - Citizen Development in AIBigML, Inc
 
Democratizing Object Detection
Democratizing Object DetectionDemocratizing Object Detection
Democratizing Object DetectionBigML, Inc
 
BigML Release: Image Processing
BigML Release: Image ProcessingBigML Release: Image Processing
BigML Release: Image ProcessingBigML, Inc
 
Machine Learning in Retail: Know Your Customers' Customer. See Your Future
Machine Learning in Retail: Know Your Customers' Customer. See Your FutureMachine Learning in Retail: Know Your Customers' Customer. See Your Future
Machine Learning in Retail: Know Your Customers' Customer. See Your FutureBigML, Inc
 
Machine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorMachine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorBigML, Inc
 
ML in GRC: Machine Learning in Legal Automation, How to Trust a Lawyerbot
ML in GRC: Machine Learning in Legal Automation, How to Trust a LawyerbotML in GRC: Machine Learning in Legal Automation, How to Trust a Lawyerbot
ML in GRC: Machine Learning in Legal Automation, How to Trust a LawyerbotBigML, Inc
 
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...BigML, Inc
 

Plus de BigML, Inc (20)

Digital Transformation and Process Optimization in Manufacturing
Digital Transformation and Process Optimization in ManufacturingDigital Transformation and Process Optimization in Manufacturing
Digital Transformation and Process Optimization in Manufacturing
 
DutchMLSchool 2022 - Automation
DutchMLSchool 2022 - AutomationDutchMLSchool 2022 - Automation
DutchMLSchool 2022 - Automation
 
DutchMLSchool 2022 - ML for AML Compliance
DutchMLSchool 2022 - ML for AML ComplianceDutchMLSchool 2022 - ML for AML Compliance
DutchMLSchool 2022 - ML for AML Compliance
 
DutchMLSchool 2022 - Multi Perspective Anomalies
DutchMLSchool 2022 - Multi Perspective AnomaliesDutchMLSchool 2022 - Multi Perspective Anomalies
DutchMLSchool 2022 - Multi Perspective Anomalies
 
DutchMLSchool 2022 - My First Anomaly Detector
DutchMLSchool 2022 - My First Anomaly Detector DutchMLSchool 2022 - My First Anomaly Detector
DutchMLSchool 2022 - My First Anomaly Detector
 
DutchMLSchool 2022 - Anomaly Detection
DutchMLSchool 2022 - Anomaly DetectionDutchMLSchool 2022 - Anomaly Detection
DutchMLSchool 2022 - Anomaly Detection
 
DutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in MLDutchMLSchool 2022 - History and Developments in ML
DutchMLSchool 2022 - History and Developments in ML
 
DutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End MLDutchMLSchool 2022 - End-to-End ML
DutchMLSchool 2022 - End-to-End ML
 
DutchMLSchool 2022 - A Data-Driven Company
DutchMLSchool 2022 - A Data-Driven CompanyDutchMLSchool 2022 - A Data-Driven Company
DutchMLSchool 2022 - A Data-Driven Company
 
DutchMLSchool 2022 - ML in the Legal Sector
DutchMLSchool 2022 - ML in the Legal SectorDutchMLSchool 2022 - ML in the Legal Sector
DutchMLSchool 2022 - ML in the Legal Sector
 
DutchMLSchool 2022 - Smart Safe Stadiums
DutchMLSchool 2022 - Smart Safe StadiumsDutchMLSchool 2022 - Smart Safe Stadiums
DutchMLSchool 2022 - Smart Safe Stadiums
 
DutchMLSchool 2022 - Process Optimization in Manufacturing Plants
DutchMLSchool 2022 - Process Optimization in Manufacturing PlantsDutchMLSchool 2022 - Process Optimization in Manufacturing Plants
DutchMLSchool 2022 - Process Optimization in Manufacturing Plants
 
DutchMLSchool 2022 - Anomaly Detection at Scale
DutchMLSchool 2022 - Anomaly Detection at ScaleDutchMLSchool 2022 - Anomaly Detection at Scale
DutchMLSchool 2022 - Anomaly Detection at Scale
 
DutchMLSchool 2022 - Citizen Development in AI
DutchMLSchool 2022 - Citizen Development in AIDutchMLSchool 2022 - Citizen Development in AI
DutchMLSchool 2022 - Citizen Development in AI
 
Democratizing Object Detection
Democratizing Object DetectionDemocratizing Object Detection
Democratizing Object Detection
 
BigML Release: Image Processing
BigML Release: Image ProcessingBigML Release: Image Processing
BigML Release: Image Processing
 
Machine Learning in Retail: Know Your Customers' Customer. See Your Future
Machine Learning in Retail: Know Your Customers' Customer. See Your FutureMachine Learning in Retail: Know Your Customers' Customer. See Your Future
Machine Learning in Retail: Know Your Customers' Customer. See Your Future
 
Machine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorMachine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail Sector
 
ML in GRC: Machine Learning in Legal Automation, How to Trust a Lawyerbot
ML in GRC: Machine Learning in Legal Automation, How to Trust a LawyerbotML in GRC: Machine Learning in Legal Automation, How to Trust a Lawyerbot
ML in GRC: Machine Learning in Legal Automation, How to Trust a Lawyerbot
 
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
ML in GRC: Supporting Human Decision Making for Regulatory Adherence with Mac...
 

Dernier

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Dernier (20)

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

DutchMLSchool. Machine Learning End-to-End

  • 1. 1st edition | July 8-11, 2019
  • 2. BigML, Inc #DutchMLSchool Anatomy of an ML Application Machine Learning End-to-End Poul Petersen CIO, BigML 2
  • 3. BigML, Inc #DutchMLSchool Examples of ML Applications 3
  • 4. BigML, Inc #DutchMLSchool Real-world ML Applications 4 • Should you sign that NDA? • Upload the NDA to the website • The service uses Machine Learning to decide if the terms are fair https://ndalynn.com/
  • 5. BigML, Inc #DutchMLSchool Real-world ML Applications 5 • Gathers over 500 features about companies: • Crunchbase / Tweets / Patents / LinkedIn / etc. • Creates a label for success/failure: • IPO or acquisition = success • Bankruptcy or irrelevance = failure • Uses Machine Learning to build a model that predicts the success or failure of startups • And puts all of the information together into an investor dashboard https://preseries.com
  • 6. BigML, Inc #DutchMLSchool ML Adoption 6 "The gap for most companies isn’t that machine learning doesn’t work, but that they struggle to actually use it” • Why? • Too much focus on algorithms • Not enough focus on applying Machine
  • 7. BigML, Inc #DutchMLSchool Real-world ML Applications 7 https://thepointsguy.com/news/this-is-the-reason-you-arent-feeling-as-much-turbulence-on-delta-flights/ …collecting and analyzing “hundreds of thousands of data points,” with a plan to boost that to “millions,” creating a model that forecasts turbulence with a level of confidence heretofore unseen. Not Important: the algorithm!
  • 8. BigML, Inc #DutchMLSchool Machine Learning Evolution 8 Genesis Custom built Product Service Utility Academics & Researchers Scientists Developers Analysts Everyone 1950s 2000s 2011 2030 Commodity 2020 Ubiquity CertaintyUnknown Defined NovelCommon Weka, Scikit BigML, Azure ML, Amazon ML, Google Cloud ML1st Workshop on Machine Learning 1980 1980 • Machine Learning algorithms are fun to talk about: GPUs, NNs, etc • But the algorithms are largely a commodity already • Difficulty is knowing how to apply ML
  • 9. BigML, Inc #DutchMLSchool What is an ML Application 9 AIRLINE ORIGIN DESTINATION DEPARTURE DELAY DISTANCE ARRIVAL DELAY AS ANC SEA -11 1448,0 -22 AA LAX PBI -8 2330,0 -9 US SFO CLT -2 2296,0 5 AA LAX MIA -5 2342,0 -9 AS SEA ANC -1 1448,0 -21 DL SFO MSP -5 1589 8 NK LAS MSP -6 1299 -17 US LAX CLT 14 2125,0 -10 AA SFO DFW -11 1464,0 -13 DL LAS ATL 3 1747,0 -15 Finding patterns in data that can be used to make inferences… Predictive Models Consider: ML Definition
  • 10. BigML, Inc #DutchMLSchool What is an ML Application 10 AIRLINE ORIGIN DESTINATION DEPARTURE DELAY DISTANCE ARRIVAL DELAY AS ANC SEA -11 1448,0 -22 AA LAX PBI -8 2330,0 -9 US SFO CLT -2 2296,0 5 AA LAX MIA -5 2342,0 -9 AS SEA ANC -1 1448,0 -21 DL SFO MSP -5 1589 8 NK LAS MSP -6 1299 -17 US LAX CLT 14 2125,0 -10 AA SFO DFW -11 1464,0 -13 DL LAS ATL 3 1747,0 -15 Predictive Models • Where does this data come from? • How do you know what data? • Is the data formatted correctly? • What do you do with these models? • How do you combine them? • Will it work?
  • 11. BigML, Inc #DutchMLSchool Reality of a ML Application 11 Data Transformations Feature Engineering Data Collection Evaluation & Retraining Seen Unseen Predictive App
  • 12. BigML, Inc #DutchMLSchool Where to Start? 12 Step 1 Finish Step 2 - - - - - - - - ??? “Let’s predict 
 customer churn!” “Here are the customers we predict will leave our service”
  • 13. BigML, Inc #DutchMLSchool Where to Start? 13 Step 1 Finish Step 2 - - - - - - - - ??? “Let’s detect 
 fraud! “Here are the transactions we should stop immediately.
  • 14. BigML, Inc #DutchMLSchool ML Application Guide 14 • Remember: ML finds patterns in data enabling predictions about future events • This means you need data • What data depends on what you want to predict • And the data you have or can collect • Data needs to have patterns related to what you want to predict • Not magic: still can’t predict random events, lotteries, etc • Your problem statement needs to be specific • Not “Let’s predict churn” • But “Let’s predict churn by looking at the profile data of all previous customers of our service who have/have not churned” • This can be tricky… State the problem as an ML Task
  • 15. BigML, Inc #DutchMLSchool Where to Start? 15 Step 1 Finish “Let’s predict 
 the Oscars!” “Here are the 
 predicted winners” Step 2 - - - - - - - - ??? • Statement is not specific enough!!! • What data can we collect that predicts Oscar wins?
  • 16. BigML, Inc #DutchMLSchool Predicting the Oscars 16 • 6 out of 6 right! • 8 out of 8 actually, but probability of the predictions was “too low” • Adapted Screenplay • Original Screenplay BigML Scoresheet 2018 • 4 our of 8 major awards correctly predicted • Probabilities were lower this year • This is still significantly better than guessing 2019 How is this possible? Isn't the winner random?
  • 17. BigML, Inc #DutchMLSchool How an Oscar is Won 17 voting intention? 7,000+ members Insight: winning awards is not a random event!
  • 18. BigML, Inc #DutchMLSchool Let’s Predict Best Picture 18 Win London Critics Lose Writers Guild Win Directors Guild Win Golden Win Bafta • These events are *not* independent • Similar, but not identical, factors contribute to each win… • We can expect a higher probability for Shape of Water to win Oscar ?Win?
  • 19. BigML, Inc #DutchMLSchool The Features 19 MOVIES AWARDS OBJECTIVE • year • movie • movie_id • certificate • duration • genre • rate • metascore • synopsis • votes • gross • release_date • user_reviews • critic_reviews • popularity • awards_wins • awards_nomination s • release_date.year • release_date.mont h • release_date.day- of-month • release_date.day- of-week • Oscar_Best_Picture_nominated • Oscar_Best_Director_nominated • Oscar_Best_Actor_nominated • Oscar_Best_Actress_nominated • Oscar_Best_Supporting_Actor_nominated • Oscar_Best_Supporting_Actress_nominated • Oscar_Best_AdaScreen_nominated • Oscar_Best_OriScreen_nominated • Oscar_nominated • Oscar_nominated_categories • Golden_Globes_won • Golden_Globes_won_categories • Golden_Globes_nominated • Golden_Globes_nominated_categories • BAFTA_won • BAFTA_won_categories • BAFTA_nominated • BAFTA_nominated_categories • Screen_Actors_Guild_won • Screen_Actors_Guild_won_categories • Screen_Actors_Guild_nominated • Screen_Actors_Guild_nominated_categories • Critics_Choice_won • Critics_Choice_won_categories • Critics_Choice_nominated • Critics_Choice_nominated_categories • Directors_Guild_won • Directors_Guild_won_categories • Directors_Guild_nominated • Directors_Guild_nominated_categories • Producers_Guild_won • Producers_Guild_won_categories • Producers_Guild_nominated • Producers_Guild_nominated_categories • Art_Directors_Guild_won • Art_Directors_Guild_won_categories • Art_Directors_Guild_nominated • Art_Directors_Guild_nominated_categories • Writers_Guild_won • Writers_Guild_won_categories • Writers_Guild_nominated • Writers_Guild_nominated_categories • Costume_Designers_Guild_won • Costume_Designers_Guild_won_categories • Costume_Designers_Guild_nominated • Costume_Designers_Guild_nominated_categories • Online_Film_Television_Association_won • Online_Film_Television_Association_won_categories • Online_Film_Television_Association_nominated • Online_Film_Television_Association_nominated_catego ries • Online_Film_Critics_Society_won • Online_Film_Critics_Society_won_categories • Online_Film_Critics_Society_nominated • Online_Film_Critics_Society_nominated_categories • People_Choice_won • People_Choice_won_categories • People_Choice_nominated • People_Choice_nominated_categories • London_Critics_Circle_Film_won • London_Critics_Circle_Film_won_categories • London_Critics_Circle_Film_nominated • London_Critics_Circle_Film_nominated_categories • American_Cinema_Editors_won • American_Cinema_Editors_won_categories • American_Cinema_Editors_nominated • American_Cinema_Editors_nominated_categories • Hollywood_Film_won • Hollywood_Film_won_categories • Hollywood_Film_nominated • Hollywood_Film_nominated_categories • Austin_Film_Critics_Association_won • Austin_Film_Critics_Association_won_categories • Austin_Film_Critics_Association_nominated • Austin_Film_Critics_Association_nominated_categories • Denver_Film_Critics_Society_won • Denver_Film_Critics_Society_won_categories • Denver_Film_Critics_Society_nominated • Denver_Film_Critics_Society_nominated_categories • Boston_Society_of_Film_Critics_won • Boston_Society_of_Film_Critics_won_categories • Boston_Society_of_Film_Critics_nominated • Boston_Society_of_Film_Critics_nominated_categories • New_York_Film_Critics_Circle_won • Oscar_Best_Picture_wo n • Oscar_Best_Director_w on • Oscar_Best_Actor_won • Oscar_Best_Actress_wo n • Oscar_Best_Supporting _Actor_won • Oscar_Best_Supporting _Actress_won Data pulled from IMDB… Engineered Features: Award items field Nomination Counts Awards Counts
  • 20. BigML, Inc #DutchMLSchool Oscars Dataset 20 DATASET is publicly available: https://bigml.com/user/academy_awards/gallery/dataset/ 5a94302592fb565ed400103b
  • 21. BigML, Inc #DutchMLSchool Oscars Example 21 • When specifying the problem, be as specific as possible • Not: “Let’s predict the Oscars” • Instead: “Let’s Predict the Oscars by correlating a series of award wins with the final Oscar win.” • The statement of the problem will guide the data required • Be aware of the cost of collecting the data versus the ROI: Tidbits and Lessons Learned….
  • 22. BigML, Inc #DutchMLSchool Ranking ML Applications 22 FEASIBILITY (incdataavailability/deccomplexity) ROI (impact and cost) - + + NO-BRAINERS START HERE NO-GO POSTPONABLE BRAINERS Thinking about an ML Application?
  • 23. BigML, Inc #DutchMLSchool Oscars Example 23 • When specifying the problem, be as specific as possible • Not: “Let’s predict the Oscars” • Instead: “Let’s Predict the Oscars by correlating a series of award wins with the final Oscar win.” • The statement of the problem will guide the data required • Be aware of the cost of collecting the data versus the ROI: • IMDB data is readily availble • We’re done right? • Nope. You can’t escape Feature Engineering • Items: BAFTA_won_categories = list of nominations • Aggregations: Nomination and Award counts • You can’t escape Feature Selection • Full user reviews costly to collect and not useful Tidbits and Lessons Learned…. Wait: How were you confident in the predictions?
  • 24. BigML, Inc #DutchMLSchool 2013 2016 119 variables Evaluating the Model 24 119 variables 2000 2016 119 variables 2000 2012Original Dataset Test Dataset Train Dataset • Ultimately, we want to use all the history to predict the winner for the current year • In order to evaluate success, we use a model built from 2000-2012 data to predict the winners for 2013-2016 • Built a separate Deepnet for each award category • Evaluation obtained a ROC AUC over 0.98 across all award categories Great: The model seems OK, what next?
  • 25. BigML, Inc #DutchMLSchool Effort of a ML Application 25 State the problem as an ML task Data wrangling Feature engineering Modeling and Evaluations Predictions Measure Results Data transformations ~80% effort ~5% effort ~5% effort This is only such low effort because of platforms like This is an area where is currently innovating Task ~10% effort Effort
  • 26. BigML, Inc #DutchMLSchool Reality Check 26 • All Machine Learned models are wrong • Real-world Machine Learning is iterative • End-to-end Machine Learning is compositional Three Important Concepts in Applying ML…
  • 27. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 27 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done
  • 28. BigML, Inc #DutchMLSchool Basic Workflow 28 SOURCE DATASET MODEL PREDICTION
  • 29. BigML, Inc #DutchMLSchool Feature Engineering 29 MODEL FILTERSOLD HOMES BATCH PREDICTION NEW FEATURES DATASET DEALS DATASET FILTERFORSALE HOMES NEW FEATURES
  • 30. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 30 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done • Each “step” is often multi-stage as well • Filtering/Cleaning data
  • 31. BigML, Inc #DutchMLSchool Anomaly Filter and Evaluate 31 DIABETES SOURCE DIABETES DATASET TRAIN SET TEST SET ALL MODEL CLEAN DATASET FILTER ALL MODEL ALL EVALUATION CLEAN EVALUATION COMPARE EVALUATIONS ANAOMALY DETECTOR
  • 32. BigML, Inc #DutchMLSchool Fixing Missing Values 32 Fix Missing Values in a “Meaningful” Way Filter Zeros Model 
 insulin Predict 
 insulin Select 
 insulin Fixed
 Dataset Amended
 Dataset Original
 Dataset Clean
 Dataset
  • 33. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 33 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done • Each “step” is often multi-stage as well • Filtering/Cleaning data • Tuning a model for optimum performance
  • 34. BigML, Inc #DutchMLSchool Ensemble Tuning 34 ENSEMBLE N=20 EVALUATION SOURCE DATASET TRAINING TEST EVALUATIONEVALUATION ENSEMBLE N=10 ENSEMBLE N=1000 CHOOSE
  • 35. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 35 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done • Each “step” is often multi-stage as well • Filtering/Cleaning data • Tuning a model for optimum performance • Finding the best features
  • 36. BigML, Inc #DutchMLSchool Best-first Feature Selection 36 {F1} CHOOSE BEST S = {Fa} {F2} {F3} {F4} Fn S+{F1} S+{F2} S+{F3} S+{F4} S+{Fn-1} CHOOSE BEST S = {Fa, Fb} S+{F1} S+{F2} S+{F3} S+{F4} S+{Fn-1} CHOOSE BEST S = {Fa, Fb, Fc}
  • 37. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 37 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done • Each “step” is often multi-stage as well • Filtering/Cleaning data • Tuning a model for optimum performance • Finding the best features • May require models for several domains of knowledge • Multiple Training / Scoring
  • 38. BigML, Inc #DutchMLSchool AGGREGATED BY CARD AGGREGATED BY USER AGGREGATED BY PROFILE Multiple Domains 38 TRANSACTIONS ANOMALY BY CARD ANOMALY BY USER ANOMALY BY PROFILE ANOMALY SCORE ANOMALY SCORE ANOMALY SCORE NEW TRANSACTION APPROVED?
  • 39. BigML, Inc #DutchMLSchool End-to-end ML is Compositional 39 • Real-world problems • Solved by applying a combination of algorithms • Very rarely is it one-and-done • Each “step” is often multi-stage as well • Filtering/Cleaning data • Tuning a model for optimum performance • Finding the best features • May require models for several domains of knowledge • Multiple Training / Scoring • Even after deploying a model • Workflow to monitor performance, know when to retrain
  • 40. BigML, Inc #DutchMLSchool Model Retraining 40 TRAINING INPUT DATA PREDICTIONS ANOMALY SCORES OUTCOMES RETRAIN DATA
  • 41. BigML, Inc #DutchMLSchool Reality Check 41 • All Machine Learned models are wrong Three Important Concepts in Applying ML… • Real-world Machine Learning is iterative • End-to-end Machine Learning is compositional
  • 42. BigML, Inc #DutchMLSchool • Better features always beat better algorithms • Good algorithms already exist and are good enough • Tools like OptiML exist which can help optimize performance • The data is never good enough Tenets of Machine Learning 42 • All Machine Learned models are wrong • Real-world Machine Learning is iterative • End-to-end Machine Learning is compositional • Automation is better than hand tuning - you need an API! • When data changes quickly, training speed is more important than accuracy • Repeatability is superior to a single strong result • Problems are solved with workflows of algorithms • A ML solution is not real until it is in production • ML is here: Now we need 100,000x people applying ML , but some are useful