Online, Offline, Mobile, Email, Social
www.sailthru.com
Cost Effectively Scaling Machine Learning Systems in the Cloud
Agenda:
● Background on me, Sailthru & Sightlines (mercifully short)
● Cost effective resources in the AWS cloud
● Efficient(ish) application design
● Easy maintenance and evolution
● Machine learning details
@jeremystan
● 2000: Graduate student, Math
● 2005: Consultant, Finance
● 2010: CTO, Ad Tech
● 2015: Chief Data Scientist, Mar Tech
[Chart axes: Capitalism vs. Idealism, Indirect Value vs. Direct Value]
Sailthru
Sightlines
Analytics
- Segmentation
- Forecasting
Personalization
- Recommendations
- Discounting
Optimization
- Frequency
- Channel
Requirements
1. ~5 million users per client
2. JSON formatted user data, siloed across clients
3. Predict varying outcomes
normal, poisson, binomial, quantile, ...
4. Update models & predictions daily
5. Only really care about predictive performance
6. Scale to 1,000+ clients
Our Cost Effective Scaling Strategy
1. Get really cheap computing power
2. Make it work really, really hard
3. Optimize apps for ease of evolution
4. Set up identical A/B environments
Iterate aggressively based on data:
✓ Features
✓ Efficiency
✓ Scale
[Figure: gains from iterating on the two halves of our processing, "JSON to Features" and "GBM in Memory". Multipliers shown: 10x, 3x, 0.6x, 0.5x, 9x, 1x, 0.2x]
Cost Effective Resources in the AWS Cloud
Cost Effective r3.8xlarge (32 vCPU, 244GB RAM)
Resource utilization:
● 10% (data center)
● 30% (typical cloud)
● 90% (highly efficient)
Cost per hour:
● $2.80 (on demand)
● $1.76 (reserved 1yr)
● $1.05 (reserved 3yr)
● $0.28 (spot instance)
Cost per hour, adjusted for utilization:
● Cloud: $9.80
● Data Center: $10.50 ($10.50 = $1.05 / 10%)
● Spot + Mesos + Relay: $0.30
30x more cost efficient!
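The slide's own formula ($10.50 = $1.05 / 10%) divides the hourly price by the fraction of the machine actually used. A minimal sketch of that arithmetic follows. The function name is hypothetical, and pairing the spot price with 90% utilization is an assumption; the slide rounds the results to $0.30 and 30x.

```python
def effective_cost_per_hour(price_per_hour, utilization):
    """Cost of one fully used instance-hour: price divided by the fraction actually used."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    return price_per_hour / utilization


# Prices and utilization levels quoted on the slide for an r3.8xlarge.
data_center = effective_cost_per_hour(1.05, 0.10)   # reserved 3yr price at 10% utilization
spot_cluster = effective_cost_per_hour(0.28, 0.90)  # spot price at 90% (assumed pairing)

print(f"data center:          ${data_center:.2f} per utilized hour")
print(f"spot + Mesos + Relay: ${spot_cluster:.2f} per utilized hour")
print(f"ratio: about {data_center / spot_cluster:.0f}x")
```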
AWS Spot Instances
[Chart: spot price over time. Labels: Your bid, What you pay, All instances died!]
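A rough sketch of what the chart shows, assuming the bid-based spot model: you pay the market price while it stays at or below your bid, and lose the instance once the price passes the bid. The price series and function name are made up for illustration.

```python
def run_on_spot(hourly_prices, bid):
    """Return (hours_run, total_cost, terminated) for one instance under bid-based spot rules."""
    hours_run = 0
    total_cost = 0.0
    for price in hourly_prices:
        if price > bid:
            # Market price passed the bid: the instance is reclaimed.
            return hours_run, total_cost, True
        total_cost += price  # billed at the market price, not at the bid
        hours_run += 1
    return hours_run, total_cost, False


prices = [0.28, 0.30, 0.27, 0.35, 3.10, 0.29]  # made-up hourly spot prices
hours_run, total_cost, terminated = run_on_spot(prices, bid=1.00)
print(hours_run, round(total_cost, 2), terminated)
```

A price spike in one zone or instance type kills everything bidding there at once, which is one reason to spread a cluster across zones and instance types.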
Mesos
81 “slaves”
4 availability zones
2 instance types
1,360 CPUs
10TB of RAM
94% utilized
$11.90 per hour
$104,244 per year
Mesos + Marathon
[Diagram: Mesos slaves (16 CPU and 8 CPU) spread across Zones 1 to 4; a Mesos master; App A, App B and App C, each with a queue size]
Applications must be:
● Distributed, to be scheduled wherever Mesos wants
● Fine-grained, to maximize utilization in Mesos
● Idempotent, to handle duplicate runs if the network is partitioned
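One common way to get the idempotency asked for above is to key each task's output by task id and skip work that is already done. The sketch below is an illustration under that assumption, not the deck's implementation; `run_once` and the task id are hypothetical.

```python
import os
import tempfile


def run_once(task_id, output_dir, compute):
    """Run compute() for task_id unless its output already exists; return the output path."""
    final_path = os.path.join(output_dir, f"{task_id}.out")
    if os.path.exists(final_path):
        return final_path  # an earlier (possibly duplicate) run already finished
    # Write to a temp file, then rename, so a crash never leaves a partial output.
    fd, tmp_path = tempfile.mkstemp(dir=output_dir)
    with os.fdopen(fd, "w") as handle:
        handle.write(compute())
    os.replace(tmp_path, final_path)  # atomic when both paths are on one filesystem
    return final_path


calls = []


def expensive():
    calls.append(1)
    return "result"


workdir = tempfile.mkdtemp()
first = run_once("client42-model3", workdir, expensive)
second = run_once("client42-model3", workdir, expensive)  # duplicate delivery
print(first == second, len(calls))
```

Two duplicates racing each other may both compute, but they write the same content to the same path, so the end state is unchanged.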
Mesos + Marathon
[Chart: available Mesos CPU jiffies over time, split into User and Idle]
Doesn’t work for apps with highly variable load
Mesos + Relay
Relay.Mesos
Auto-scaler for distributed applications
github.com/sailthru/relay.mesos
● Allocates resources based on queue size
● Wraps applications inside Mesos slaves
● Can significantly improve cluster utilization
[Charts: available Mesos CPU jiffies (User vs. Idle) over time, Before Relay and After Relay]
[Diagram: Relay.Mesos alongside the Mesos master, scaling App A, App B and App C by queue size]
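To make "allocates resources based on queue size" concrete, here is a minimal sketch of a queue-driven scaling rule. It is not Relay.Mesos's actual algorithm or API; the function names, the `tasks_per_worker` target and the worker cap are all assumptions.

```python
import math


def desired_workers(queue_size, tasks_per_worker, max_workers):
    """Workers needed to drain the queue, capped by what the cluster can offer."""
    if queue_size <= 0:
        return 0
    return min(max_workers, math.ceil(queue_size / tasks_per_worker))


def scaling_step(current_workers, queue_size, tasks_per_worker=10, max_workers=100):
    """Return how many workers to add (positive) or remove (negative)."""
    return desired_workers(queue_size, tasks_per_worker, max_workers) - current_workers


for queue_size in [0, 5, 250, 5000]:
    print(queue_size, scaling_step(current_workers=20, queue_size=queue_size))
```

An app with an empty queue gives its CPUs back, and that freed capacity is what lets other apps push utilization up.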
Efficient(ish) Application Design
Stolos
Distributed task dependency manager
github.com/sailthru/stolos
● Directed acyclic graph
● Parameterizable templates
● Handles queueing
● Ensures idempotency
Application Pipeline (simplified)
[Diagram: boxes labeled Mongo, ETL, JSON, Assembly, GBMs, Analyze Models, Predict, Upload, Sailthru User API, Mongo, Reports]
Actually much more complex
● ~1,000 clients
● ~10 models
● ~10 steps
● ~100 sub-tasks
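The pipeline can be pictured as a parameterized DAG, sketched here with Python's standard `graphlib`. This is not the Stolos API. The stage names come from the slide, but the dependency edges, the `expand` helper and the `churn` model name are assumptions for illustration.

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Each key depends on the stages in its set. The edges are assumed, not taken from the deck.
PIPELINE = {
    "etl": set(),
    "assembly": {"etl"},
    "gbms": {"assembly"},
    "analyze_models": {"gbms"},
    "predict": {"gbms"},
    "upload": {"predict"},
    "reports": {"analyze_models"},
}


def expand(template, client, model):
    """Turn the stage template into concrete task ids for one client and model."""
    def name(stage):
        return f"{stage}/{client}/{model}"
    return {name(stage): {name(dep) for dep in deps} for stage, deps in template.items()}


def run_order(graph):
    """Return one valid execution order; raises CycleError if the graph is not a DAG."""
    return list(TopologicalSorter(graph).static_order())


graph = expand(PIPELINE, client="client42", model="churn")
order = run_order(graph)
print(order)
```

Expanding one template per client and model is how ~10 steps turn into many thousands of concrete tasks.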
Sampling Strategy
[Diagram: JSON from Mongo stored in S3 as shard 1 through shard 1,000, for Day 1 through Day N]
● JSON sharded on hash(user)
● Consistent 0.1% of data to a Mesos Slave CPU
● Apps sample more as needed
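A sketch of how sharding on hash(user) gives a consistent sample: with 1,000 shards, one shard is about 0.1% of users, and the same user lands in the same shard every day. MD5 and the helper names are assumptions; the deck does not say which hash is used.

```python
import hashlib

N_SHARDS = 1000


def shard_of(user_id):
    """Map a user id to a stable shard in [0, N_SHARDS).

    Python's built-in hash() is salted per process for strings, so a fixed
    digest is used here to keep the assignment stable across runs and machines.
    """
    digest = hashlib.md5(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % N_SHARDS


def shards_for_sample(fraction, first_shard=0):
    """Shards to read for roughly `fraction` of users; each shard holds about 0.1%."""
    count = max(1, round(fraction * N_SHARDS))
    return [(first_shard + i) % N_SHARDS for i in range(count)]


users = [f"user{i}" for i in range(100_000)]
wanted = set(shards_for_sample(0.01))  # an app that needs 1% reads 10 shards
sample = [u for u in users if shard_of(u) in wanted]
print(len(sample))
```

Reading more shards grows the sample without reshuffling who was already in it.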
User Profile JSON Data
Each User Radically Different
[Figure: axes labeled User and Feature]
tidyjson
Turn JSON into data frames
github.com/sailthru/tidyjson
● Arbitrary JSON into R data.frames
● Guarantees deterministic structure
● Seamless with dplyr and %>%
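tidyjson itself is an R package. As a rough Python analogue of the idea, and not its API, the sketch below flattens arbitrarily nested user JSON into rows with a fixed set of columns. The field names in the sample documents are invented.

```python
import json


def flatten(value, prefix=""):
    """Yield (path, leaf_value) pairs for every leaf in a nested JSON value."""
    if isinstance(value, dict):
        for key in sorted(value):  # sorted keys keep the output order deterministic
            yield from flatten(value[key], f"{prefix}.{key}" if prefix else key)
    elif isinstance(value, list):
        for index, item in enumerate(value):
            yield from flatten(item, f"{prefix}[{index}]")
    else:
        yield prefix, value


def to_rows(user_docs):
    """One row per (user, path, value): the same three columns whatever the input shape."""
    rows = []
    for user_id, raw in user_docs:
        for path, leaf in flatten(json.loads(raw)):
            rows.append({"user": user_id, "path": path, "value": leaf})
    return rows


docs = [
    ("u1", '{"opens": 3, "vars": {"city": "NYC"}}'),
    ("u2", '{"purchases": [{"price": 9.5}, {"price": 20}]}'),
]
rows = to_rows(docs)
for row in rows:
    print(row)
```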
Why GBMs?
● Predict varying outcomes
normal, poisson, binomial, quantile, …
● Flexible enough to capture non-linearity & complex interactions
no need to feature engineer for each client
● Minimal number of hyper-parameters
depth, shrinkage, number of trees
● Robust to missing values
no need to impute
Distributing a GBM
$\alpha_1 \cdot \text{tree}_1 + \alpha_2 \cdot \text{tree}_2 + \alpha_3 \cdot \text{tree}_3 + \dots + \alpha_K \cdot \text{tree}_K$
1. Across the sum
Gives bagging, not boosting (iterative)
=> less accurate
2. Within each tree (Spark MLlib, H2O)
A lot of overhead and coordination
=> not efficient for many small GBMs
3. Across the GBMs ✓
50,000 GBMs to build
=> each can be built independently
50,000 = 1,000 clients * 10 models * 5-fold CV
[Diagram: GBM 1 through GBM 50,000, each a full sum of trees, spread over Mesos slaves in Zones 1 to 4]
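Option 3 amounts to enumerating independent (client, model, fold) tasks and handing them to whatever workers are free. In this sketch a thread pool stands in for the Mesos slaves, and `train_one` is a placeholder that returns a label instead of fitting a real GBM.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def build_tasks(n_clients, n_models, n_folds):
    """Every (client, model, fold) combination is one independent GBM to build."""
    return list(product(range(n_clients), range(n_models), range(n_folds)))


def train_one(task):
    """Placeholder for fitting one GBM; no task needs anything from another task."""
    client, model, fold = task
    return task, f"gbm-client{client}-model{model}-fold{fold}"


tasks = build_tasks(n_clients=1000, n_models=10, n_folds=5)
print(len(tasks))

# Run a small slice locally; on the cluster each task would go onto a queue instead.
with ThreadPoolExecutor(max_workers=4) as pool:
    trained = dict(pool.map(train_one, tasks[:20]))
```

Because no task waits on another, the work can be spread as finely as the scheduler likes.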
Grid Search
$\alpha_1 \cdot \text{tree}_1 + \alpha_2 \cdot \text{tree}_2 + \alpha_3 \cdot \text{tree}_3 + \dots + \alpha_K \cdot \text{tree}_K$
For each client & model:
1. Grid search over:
a. Depth: size of trees
b. Shrinkage: λ “learning rate” for $\{\alpha_i\}$
2. Cross-validate for optimal # of trees
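A toy version of the search, in pure Python so it runs anywhere. It makes several simplifying assumptions: trees are depth-1 stumps, so depth is fixed rather than searched; a single holdout set replaces 5-fold CV; squared error only; synthetic data. Each tree is fit to the residuals of the sum so far, which is why the sum itself cannot be built in parallel.

```python
import random


def fit_stump(xs, residuals):
    """Fit a depth-1 regression tree (a single split) to the residuals by least squares."""
    best = None
    for threshold in sorted(set(xs))[:-1]:
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = sum((r - left_mean) ** 2 for r in left) + sum((r - right_mean) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    if best is None:  # all x identical: fall back to the mean residual
        mean = sum(residuals) / len(residuals)
        return lambda x: mean
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def boost(train, valid, shrinkage, n_trees):
    """Return the validation MSE after each boosting round."""
    xs, ys = zip(*train)
    preds = [0.0] * len(xs)
    valid_preds = [0.0] * len(valid)
    errors = []
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, preds)]  # depends on all earlier trees
        stump = fit_stump(xs, residuals)
        preds = [p + shrinkage * stump(x) for p, x in zip(preds, xs)]
        valid_preds = [p + shrinkage * stump(x) for p, (x, _) in zip(valid_preds, valid)]
        errors.append(sum((y - p) ** 2 for p, (_, y) in zip(valid_preds, valid)) / len(valid))
    return errors


def grid_search(train, valid, shrinkages, max_trees):
    """For each shrinkage, pick the number of trees with the lowest validation error."""
    results = {}
    for lam in shrinkages:
        errors = boost(train, valid, lam, max_trees)
        best_k = min(range(1, max_trees + 1), key=lambda k: errors[k - 1])
        results[lam] = (best_k, errors[best_k - 1])
    return results


random.seed(0)
xs = [random.uniform(-1, 1) for _ in range(100)]
data = [(x, x * x + random.gauss(0, 0.05)) for x in xs]
train, valid = data[:80], data[80:]
results = grid_search(train, valid, shrinkages=[0.05, 0.3], max_trees=30)
for lam, (best_k, mse) in results.items():
    print(f"shrinkage={lam}: best # of trees={best_k}, validation MSE={mse:.4f}")
```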
Easy Maintenance & Evolution
Tools Used
● R: Modeling
● Python: ETL
● AWS S3: State
● Zookeeper: Coordination
● Spark: Map Reduce
● Marathon: Running Apps
● Mesos: Sharing
● ELK: Log Mgmt
● Consul: Discovery
● Chef: Automation
● Librato: Monitoring
● Sensu: Alerting
● Asgard: Auto Scaling
● AWS Spot: Compute
Other labels on the slide: Batch, Applications, Frameworks, Cluster, Maintenance, Configuration
How we Iterate
[Diagram: Sailthru User API and Mongo feed JSON into two environments, A and B]
● Tools
● Configuration
● Applications
Versions: v1.0.0, v1.0.1, v1.0.2
✓ Check monitoring
✓ Check logging
✓ Check performance
Thank You! Our team:
Divyanshu Vats, Alex Gaudio, Andras Kerekes, Jeremy Stanley

Jeremy Stanley, EVP/Data Scientist, Sailthru at MLconf NYC

Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 


Jeremy Stanley, EVP/Data Scientist, Sailthru at MLconf NYC

  • 1.
  • 2. Online, Offline, Mobile, Email, Social www.sailthru.com Cost Effectively Scaling Machine Learning Systems in the Cloud Agenda: ● Background on me, Sailthru & Sightlines (mercifully short) ● Cost effective resources in the AWS cloud ● Efficient(ish) application design ● Easy maintenance and evolution ● Machine learning details
  • 3. @jeremystan. Chart labels: Capitalism, Idealism, Indirect Value, Direct Value. Graduate student, Math (2000); Consultant, Finance (2005); CTO, Ad Tech (2010); Chief Data Scientist, Mar Tech (2015)
  • 4. Sailthru
  • 5. Sightlines. Analytics: Segmentation, Forecasting. Personalization: Recommendations, Discounting. Optimization: Frequency, Channel
  • 6. Requirements: 1. ~5 million users per client 2. JSON formatted user data, siloed across clients 3. Predict varying outcomes (normal, poisson, binomial, quantile, ...) 4. Update models & predictions daily 5. Only really care about predictive performance 6. Scale to 1,000+ clients
  • 7. Our Cost Effective Scaling Strategy: 1. Get really cheap computing power 2. Make it work really, really hard 3. Optimize apps for ease of evolution 4. Set up identical A/B environments. Iterate aggressively based on data: ✓ Features ✓ Efficiency ✓ Scale. 10x 3x 0.6x = 0.5x = 9x JSON to Features GBM in Memory 1 x0.2x Half our processing Half our processing
  • 8. Cost Effective Resources in the AWS Cloud
  • 9. Cost Effective: r3.8xlarge (32 vCPU, 244GB RAM). Resource Utilization: 30% (typical cloud), 10% (data center), 90% (highly efficient). Cost Per Hour: $2.80 (on demand), $1.76 (reserved 1yr), $1.05 (reserved 3yr), $0.28 (spot instance). Cloud: $9.80. Data Center: $10.50. Spot + Mesos + Relay: $0.30. 30x more cost efficient! ($10.50 = $1.05 / 10%)
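The slide's own formula, $10.50 = $1.05 / 10%, divides the hourly price by the utilization to get the cost of an hour of capacity that is actually used. A minimal sketch of that arithmetic follows. The function name is hypothetical, and pairing the $0.28 spot price with 90% utilization is an assumption about how the slide's rounded $0.30 figure was derived.

```python
def effective_hourly_cost(price_per_hour: float, utilization: float) -> float:
    """Cost per hour of capacity actually used: price divided by utilization."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    return price_per_hour / utilization


# Figures quoted on the slide for an r3.8xlarge.
data_center = effective_hourly_cost(1.05, 0.10)   # reserved 3yr price, 10% utilized
spot_cluster = effective_hourly_cost(0.28, 0.90)  # spot price, 90% utilized (assumed pairing)

print(f"data center:          ${data_center:.2f} per used hour")
print(f"spot + Mesos + Relay: ${spot_cluster:.2f} per used hour")
print(f"ratio:                {data_center / spot_cluster:.1f}x")
```

With unrounded inputs the ratio comes out somewhat above the slide's headline 30x, which is consistent with the slide rounding the spot figure to $0.30.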
  • 10. AWS Spot Instances. Chart labels: Your bid; What you pay; All instances died!
  • 11. Mesos: 81 “slaves”, 4 availability zones, 2 instance types, 1,360 CPUs, 10TB of RAM, 94% utilized, $11.90 per hour, $104,244 per year
  • 12. Mesos + Marathon. Zone 1, Zone 2, Zone 3, Zone 4; Mesos Slave (16 CPU), Mesos Slave (8 CPU)
  • 13. Mesos + Marathon. Zone 1, Zone 2, Zone 3, Zone 4; Mesos Slave (16 CPU), Mesos Slave (8 CPU); Mesos Master; App A, App B, App C; Queue Size. Applications must be: ● Distributed, to be scheduled wherever Mesos wants ● Fine Grained, to maximize utilization in Mesos ● Idempotent, to handle duplicate runs in case the network is partitioned
  • 14. Mesos + Marathon. Zone 1, Zone 2, Zone 3, Zone 4; Mesos Slave (16 CPU), Mesos Slave (8 CPU); Mesos Master; App A, App B, App C; Queue Size; Time; Available Mesos CPU Jiffies (Idle, User). Doesn’t work for apps with highly variable load
  • 15. Mesos + Relay. Relay.Mesos: auto-scaler for distributed applications (github.com/sailthru/relay.mesos) ● Allocates resources based on queue size ● Wraps applications inside Mesos slaves ● Can significantly improve cluster utilization. Chart labels: Available Mesos CPU Jiffies (User, Idle) over Time, Before Relay and After Relay; App A, App B, App C; Queue Size; Mesos Master; Relay.Mesos
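Allocating resources based on queue size can be illustrated with a simple proportional rule. This is a sketch only and not Relay.Mesos's actual algorithm or API; the function name, the `tasks_per_worker` ratio, and the worker bounds are all hypothetical.

```python
import math


def desired_workers(queue_size: int, tasks_per_worker: int = 10,
                    min_workers: int = 0, max_workers: int = 100) -> int:
    """Number of workers to request for the current queue backlog.

    Hypothetical rule: one worker per `tasks_per_worker` queued tasks,
    clamped to the range [min_workers, max_workers].
    """
    if queue_size < 0 or tasks_per_worker <= 0:
        raise ValueError("queue_size must be >= 0 and tasks_per_worker > 0")
    wanted = math.ceil(queue_size / tasks_per_worker)
    return max(min_workers, min(max_workers, wanted))


for backlog in (0, 25, 5000):
    print(backlog, "queued ->", desired_workers(backlog), "workers")
```

An idle application scales down to the minimum so that its CPUs return to the shared pool, which is the utilization gain the slide describes.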
  • 16. Efficient(ish) Application Design
  • 17. Stolos: distributed task dependency manager (github.com/sailthru/stolos) ● Directed acyclic graph ● Parameterizable templates ● Handles queueing ● Ensures idempotence. Application Pipeline (simplified), diagram labels: Assembly, GBMs, Analyze, Models, JSON, Sailthru User API, Predict, Upload, Mongo, Reports, ETL, Mongo. Actually much more complex: ● ~1,000 clients ● ~10 models ● ~10 steps ● ~100 sub-tasks
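A task dependency graph with idempotent re-runs can be sketched with Python's standard `graphlib` module (Python 3.9+). This is not Stolos's API. The task names come from the slide, but the edges between them are assumed for illustration because the flattened diagram does not preserve them.

```python
from graphlib import TopologicalSorter

# Assumed edges: each task maps to the set of tasks it depends on.
PIPELINE = {
    "assembly": {"etl"},
    "gbms": {"assembly"},
    "analyze": {"gbms"},
    "predict": {"gbms"},
    "upload": {"predict"},
}


def run_pipeline(pipeline: dict, completed: set) -> list:
    """Run tasks in dependency order, skipping any already completed.

    Skipping completed tasks makes a duplicate run a no-op (idempotence).
    """
    ran = []
    for task in TopologicalSorter(pipeline).static_order():
        if task in completed:
            continue
        # The real work for `task` would happen here.
        completed.add(task)
        ran.append(task)
    return ran


done = set()
print(run_pipeline(PIPELINE, done))  # every task, dependencies first
print(run_pipeline(PIPELINE, done))  # nothing left to do on a re-run
```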
  • 18. Sampling Strategy. Diagram labels: Mongo, JSON, S3; Day 1; shard 1 … shard 1,000. JSON sharded on hash(user)
  • 19. Sampling Strategy. Diagram labels: Mongo, JSON, S3; Day 1 … Day N; shard 1 … shard 1,000
  • 20. Sampling Strategy. Diagram labels: Mongo, JSON, S3; Day 1 … Day N; shard 1 … shard 1,000. Consistent 0.1% of data to a Mesos Slave CPU
  • 21. Sampling Strategy. Diagram labels: Mongo, JSON, S3; Day 1 … Day N; shard 1 … shard 1,000. Apps sample more as needed
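Sharding on hash(user) into 1,000 shards means each shard holds a consistent 0.1% of users, and an application can sample more by reading more shards. A minimal sketch, assuming MD5 as the stable hash (the slides do not say which hash is used) and hypothetical function names:

```python
import hashlib

NUM_SHARDS = 1000


def shard_for(user_id: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a user to a shard with a stable hash, so the same user lands in
    the same shard every day (Python's built-in hash() is salted per process)."""
    digest = hashlib.md5(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


def shards_for_sample(fraction: float, num_shards: int = NUM_SHARDS) -> range:
    """Shards to read for roughly `fraction` of all users.

    Always a prefix of the shard list, so a larger sample contains every
    user from a smaller one.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    return range(max(1, round(fraction * num_shards)))


print(shard_for("user-123") == shard_for("user-123"))  # same shard on every call
print(len(shards_for_sample(0.001)))                   # one shard for 0.1% of users
```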
  • 22. User Profile JSON Data
  • 23. Each User Radically Different. Diagram labels: User, Feature, ???
  • 24. Each User Radically Different. Diagram labels: User, Feature. tidyjson: turn JSON into data frames (github.com/sailthru/tidyjson) ● Arbitrary JSON into R data.frames ● Guarantees deterministic structure ● Seamless with dplyr and %>%
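The idea of turning arbitrary JSON into rows with a deterministic structure can be sketched in Python. This is an analogue for illustration, not tidyjson's R API; the function name, the `id` field, and the three-column layout are assumptions.

```python
import json

COLUMNS = ("user_id", "key", "value")  # every row has exactly these fields


def tidy_rows(doc_json: str, id_field: str = "id") -> list:
    """Flatten one JSON user profile into (user_id, key, value) rows.

    However differently each profile is nested, the output always has the
    same three columns, so rows from all users can be stacked together.
    """
    doc = json.loads(doc_json)
    user_id = doc.get(id_field)
    rows = []

    def walk(prefix, node):
        if isinstance(node, dict):
            for key in sorted(node):  # sorted keys give a stable row order
                walk(f"{prefix}.{key}" if prefix else key, node[key])
        elif isinstance(node, list):
            for index, item in enumerate(node):
                walk(f"{prefix}[{index}]", item)
        else:
            rows.append((user_id, prefix, node))

    walk("", {k: v for k, v in doc.items() if k != id_field})
    return rows


profile = '{"id": "u1", "vars": {"plan": "pro"}, "tags": ["a", "b"]}'
for row in tidy_rows(profile):
    print(dict(zip(COLUMNS, row)))
```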
  • 25. Why GBMs? ● Predict varying outcomes: normal, poisson, binomial, quantile, … ● Flexible enough to capture non-linearity & complex interactions: no need to feature engineer for each client ● Minimal number of hyper-parameters: depth, shrinkage, number of trees ● Robust to missing values: no need to impute
  • 26. Distributing a GBM: α1 * tree 1 + α2 * tree 2 + α3 * tree 3 + … + αK * tree K
  • 27. Distributing a GBM: α1 * tree 1 + α2 * tree 2 + α3 * tree 3 + … + αK * tree K. 1. Across the sum: gives bagging, not boosting (iterative) => less accurate. Mesos Slaves in Zone 1, Zone 2, Zone 3, Zone 4
  • 28. Distributing a GBM: α1 * tree 1 + α2 * tree 2 + α3 * tree 3 + … + αK * tree K. 1. Across the sum: gives bagging, not boosting (iterative) => less accurate 2. Within each tree (Spark MLlib, H2O): a lot of overhead and coordination => not efficient for many small GBMs. Mesos Slaves in Zone 1, Zone 2, Zone 3, Zone 4
  • 29. Distributing a GBM. 1. Across the sum: gives bagging, not boosting (iterative) => less accurate 2. Within each tree (Spark MLlib, H2O): a lot of overhead and coordination => not efficient for many small GBMs 3. Across the GBMs ✓: 50,000 GBMs to build => each can be built independently. GBM 1 … GBM 50,000, each of the form α1 * tree 1 + α2 * tree 2 + α3 * tree 3 + … + αK * tree K, on Mesos Slaves in Zone 1, Zone 2, Zone 3, Zone 4. 50,000 = 1,000 clients * 10 models * 5-fold CV
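Distributing across the GBMs amounts to enumerating independent (client, model, fold) jobs and handing each to whichever worker is free. A minimal sketch of that enumeration; `build_gbm` is a hypothetical placeholder, and a local thread pool stands in for the Mesos cluster.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product


def gbm_jobs(n_clients: int = 1000, n_models: int = 10, n_folds: int = 5):
    """Yield one (client, model, fold) tuple per GBM to build."""
    return product(range(n_clients), range(n_models), range(n_folds))


def build_gbm(job):
    """Placeholder for training one GBM. Jobs share no state, so each one
    can run on any worker without coordinating with the others."""
    client, model, fold = job
    return (client, model, fold, "built")


jobs = list(gbm_jobs())
print(len(jobs))  # 1,000 clients * 10 models * 5 folds

# Run a small subset locally to show that the jobs are independent.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(build_gbm, gbm_jobs(2, 3, 5)))
print(len(results))
```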
  • 30. Grid Search: α1 * tree 1 + α2 * tree 2 + α3 * tree 3 + … + αK * tree K. For each client & model: 1. Grid search over: a. Depth: size of trees b. Shrinkage: λ “learning rate” for {αi} 2. Cross-validate for optimal # of trees
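The two-step search on this slide can be sketched as a loop over (depth, shrinkage) pairs, where each pair yields a cross-validated error curve indexed by number of trees and the minimum of that curve picks the tree count. The `cv_error_curve` callable and the toy curve below are hypothetical stand-ins for fitting real GBMs.

```python
from itertools import product


def grid_search(depths, shrinkages, cv_error_curve):
    """Return the best depth, shrinkage, and number of trees.

    `cv_error_curve(depth, shrinkage)` must return a list in which
    element k-1 is the cross-validated error of a GBM with k trees.
    """
    best = None
    for depth, shrinkage in product(depths, shrinkages):
        curve = cv_error_curve(depth, shrinkage)
        n_trees = min(range(len(curve)), key=curve.__getitem__) + 1
        candidate = (curve[n_trees - 1], depth, shrinkage, n_trees)
        if best is None or candidate[0] < best[0]:
            best = candidate
    error, depth, shrinkage, n_trees = best
    return {"depth": depth, "shrinkage": shrinkage,
            "n_trees": n_trees, "cv_error": error}


def toy_curve(depth, shrinkage, max_trees=200):
    """Synthetic curve: smaller shrinkage needs more trees but reaches a
    lower error floor, and depth 3 is made the best depth by construction."""
    best_k = {0.05: 150, 0.1: 80, 0.2: 40}[shrinkage]
    floor = abs(depth - 3) * 0.01 + shrinkage * 0.1
    return [floor + ((k - best_k) / max_trees) ** 2
            for k in range(1, max_trees + 1)]


result = grid_search([1, 3, 5], [0.05, 0.1, 0.2], toy_curve)
print(result)
```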
  • 31. Easy Maintenance & Evolution
  • 32. Tools Used R Modeling Python ETL AWS S3 Batch Applications State Frameworks Zookeeper Coordination Spark Map Reduce Marathon Running Apps Cluster Mesos Sharing Maintenance ELK Log Mgmt Consul Discovery Configuration Chef Automation Librato Monitoring Sensu Alerting Asgard Auto Scaling AWS Spot Compute
  • 33-43. How we Iterate. Diagram labels: A, B; Sailthru User API; Mongo; JSON; v1.0.0, v1.0.1, v1.0.2. ● Tools ● Configuration ● Applications ✓ Check monitoring ✓ Check logging ✓ Check performance
  • 44. Thank You! Our team: Divyanshu Vats, Alex Gaudio, Andras Kerekes, Jeremy Stanley