SlideShare une entreprise Scribd logo
1  sur  49
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Working with Amazon SageMaker Algorithms for Faster Model
Training
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
What is Amazon SageMaker?
Exploration Training
Hosting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
The Amazon Machine Learning Stack
FRAMEWORKS & INTERFACES
Caffe2 CNTK
Apache
MXNet
PyTorch
TensorFlo
w
Chainer Keras Gluon
AWS Deep Learning AMIs
Amazon SageMaker
Rekognition Transcribe Translate Polly Comprehend Lex
AWS
DeepLens
EDUCATION
PLATFORM SERVICES
APPLICATION SERVICES
Amazon Mechanical Turk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Challenges in Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Our Customers use ML at a massive scale
“We collect 160M events
daily in the ML pipeline and
run training over the last
15 days and need it to
complete in one hour.
Effectively there's 100M
features in the model”
Valentino Volonghi, CTO
“We process 3 million ad
requests a second, 100,000
features per request. That’s
250 trillion per day. Not
your run of the mill Data
science problem!”
Bill Simmons, CTO
“Our data warehouse is
100TB and we are
processing 2TB daily. We're
running mostly gradient
boosting (trees), LDA and
K-Means clustering and
collaborative filtering.“
Shahar Cizer Kobrinsky, VP
Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
Unusable Data /
Wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Architecture and Design
Streaming, GPU/CPU, Distributed with a Shared State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
Data Size
Memory
Data Size
Time/Cost
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
GPU/CPU
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Distributed
GPU State
GPU State
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Shared State
GPU
GPU
GPU Local
State
Shared
State
Local
State
Local
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
State Model
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Abstraction and Containerization
def initialize(...)
def update(...)
def finalize(...)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
No unusable Data /
No wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Amazon SageMaker Algorithms
• DeepAR
• Factorization Machines
• Gradient Boosted Trees (XGBoost)
• Image Classification (ResNet)
• K-Means Clustering
• Latent Dirichlet Allocation (LDA)
• Linear Learner Classification and
Regression
• Neural Topic Modeling (NTM)
• Principal Components Analysis
(PCA)
• Random Cut Forest
• Seq2Seq
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Regression:
Estimate a real valued function
Binary Classification:
Predict a 0/1 class
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Train
Fit thresholds
and select
Select model with best validation performance
>8x speedup over naïve parallel training!
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
30GB datasets for web-spam and web-url classification
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Neural Topic Modeling
Encoder: feedforward net
Input term counts vector
Document
Posterior
Sampled Document
Representation
Decoder:
Softmax
Output term counts vector
Perplexity vs. Number of Topic
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
DeepAR –time series forecasting
Mean absolute
percentage error
P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits
of websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
One hour on p2.xlarge, $1
Input
Network
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Using AmazonSageMaker Algorithms
Command Line
SageMaker Notebooks
Amazon EMR
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Input Data
>> aws --profile <profile> 
--region us-west-2 
sm create-training-job 
--training-job-name kmeans-demo 
--algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east-
1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File 
--role-arn "arn:aws:iam::0123456789:role/demo" 
--input-data-config '{"ChannelName": "train", "DataSource": 
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": 
"s3://kmeans_demo/train", "S3DataDistributionType": 
"FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' 
--output-data-config S3OutputPath=s3://kmeans_demo/output 
--resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInHours=1
From Command Line
Hardware
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon SageMaker Notebooks
Parameters
Hardware
Start Training
Host model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon EMR
Start Training
Parameters
Hardware
Apply Model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Questions?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
aws.amazon.com/activate
Everything and Anything Startups
Need to Get Started on AWS

Contenu connexe

Tendances

Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Amazon Web Services
 
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfMAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfAmazon Web Services
 
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...Adrian Hornsby
 
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019 RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019 AWSKRUG - AWS한국사용자모임
 
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon Polly
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon PollyMCL206-Creating Next Generation Speech-Enabled Applications with Amazon Polly
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon PollyAmazon Web Services
 
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...Amazon Web Services
 
[AWS Container Service] Getting Started with Kubernetes on AWS
[AWS Container Service] Getting Started with Kubernetes on AWS[AWS Container Service] Getting Started with Kubernetes on AWS
[AWS Container Service] Getting Started with Kubernetes on AWSAmazon Web Services Korea
 
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018Amazon Web Services Korea
 
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019 딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019 AWSKRUG - AWS한국사용자모임
 
Serverless in Action on AWS
Serverless in Action on AWSServerless in Action on AWS
Serverless in Action on AWSAdrian Hornsby
 
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...RET302-Delight your Retail Customers with an Interactive Customer Service Exp...
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...Amazon Web Services
 
Innovations and the Cloud
Innovations and the CloudInnovations and the Cloud
Innovations and the CloudAdrian Hornsby
 
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech Talks
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech TalksSentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech Talks
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech TalksAmazon Web Services
 
Innovations and The Cloud
Innovations and The CloudInnovations and The Cloud
Innovations and The CloudAdrian Hornsby
 
SRV318_Research at PNNL Powered by AWS
SRV318_Research at PNNL Powered by AWSSRV318_Research at PNNL Powered by AWS
SRV318_Research at PNNL Powered by AWSAmazon Web Services
 
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018 데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018 Amazon Web Services Korea
 
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017Amazon Web Services
 
MCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSMCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSAmazon Web Services
 

Tendances (20)

Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
 
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfMAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
 
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
re:Invent re:Cap - An overview of Artificial Intelligence and Machine Learnin...
 
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019 RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
RoboMaker로 DeepRacer 자율 주행차 만들기 :: 유정열 - AWS Community Day 2019
 
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon Polly
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon PollyMCL206-Creating Next Generation Speech-Enabled Applications with Amazon Polly
MCL206-Creating Next Generation Speech-Enabled Applications with Amazon Polly
 
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...
Training Chatbots and Conversational Artificial Intelligence Agents with Amaz...
 
Practical AWS Fargate
Practical AWS FargatePractical AWS Fargate
Practical AWS Fargate
 
[AWS Container Service] Getting Started with Kubernetes on AWS
[AWS Container Service] Getting Started with Kubernetes on AWS[AWS Container Service] Getting Started with Kubernetes on AWS
[AWS Container Service] Getting Started with Kubernetes on AWS
 
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018
AWS Cloud9과 Workspace만으로 PC없는 개발환경 활용기 (박성용, 허밍랩) :: AWS DevDay 2018
 
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019 딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019
딥러닝@EDM페스티발 누가누가 잘 노나? :: 김태웅 - AWS Community Day 2019
 
Serverless in Action on AWS
Serverless in Action on AWSServerless in Action on AWS
Serverless in Action on AWS
 
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...RET302-Delight your Retail Customers with an Interactive Customer Service Exp...
RET302-Delight your Retail Customers with an Interactive Customer Service Exp...
 
Machine Learning in Practice
Machine Learning in PracticeMachine Learning in Practice
Machine Learning in Practice
 
Innovations and the Cloud
Innovations and the CloudInnovations and the Cloud
Innovations and the Cloud
 
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech Talks
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech TalksSentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech Talks
Sentiment Analysis Using Apache MXNet and Gluon - AWS Online Tech Talks
 
Innovations and The Cloud
Innovations and The CloudInnovations and The Cloud
Innovations and The Cloud
 
SRV318_Research at PNNL Powered by AWS
SRV318_Research at PNNL Powered by AWSSRV318_Research at PNNL Powered by AWS
SRV318_Research at PNNL Powered by AWS
 
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018 데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018
데이터센터 1도모르는 개발자가 마이크로서비스를 만났을때 (안주은, MyMusicTaste) :: AWS DevDay 2018
 
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017
Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017
 
MCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSMCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWS
 

Similaire à Working with Amazon SageMaker Algorithms for Faster Model Training

SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
FSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningFSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningAmazon Web Services
 
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...New Relic
 
在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)Amazon Web Services
 
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...Amazon Web Services
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsMLconf
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Amazon Web Services
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...Amazon Web Services
 
ENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale MigrationsENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale MigrationsAmazon Web Services
 
AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0Amazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...New Relic
 
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2Amazon Web Services
 
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017Amazon Web Services
 
Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.Adrian Hornsby
 
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...Amazon Web Services
 
透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率 透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率 Amazon Web Services
 
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...Amazon Web Services
 
Building Global Serverless Backends powered by Amazon DynamoDB Global Tables
Building Global Serverless Backends powered by Amazon DynamoDB Global TablesBuilding Global Serverless Backends powered by Amazon DynamoDB Global Tables
Building Global Serverless Backends powered by Amazon DynamoDB Global TablesAmazon Web Services
 

Similaire à Working with Amazon SageMaker Algorithms for Faster Model Training (20)

SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine Learning
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
FSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningFSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine Learning
 
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
 
在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)
 
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
 
ENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale MigrationsENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale Migrations
 
AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
 
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
 
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
 
Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.
 
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...
Optimizing EC2 for Fun and Profit #bigsavings #newfeatures - CMP202 - re:Inve...
 
透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率 透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率
 
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
 
Building Global Serverless Backends powered by Amazon DynamoDB Global Tables
Building Global Serverless Backends powered by Amazon DynamoDB Global TablesBuilding Global Serverless Backends powered by Amazon DynamoDB Global Tables
Building Global Serverless Backends powered by Amazon DynamoDB Global Tables
 

Plus de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Working with Amazon SageMaker Algorithms for Faster Model Training

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Working with Amazon SageMaker Algorithms for Faster Model Training
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved What is Amazon SageMaker? Exploration Training Hosting
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved The Amazon Machine Learning Stack FRAMEWORKS & INTERFACES Caffe2 CNTK Apache MXNet PyTorch TensorFlo w Chainer Keras Gluon AWS Deep Learning AMIs Amazon SageMaker Rekognition Transcribe Translate Polly Comprehend Lex AWS DeepLens EDUCATION PLATFORM SERVICES APPLICATION SERVICES Amazon Mechanical Turk
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Challenges in Machine Learning
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Machine Learning
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Our Customers use ML at a massive scale “We collect 160M events daily in the ML pipeline and run training over the last 15 days and need it to complete in one hour. Effectively there's 100M features in the model” Valentino Volonghi, CTO “We process 3 million ad requests a second, 100,000 features per request. That’s 250 trillion per day. Not your run of the mill Data science problem!” Bill Simmons, CTO “Our data warehouse is 100TB and we are processing 2TB daily. We're running mostly gradient boosting (trees), LDA and K-Means clustering and collaborative filtering.“ Shahar Cizer Kobrinsky, VP Architecture
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level Unusable Data / Wasted opportunity
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Architecture and Design Streaming, GPU/CPU, Distributed with a Shared State
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming State
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming Data Size Memory Data Size Time/Cost
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 3 1 2
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved GPU/CPU GPU State
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Distributed GPU State GPU State GPU State
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Shared State GPU GPU GPU Local State Shared State Local State Local State
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved State Model GPU State
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Abstraction and Containerization def initialize(...) def update(...) def finalize(...)
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level No unusable Data / No wasted opportunity
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Amazon SageMaker Algorithms • DeepAR • Factorization Machines • Gradient Boosted Trees (XGBoost) • Image Classification (ResNet) • K-Means Clustering • Latent Dirichlet Allocation (LDA) • Linear Learner Classification and Regression • Neural Topic Modeling (NTM) • Principal Components Analysis (PCA) • Random Cut Forest • Seq2Seq
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Regression: Estimate a real valued function Binary Classification: Predict a 0/1 class
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Train Fit thresholds and select Select model with best validation performance >8x speedup over naïve parallel training!
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved 30GB datasets for web-spam and web-url classification
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Neural Topic Modeling Encoder: feedforward net Input term counts vector Document Posterior Sampled Document Representation Decoder: Softmax Output term counts vector Perplexity vs. Number of Topic
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved DeepAR –time series forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA One hour on p2.xlarge, $1 Input Network
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Using AmazonSageMaker Algorithms Command Line SageMaker Notebooks Amazon EMR
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Input Data >> aws --profile <profile> --region us-west-2 sm create-training-job --training-job-name kmeans-demo --algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east- 1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File --role-arn "arn:aws:iam::0123456789:role/demo" --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://kmeans_demo/output --resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInHours=1 From Command Line Hardware
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon SageMaker Notebooks Parameters Hardware Start Training Host model
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon EMR Start Training Parameters Hardware Apply Model
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Questions?
  • 49. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved aws.amazon.com/activate Everything and Anything Startups Need to Get Started on AWS