© 2021 Amazon Web Services, Inc. or its affiliates. All rights reserved |
Sungmin Kim, AWS Solutions Architect
End-to-End Machine Learning with
Amazon SageMaker
In this Talk
• What is Machine Learning?
• Machine Learning Workflow
• Build → Train → Deploy
• Build fast and collaborate
• Amazon SageMaker Studio Notebooks
• Train and tune models
• Amazon SageMaker Training Job
• Amazon SageMaker Hyperparameter Optimization
• Deploy and manage models
• Amazon SageMaker Endpoints
• Amazon SageMaker Pipelines
• Automatic ML Model Generation
• Amazon SageMaker Autopilot
• Machine Learning in the cloud
Marketing Offer On A New Product
Option 1- Build A Rule Engine
Age | Gender | Purchase Date | Items
30 | M | 3/1/2017 | Toy
40 | M | 1/3/2017 | Books
… | … | … | …
Input Output
Age | Gender | Purchase Date | Items
30 | M | 3/1/2017 | Toy
… | … | … | …
Rule 1: 15 < age < 30
Rule 2: Bought Toy = 'Y', Last Purchase < 30 days
Rule 3: Gender = 'M', Bought Toy = 'Y'
Rule 4: ……..
Rule 5: ……..
Human
Programmer
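The hand-written rules on this slide can be sketched as a small rule engine. This is a minimal illustration, not code from the talk: the `Customer` fields, the function name `should_send_offer`, the M/D/Y reading of the dates, and the choice to send the offer when any rule matches are all assumptions, since the slide does not say how the rules combine.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Customer:
    age: int
    gender: str            # 'M' or 'F'
    last_purchase: date    # date of the most recent purchase
    bought_toy: bool       # True if the customer has bought a toy


def should_send_offer(c: Customer, today: date) -> bool:
    """Return True if any hand-written rule matches (assumed OR semantics)."""
    days_since_purchase = (today - c.last_purchase).days
    rule_1 = 15 < c.age < 30
    rule_2 = c.bought_toy and days_since_purchase < 30
    rule_3 = c.gender == "M" and c.bought_toy
    return rule_1 or rule_2 or rule_3


# Hypothetical customers modeled on the two sample rows; dates read as M/D/Y.
today = date(2017, 3, 15)
recent_toy_buyer = Customer(age=30, gender="M", last_purchase=date(2017, 3, 1), bought_toy=True)
book_buyer = Customer(age=40, gender="M", last_purchase=date(2017, 1, 3), bought_toy=False)
print(should_send_offer(recent_toy_buyer, today))
print(should_send_offer(book_buyer, today))
```

Every new business condition means another hand-written line in this function, which is the maintenance burden the rule-engine option carries.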
Option 2 - Learn The Business Rules From Data
Learning
Algorithm
Model
Output
Historical Purchase Data
(Training Data)
Prediction
Age Gender Items
35 F
39 M Toy
Input - New Unseen Data
Age | Gender | Purchase Date | Items
30 | M | 3/1/2017 | Toy
40 | M | 1/3/2017 | Books
… | … | … | …
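To contrast with hand-written rules, the sketch below "learns" a single rule from labeled examples. It is a toy decision stump over age, chosen only because it fits in a few lines; it is not the algorithm used in the talk, and the training rows are made up for illustration.

```python
from typing import List, Tuple


def learn_age_threshold(rows: List[Tuple[int, bool]]) -> int:
    """Pick the age threshold t that best separates buyers from non-buyers.

    The learned model predicts 'will buy' when age < t. Each candidate
    threshold is scored by how many training rows it classifies correctly.
    """
    ages = sorted({age for age, _ in rows})
    candidates = ages + [ages[-1] + 1]
    best_t, best_correct = candidates[0], -1
    for t in candidates:
        correct = sum((age < t) == bought for age, bought in rows)
        if correct > best_correct:
            best_t, best_correct = t, correct
    return best_t


def predict(threshold: int, age: int) -> bool:
    """Apply the learned rule to new, unseen data."""
    return age < threshold


# Hypothetical historical data: (age, bought_toy).
training_data = [(22, True), (25, True), (30, True), (35, False), (40, False), (52, False)]
threshold = learn_age_threshold(training_data)
print(threshold, predict(threshold, 28), predict(threshold, 39))
```

The rule itself comes out of the data; changing the training rows changes the threshold without anyone editing code.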
We Call This Approach Machine Learning
Learning
Algorithm
Model
Output
Historical Purchase Data
(Training Data)
Prediction
Age Gender Items
35 F
39 M Toy
Input - New Unseen Data
Age | Gender | Purchase Date | Items
30 | M | 3/1/2017 | Toy
40 | M | 1/3/2017 | Books
… | … | … | …
Rule 1: 15 < age < 30
Rule 2: Bought Toy = 'Y', Last Purchase < 30 days
Rule 3: Gender = 'M', Bought Toy = 'Y'
Rule 4: ……..
Rule 5: ……..
Human
Programmer
Typical Machine Learning Process
Collect, prepare and
label training data
Choose and
optimize
ML algorithm
Train and
tune ML models
Set up and
manage
environments
for training
Deploy models
in production
Scale and manage
the production
environment
Set up and track
experiment
Machine Learning is iterative
Choose model
Debug, compare, and
evaluate experiments
Monitor quality, detect
drift, and retrain
Share, review, and
collaborate
Common machine learning development
Laptop
Upside:
• Flexible. Personal. Easy to get started.
Downside:
• Extremely difficult to scale
• Nearly impossible to run in production
• Need virtual environments in order to experiment
Common machine learning development
Servers
Upside:
• Familiar. May seem less expensive upfront.
Downside:
• Availability is incredibly challenging to maintain
• Stuck in either over- or under- utilization
• Experimentation is risky and expensive
• New ideas have to wait for months to start
• Good luck going global!
Amazon
SageMaker
Label
data
Aggregate &
prepare data
Store & share
features
Auto ML Spark/R Detect bias
Visualize in
notebooks
Pick
algorithm
Train
models
Tune
parameters
Debug &
profile
Deploy in
production
Manage
& monitor
CI/CD
Human
review
Ground
Truth Data Wrangler
Feature
store Autopilot Processing Clarify
Studio
Notebooks
Built-in or
Bring-your-own
Experiments
Spot Training
Distributed
Training
Automatic
Model
Tuning
Debugger
Model Hosting
Multi-model
Endpoints
Model
Monitor
Pipelines
Augmented
AI
AMAZON SAGEMAKER EDGE MANAGER
SAGEMAKER STUDIO IDE
AMAZON SAGEMAKER JUMPSTART
VISION SPEECH TEXT SEARCH CHATBOTS PERSONALIZATION FORECASTING FRAUD CONTACT CENTERS
Deep
Learning
AMIs &
Containers
GPUs &
CPUs
Elastic
Inference
Trainium Inferentia FPGA
AI SERVICES
ML SERVICES
FRAMEWORKS & INFRASTRUCTURE
DeepGraphLibrary
Amazon
Rekognition
Amazon
Polly
Amazon
Transcribe
+Medical
Amazon
Lex
Amazon
Personalize
Amazon
Forecast
Amazon
Comprehend
+Medical
Amazon
Textract
Amazon
Kendra
Amazon
CodeGuru
Amazon
Fraud Detector
Amazon
Translate
INDUSTRIAL AI CODE AND DEVOPS
Amazon
DevOps Guru
Voice ID
For Amazon Connect
Contact Lens
Amazon
Monitron
AWS Panorama
+ Appliance
Amazon Lookout
for Vision
Amazon Lookout
for Equipment
Amazon
HealthLake
HEALTHCARE AI
Amazon Lookout
for Metrics
ANOMALY DETECTION
Amazon
Transcribe
for Medical
Amazon
Comprehend
for Medical
A broad set of AI tools for every developer
End-to-End
Machine Learning
Platform
Zero setup Flexible Model
Training
Pay by the second
$
Amazon SageMaker
A fully managed service for easily building, training, and deploying machine learning models
Build fast and collaborate
Amazon SageMaker Studio
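Collaboration
at scale
Share scalable notebooks
without tracking
code dependencies
Easy
experiment
management
Organize, track, and compare
thousands of model experiments
Automatic
model
generation
Generate models automatically
from data without writing code
Higher quality
ML models
Automatic debugging and
real-time error alerts
Model monitoring to
maintain high quality
Increased
productivity
Build fully automated
machine learning workflows
The first fully integrated development environment (IDE) for developing and deploying machine learning models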
Amazon SageMaker Studio
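The first fully integrated development environment (IDE) for developing and deploying machine learning models
Amazon
SageMaker
Studio
start screen
Share notebooks with a single click
A new development environment where developers can spin up an ML notebook in seconds and
share it with a single click
Amazon SageMaker Notebooks
Access the development environment
directly with corporate credentials
Administrators can easily manage
permissions and access control
Secure, fully
managed service
Easy
collaboration
Share by URL
with a single click
Easy access through
single sign-on (SSO)
Serverless environment with
no compute resources to manage
No separate setup
or startup required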
• Jupyter notebooks
• Support for Jupyter Lab
• Multiple built-in kernels
• Install external libraries and
kernels
• Integrate with Git
• Sample notebooks
• VPC Integration for
integrated security
Train and tune models
Amazon SageMaker Training
Docker Container, EC2 Instances, S3 Bucket, Elastic Container Registry
1. model.fit() from Amazon SageMaker Notebooks
2. Sends your data
3. Download Algorithm Image
4. Write trained model to S3
Amazon SageMaker Training
Specify Training Infrastructure
Amazon SageMaker Training
Use Algorithm and Start training
Execution Role
SageMaker Estimator
Amazon SageMaker Training
How does training happen
XGBoost
validation (optional)
test (optional)
ECR
S3
ML Instance
ml.m4.xlarge
xgboost
linear-learner
PCA
DeepAR
BlazingText
Image classification
…
Object Detection
Images
S3
SageMaker
Notebook
SageMaker
Training Job
train
Model
Launch
container for
training Job
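The flow on this slide (an algorithm image in ECR, data channels in S3, an ML instance, a model written back to S3) maps onto the fields of a training job request. The sketch below only builds a dictionary in the shape accepted by the boto3 SageMaker client's `create_training_job`; it does not call AWS. The helper names, bucket, role ARN, image URI, and job name are placeholders, not values from the talk.

```python
from typing import Dict, Optional


def s3_channel(name: str, s3_uri: str, content_type: str = "text/csv") -> Dict:
    """Describe one input channel (train, validation, or test) stored in S3."""
    return {
        "ChannelName": name,
        "ContentType": content_type,
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": s3_uri,
                "S3DataDistributionType": "FullyReplicated",
            }
        },
    }


def build_training_request(job_name: str, image_uri: str, role_arn: str,
                           bucket: str, validation: Optional[str] = None) -> Dict:
    """Assemble a training job request; only the train channel is mandatory here."""
    channels = [s3_channel("train", f"s3://{bucket}/train/")]
    if validation:
        channels.append(s3_channel("validation", validation))
    return {
        "TrainingJobName": job_name,
        "AlgorithmSpecification": {"TrainingImage": image_uri, "TrainingInputMode": "File"},
        "RoleArn": role_arn,
        "InputDataConfig": channels,
        "OutputDataConfig": {"S3OutputPath": f"s3://{bucket}/output/"},
        "ResourceConfig": {"InstanceType": "ml.m4.xlarge", "InstanceCount": 1, "VolumeSizeInGB": 10},
        "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
    }


# All argument values below are placeholders.
request = build_training_request(
    job_name="xgboost-demo",
    image_uri="<account>.dkr.ecr.<region>.amazonaws.com/<xgboost-image>",
    role_arn="arn:aws:iam::123456789012:role/ExampleSageMakerRole",
    bucket="example-bucket",
    validation="s3://example-bucket/validation/",
)
print([c["ChannelName"] for c in request["InputDataConfig"]])
```

With credentials configured, a request like this could be passed as `boto3.client("sagemaker").create_training_job(**request)`.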
SageMaker training supports Spot Instances
EC2 Instance Spot Pricing
• Specify a maximum wait time
• SageMaker defaults to the lowest possible cost
• For bring-your-own models (BYOM), store model checkpoints in Amazon S3
in case your job is interrupted
• Many built-in algorithms automatically revert to a
training job
• We have examples
• Save up to 90%!
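The spot-related settings above can be sketched as extra fields on a training job request, together with the usual way of expressing the savings. This is a hedged illustration: the field names follow the boto3 `create_training_job` request, but the helper functions, the S3 URI, and the example durations are hypothetical.

```python
from typing import Dict


def spot_settings(max_run_seconds: int, max_wait_seconds: int, checkpoint_s3_uri: str) -> Dict:
    """Extra training-request fields that turn on managed spot training."""
    if max_wait_seconds < max_run_seconds:
        # The wait time covers both waiting for spot capacity and running the job.
        raise ValueError("max wait time must be at least the max run time")
    return {
        "EnableManagedSpotTraining": True,
        "StoppingCondition": {
            "MaxRuntimeInSeconds": max_run_seconds,
            "MaxWaitTimeInSeconds": max_wait_seconds,
        },
        # Checkpoints let an interrupted job resume instead of starting over.
        "CheckpointConfig": {"S3Uri": checkpoint_s3_uri},
    }


def savings_percent(training_seconds: int, billable_seconds: int) -> float:
    """Savings relative to on-demand: the share of training time that was not billed."""
    return (1 - billable_seconds / training_seconds) * 100


settings = spot_settings(3600, 7200, "s3://example-bucket/checkpoints/")  # placeholder URI
print(settings["StoppingCondition"])
print(round(savings_percent(training_seconds=1000, billable_seconds=300), 1))
```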
Amazon SageMaker Training
Algorithm Options
1. Built-in algorithms
2. Script mode
3. Docker container (BYOC)
4. AWS ML marketplace
Train with a built-in algorithm
xgboost
linear-learner
PCA
DeepAR
BlazingText
Image classification
…
Object Detection
Built-in Algorithm
Images
Elastic Container Registry
Train using script mode
Bring Your own Container (BYOC)
Elastic Container Registry
Training code
• Matrix factorization
• Regression
• Principal component analysis
• K-means clustering
• Gradient boosted trees
• And more!
17 Built-in algorithms
Bring your own script
(Amazon SageMaker managed container)
Bring your own
algorithm
(you build the
Docker container)
Subscribe to
Algorithms and
Model Packages
on AWS
Marketplace
Many ways to train a model on SageMaker
Algorithm Options
Neural Networks
Number of layers
Hidden layer width
Learning rate
Embedding
dimensions
Dropout
…
Decision Trees
Tree depth
Max leaf nodes
Gamma
Eta
Lambda
Alpha
…
“Hyperparameters”
(algorithm parameters that significantly affect model quality)
Amazon SageMaker Automatic Model Tuning
Hyperparameter Tuning
Amazon SageMaker Automatic Model Tuning
Setting up a hyperparameter tuning job
1
2
3
Automatic
Model Tuning
Training Job 1
Training Job 2
Training Job N
Best
Model
Selector
Best Model
• Define metrics
• Hyperparameter ranges/scaling
• Stop tuning job early
• Use warm start
• Bayesian or random search
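The ideas of hyperparameter ranges, scaling, and random search can be sketched in plain Python. This is not how SageMaker implements tuning; the ranges, the parameter names `eta` and `max_depth`, and the `toy_objective` function are assumptions standing in for real training jobs and a real validation metric.

```python
import math
import random
from typing import Callable, Dict, Tuple


def sample_log_uniform(rng: random.Random, low: float, high: float) -> float:
    """Sample on a logarithmic scale, useful for ranges spanning orders of magnitude."""
    return math.exp(rng.uniform(math.log(low), math.log(high)))


def random_search(objective: Callable[[Dict], float], n_jobs: int, seed: int = 0) -> Tuple[Dict, float]:
    """Try n_jobs random configurations and keep the one with the lowest objective."""
    rng = random.Random(seed)
    best_params, best_score = None, float("inf")
    for _ in range(n_jobs):
        params = {
            "eta": sample_log_uniform(rng, 0.01, 1.0),  # learning rate, log scale
            "max_depth": rng.randint(2, 10),            # tree depth, integer range
        }
        score = objective(params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_objective(params: Dict) -> float:
    """Stand-in for a validation metric; a real job would train a model here."""
    return (math.log10(params["eta"]) + 1) ** 2 + (params["max_depth"] - 6) ** 2


best, score = random_search(toy_objective, n_jobs=50)
print(best, round(score, 4))
```

Because random search draws each configuration independently, all jobs could run concurrently; Bayesian search instead uses the results of earlier jobs to choose later configurations.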
Amazon SageMaker Automatic Model Tuning
Hyperparameter Tuning
Amazon SageMaker Automatic Model Tuning
Hyperparameter Search Strategy
Bayesian Search
Random Search
Amazon SageMaker Automatic Model Tuning
Bayesian vs. Random Search
Bayesian Search Random Search
Amazon SageMaker Automatic Model Tuning
What if I need all my jobs tuned at the same time?
Bayesian Search Random Search
Deploy models
Amazon SageMaker Deployment
Hosting Services
Inference Image
Training Image
Training Data
Model artifacts
Endpoint
Amazon
SageMaker
Amazon S3 Amazon ECR
Model artifacts Inference Image
Model artifacts Inference Image
Amazon SageMaker Deployment
SageMaker Endpoints (Private API)
Auto Scaling group
Availability Zone 1
Availability Zone 2
Availability Zone 3
Elastic
Load Balancing
Model
Endpoint Client
Deployment / Hosting
Amazon SageMaker ML
Compute Instances
Input Data
(Request)
Prediction
(Response)
Amazon SageMaker Deployment
SageMaker Endpoints (Public API)
Auto Scaling group
Availability Zone 1
Availability Zone 2
Availability Zone 3
Elastic
Load Balancing
Model
Endpoint
Amazon
API Gateway Client
Deployment / Hosting
Amazon SageMaker ML
Compute Instances
Input Data
(Request)
Prediction
(Response)
Amazon SageMaker Deployment
Updating Endpoints
Blue-green
deployments mean no
scheduled downtime
Deploy one or more
models behind the
same endpoint
Amazon SageMaker Deployment
A/B Testing
A/B Testing
Secure Endpoint
Inference
Code
Helper
Code
Model
Artifacts
Inference
code Images
Client Application
Inference
request
Inference
result
• 1-10 Production Variants (Model Versions)
• All models must have the same I/O schema
• Endpoint Modification w/o service disruption
Model-1
Inference
Code
Helper
Code
Model
Artifacts
Inference
code Images
Model-2
{
…
'InitialVariantWeight': 2
} {ProductionVariants}
{
…
'InitialVariantWeight': 1
}
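Variant weights translate into traffic shares: each variant receives its weight divided by the sum of all weights. The sketch below works that out for the weights 2 and 1 shown on the slide and simulates the routing locally. The variant names are hypothetical, and the simulation is only an illustration of the proportions, not of how the endpoint routes requests internally.

```python
import random
from collections import Counter
from typing import Dict, List


def traffic_shares(variants: List[Dict]) -> Dict[str, float]:
    """Each variant's share of requests is its weight divided by the sum of weights."""
    total = sum(v["InitialVariantWeight"] for v in variants)
    return {v["VariantName"]: v["InitialVariantWeight"] / total for v in variants}


# Variant names are hypothetical; the weights 2 and 1 come from the slide.
production_variants = [
    {"VariantName": "variant-a", "InitialVariantWeight": 2},
    {"VariantName": "variant-b", "InitialVariantWeight": 1},
]

shares = traffic_shares(production_variants)
print(shares)

# Simulate routing 9,000 requests according to those shares.
rng = random.Random(0)
names = list(shares)
routed = Counter(rng.choices(names, weights=[shares[n] for n in names], k=9000))
print(routed)
```

With these weights, roughly two thirds of requests go to the first variant and one third to the second.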
Amazon SageMaker Deployment
A/B Testing
Model version configuration
Deploy Invoke
Amazon SageMaker Deployment
Multi-Model Endpoints
• Scalable/Cost Effective for large number
of models
• Works best when models are of similar
size and latency
• Automatic memory handling
Multi-Model Endpoints
Secure Endpoint
Model
Artifacts
Client Application
Inference
request
Inference
result
Model-1
Inference
Code
Helper
Code
Container Model
Artifacts
Model-2
Inference
Code
Helper
Code
Container
Invoke Endpoint: TargetModel = Model-1
Prefix = SalesForecast/
Prefix = SalesForecast/
Multi-model endpoints
Significant savings for large-scale deployments
EP-1
Model 1
EP-2
Model 2
EP-10
Model 10
…
EP
Model 1
Model 2
…
Model 10
Sample scenario: ml.c5.xlarge, $0.238/hour, 2 instances running 24/7
10 separate endpoints
$3,430/month
1 multi-model endpoint
$343/month
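The monthly figures in this scenario follow from simple arithmetic. The sketch below reproduces them, assuming a 30-day month (the slide does not state the month length); the slide's $3,430 and $343 are the rounded results.

```python
HOURLY_PRICE_USD = 0.238     # ml.c5.xlarge price quoted on the slide
INSTANCES_PER_ENDPOINT = 2
HOURS_PER_MONTH = 24 * 30    # assumption: a 30-day month


def monthly_cost(endpoints: int) -> float:
    """Cost of running the given number of endpoints around the clock for a month."""
    return endpoints * INSTANCES_PER_ENDPOINT * HOURLY_PRICE_USD * HOURS_PER_MONTH


separate = monthly_cost(endpoints=10)  # one endpoint per model
shared = monthly_cost(endpoints=1)     # all ten models behind one endpoint
print(f"10 separate endpoints: ${separate:,.2f}")
print(f"1 multi-model endpoint: ${shared:,.2f}")
```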
Multi-model endpoints
nevada.tar.gz
Mode: MultiModel
Artifact location:
s3://bucket/your-endpoint-models
predict
predict('nevada.tar.gz', features)
s3://bucket/your-endpoint-models/
new_york.tar.gz
florida.tar.gz
texas.tar.gz
load
new_york.tar.gz
texas.tar.gz
florida.tar.gz
nevada.tar.gz
Amazon SageMaker
Multi-model endpoint Amazon S3
model storage
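The load-on-demand behavior in this diagram can be illustrated with a small cache that loads a model artifact the first time it is requested and evicts the least recently used one when memory is full. This is a conceptual sketch only: the `ModelCache` class, the eviction policy, and `fake_loader` are assumptions, not SageMaker's actual memory management.

```python
from collections import OrderedDict
from typing import Callable, List


class ModelCache:
    """Keep at most `capacity` models in memory, evicting the least recently used."""

    def __init__(self, load_model: Callable[[str], Callable], capacity: int):
        self._load_model = load_model
        self._capacity = capacity
        self._models: "OrderedDict[str, Callable]" = OrderedDict()
        self.loads: List[str] = []  # records every load from storage

    def predict(self, target_model: str, features):
        if target_model in self._models:
            self._models.move_to_end(target_model)  # mark as recently used
        else:
            if len(self._models) >= self._capacity:
                self._models.popitem(last=False)    # evict the oldest entry
            self._models[target_model] = self._load_model(target_model)
            self.loads.append(target_model)
        return self._models[target_model](features)


def fake_loader(artifact: str) -> Callable:
    """Stand-in for fetching and loading s3://bucket/your-endpoint-models/<artifact>."""
    return lambda features: f"{artifact}: scored {len(features)} features"


cache = ModelCache(fake_loader, capacity=2)
cache.predict("nevada.tar.gz", [1, 2, 3])
cache.predict("texas.tar.gz", [1, 2, 3])
cache.predict("nevada.tar.gz", [4, 5])  # already in memory, no reload
cache.predict("florida.tar.gz", [6])    # evicts texas.tar.gz
print(cache.loads)
```

The first request for a model pays the load cost, which is one reason the approach works best when models are of similar size and latency.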
Define
Estimator
Object
created
deploy()
predict()
Object
created
fit()
Amazon SageMaker
End to End Training and Deployment
Manage Workflow for ML Lifecycle
Challenges with creating a complete workflow for the ML lifecycle
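1 Building, training, and deploying models is an iterative process
2 Taking a model from concept to production involves multiple steps
• Create standard code packages for each step of the ML lifecycle
• Connect them in a structure called a workflow
• Manage dependencies between steps
• Run the workflow as an orchestrated sequence
3 Track artifacts for each step of the workflow
4 Deploy and manage the right model version across thousands of models
5 Automate and scale the entire workflow as part of MLOps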
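Managing dependencies between steps and running them as an orchestrated sequence amounts to executing a dependency graph in topological order. The sketch below shows that idea with Python's standard `graphlib` module (Python 3.9+). The step names are hypothetical, and this is not the SageMaker Pipelines SDK.

```python
from graphlib import TopologicalSorter
from typing import Callable, Dict, List, Set

# Hypothetical workflow: each step maps to the set of steps it depends on.
dependencies: Dict[str, Set[str]] = {
    "prepare_data": set(),
    "train": {"prepare_data"},
    "evaluate": {"train"},
    "register_model": {"evaluate"},
    "deploy": {"register_model"},
}


def run_workflow(deps: Dict[str, Set[str]], steps: Dict[str, Callable[[], None]]) -> List[str]:
    """Run every step after the steps it depends on; return the execution order."""
    order = list(TopologicalSorter(deps).static_order())
    for name in order:
        steps[name]()
    return order


log: List[str] = []
# Each step only records its name here; real steps would launch jobs.
steps = {name: (lambda n=name: log.append(n)) for name in dependencies}
order = run_workflow(dependencies, steps)
print(order)
```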
Amazon SageMaker Pipelines
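Build fully automated machine learning workflows at scale
Author and manage
ML workflows
Create detailed workflows with an
easy-to-use Python SDK and
manage them visually
Track model lineage for
governance and audit
Track code, datasets, and versions
for each step of the ML lifecycle
Replay and rerun
workflows
Rerun all steps on a custom
schedule to keep models
up to date
Compare, select, and
deploy models visually
Deploy and manage models through
the SageMaker Studio
visual interface
Centralized ML model
management with a
model registry
Use the model registry to select
the best model for
production deployment
Fully managed MLOps with
built-in CI/CD support
Build fully automated
machine learning workflows
using CI/CD practices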
CI/CD Pipeline Example (1)
1. Edit code & Git Add
2. Git Commit & Push
3. Automatic Pipelining
CI/CD Pipeline Example (2)
Approve or reject production deployment
Compare performance across model versions in the UI, and deploy a model with one click
by changing its status
Compare metrics across model versions
Summary So Far
Machine Learning Workflow
Amazon SageMaker Studio
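The first fully integrated development environment (IDE) for developing and deploying machine learning models
Build models
and collaborate
SageMaker
Notebooks
SageMaker
Pipelines
Build fully automated
machine learning
workflows
Train and validate
models
SageMaker
Training Job
One-click deployment,
model monitoring, and
sustained high quality
SageMaker
Endpoints
Optimize models and
tune across multiple
algorithms
SageMaker
HPO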
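If You Still Find Machine Learning Difficult…
Automatic model generation
Generate ML models automatically
through automatic model tuning
Recommendations and optimization
Get a leaderboard and
keep improving models
Amazon SageMaker Autopilot
An automatic model generation and management service that provides model control and visibility
to overcome the drawbacks of conventional AutoML
Visibility and data control
Notebook source code
for each model
Quick to get started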
How Amazon SageMaker Autopilot Works
https://github.com/aws/amazon-sagemaker-examples/tree/master/autopilot
Autopilot from
SageMaker
Studio
Use Amazon SageMaker Autopilot to automatically
train and tune the best machine learning models
Generates the code and notebooks for you
Amazon SageMaker Autopilot Data
Exploration
Amazon SageMaker Autopilot Candidate
Definition Notebook
Classification
• Linear Learner
• XGBoost
• KNN
Working with Text
• BlazingText
• Supervised
• Unsupervised*
Recommendation
• Factorization Machines
Forecasting
• DeepAR
Topic Modeling
• LDA
• NTM
Built-in Algorithms Provided by Amazon SageMaker
Sequence Translation
• Seq2Seq*
Clustering
• KMeans
Feature Reduction
• PCA
• Object2Vec
Anomaly Detection
• Random Cut Forests
• IP Insights
Computer Vision
• Image Classification
• Object Detection
• Semantic Segmentation
Regression
• Linear Learner
• XGBoost
• KNN
https://docs.aws.amazon.com/sagemaker/latest/dg/algos.html
PREPARE
SageMaker Ground Truth
Label training data for machine learning
SageMaker Data Wrangler
Aggregate and prepare data for
machine learning
SageMaker Processing
Built-in Python, BYO R/Spark
SageMaker Feature Store
Store, update, retrieve, and share features
SageMaker Clarify
Detect bias and understand
model predictions
BUILD
SageMaker Studio Notebooks
Jupyter notebooks with elastic compute
and sharing
Built-in and Bring
your-own Algorithms
Dozens of optimized algorithms or bring
your own
Local Mode
Test and prototype on your local machine
SageMaker Autopilot
Automatically create machine learning
models with full visibility
SageMaker JumpStart
Pre-built solutions for common use cases
TRAIN & TUNE
Managed Training
Distributed infrastructure
management
SageMaker Experiments
Capture, organize, and compare
every step
Automatic
Model Tuning
Hyperparameter optimization
Distributed Training
Libraries
Training for large datasets
and models
SageMaker Debugger
Debug and profile training runs
Managed Spot Training
Reduce training cost by up to 90%
DEPLOY & MANAGE
Managed Deployment
Fully managed, ultra low latency,
high throughput
Kubernetes & Kubeflow
Integration
Simplify Kubernetes-based
machine learning
Multi-Model Endpoints
Reduce cost by hosting multiple models
per instance
SageMaker Model Monitor
Maintain accuracy of deployed models
SageMaker Edge Manager
Manage and monitor models on
edge devices
SageMaker Pipelines
Workflow orchestration and automation
Amazon SageMaker
SageMaker Studio
Integrated development environment (IDE) for ML
Amazon SageMaker overview
Machine learning development
Laptop Servers Cloud
Build Train Deploy
ML infrastructure
Operations
Security & Compliance
Machine Learning in the cloud
SageMaker offers up to 96% lower TCO and 10x more developer productivity
Capability | Amazon SageMaker | Compared to self-managed Amazon EC2 | Compared to self-managed Kubernetes (EKS)
Provision & manage instances | Fully managed | Self-managed | Managed by AWS
Manage security & compliance | Built-in | Self-managed | Self-managed
Infrastructure performance optimization | Scales automatically | Self-managed | Self-managed
Infrastructure management for high-availability | Optimizes automatically | Self-managed | Self-managed
Source of cost-savings
Getting started with
• SageMaker Immersion Day Workshop ✯✯✯
• SageMaker Examples (100+) ✯✯✯
• SageMaker Workshop (Korean)
• Amazon SageMaker Overview (2020-03-25)
• [Video] Amazon SageMaker Overview (2020-03-25)
• [Video] Amazon SageMaker Demo (2020-03-25) ✯✯✯
• AI/ML Resources - videos, slide decks, and more
Put machine learning in the
hands of every developer
Our mission at

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 

End-to-End Machine Learning with Amazon SageMaker

  • 1. © 2021 Amazon Web Services, Inc. or its affiliates. All rights reserved | Sungmin Kim, AWS Solutions Architect End-to-End Machine Learning with Amazon SageMaker
  • 2. In this Talk • What is Machine Learning? • Machine Learning Workflow • Build → Train → Deploy • Build fast and collaborate • Amazon SageMaker Studio Notebooks • Train and tune models • Amazon SageMaker Training Job • Amazon SageMaker Hyperparameter Optimization • Deploy and manage models • Amazon SageMaker Endpoints • Amazon SageMaker Pipelines • Automatic ML Model Generation • Amazon SageMaker Autopilot • Machine Learning in the cloud
  • 3. Marketing Offer On A New Product
  • 4. Option 1 - Build A Rule Engine. Input (Age, Gender, Purchase Date, Items): 30, M, 3/1/2017, Toy; 40, M, 1/3/2017, Books; … Rules written by a human programmer: Rule 1: 15 < age < 30; Rule 2: Bought Toy = Y, Last Purchase < 30 days; Rule 3: Gender = 'M', Bought Toy = 'Y'; Rule 4: …; Rule 5: … Output (Age, Gender, Purchase Date, Items): 30, M, 3/1/2017, Toy; …
  • 5. Option 2 - Learn The Business Rules From Data. Historical purchase data (training data; Age, Gender, Purchase Date, Items): 30, M, 3/1/2017, Toy; 40, M, 1/3/2017, Books; … The learning algorithm produces a model. Input, new unseen data (Age, Gender, Items): 35, F, (none listed); 39, M, Toy. Model output: prediction.
  • 6. We Call This Approach Machine Learning. Historical purchase data (training data; Age, Gender, Purchase Date, Items): 30, M, 3/1/2017, Toy; 40, M, 1/3/2017, Books; … The learning algorithm produces a model. Input, new unseen data (Age, Gender, Items): 35, F, (none listed); 39, M, Toy. Model output: prediction. For contrast, the rules a human programmer would write: Rule 1: 15 < age < 30; Rule 2: Bought Toy = Y, Last Purchase < 30 days; Rule 3: Gender = 'M', Bought Toy = 'Y'; Rule 4: …; Rule 5: …
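The contrast between the two options can be sketched in a few lines of Python. This is a toy illustration only: the purchase history is made up, and the "learning algorithm" (taking the tightest age range that covers every buyer) is a deliberately simple stand-in, not an algorithm named in the talk.

```python
# Toy purchase history as (age, bought_toy) pairs; the values are invented for illustration.
history = [(12, False), (18, True), (22, True), (27, True),
           (29, True), (35, False), (41, False), (52, False)]


def rule_engine(age):
    """Option 1: a human programmer hard-codes the rule (Rule 1 on the slide)."""
    return 15 < age < 30


def learn_age_range(rows):
    """Option 2: derive the rule from data as the tightest age range covering all buyers."""
    buyer_ages = [age for age, bought in rows if bought]
    return min(buyer_ages), max(buyer_ages)


# "Training": the range is computed from the historical data instead of being typed in.
low, high = learn_age_range(history)


def learned_model(age):
    """Predict whether a new, unseen customer is likely to buy a toy."""
    return low <= age <= high


print("hand-written rule:", rule_engine(25), "| learned rule:", learned_model(25))
```

If the purchase history changes, only `learn_age_range` has to be rerun; the hand-written rule would have to be edited by a person.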
  • 7. Typical Machine Learning Process Collect, prepare and label training data Choose and optimize ML algorithm Train and tune ML models Set up and manage environments for training Deploy models in production Scale and manage the production environment 1 2 3
  • 8. Machine Learning is iterative: set up and track experiments; choose model; debug, compare, and evaluate experiments; monitor quality, detect drift, and retrain; share, review, and collaborate
  • 9. Common machine learning development Laptop Upside: • Flexible. Personal. Easy to get started. Downside: • Extremely difficult to scale • Nearly impossible to run in production • Need virtual environments in order to experiment
  • 10. Common machine learning development Servers Upside: • Familiar. May seem less expensive upfront. Downside: • Availability is incredibly challenging to maintain • Stuck in either over- or under- utilization • Experimentation is risky and expensive • New ideas have to wait for months to start • Good luck going global!
  • 11. A broad set of AI tools for every developer. AI SERVICES, by category: VISION, SPEECH, TEXT, SEARCH, CHATBOTS, PERSONALIZATION, FORECASTING, FRAUD, CONTACT CENTERS, CODE AND DEVOPS, INDUSTRIAL AI, HEALTHCARE AI, ANOMALY DETECTION. Services shown: Amazon Rekognition, Amazon Polly, Amazon Transcribe +Medical, Amazon Lex, Amazon Personalize, Amazon Forecast, Amazon Comprehend +Medical, Amazon Textract, Amazon Kendra, Amazon CodeGuru, Amazon Fraud Detector, Amazon Translate, Amazon DevOps Guru, Voice ID for Amazon Connect, Contact Lens, Amazon Monitron, AWS Panorama + Appliance, Amazon Lookout for Vision, Amazon Lookout for Equipment, Amazon HealthLake, Amazon Lookout for Metrics, Amazon Transcribe for Medical, Amazon Comprehend for Medical. ML SERVICES (Amazon SageMaker), tasks: Label data, Aggregate & prepare data, Store & share features, Auto ML, Spark/R, Detect bias, Visualize in notebooks, Pick algorithm, Train models, Tune parameters, Debug & profile, Deploy in production, Manage & monitor, CI/CD, Human review. SageMaker capabilities: Ground Truth, Data Wrangler, Feature store, Autopilot, Processing, Clarify, Studio Notebooks, Built-in or Bring-your-own, Experiments, Spot Training, Distributed Training, Automatic Model Tuning, Debugger, Model Hosting, Multi-model Endpoints, Model Monitor, Pipelines, Augmented AI; plus Amazon SageMaker Edge Manager, SageMaker Studio IDE, and Amazon SageMaker JumpStart. FRAMEWORKS & INFRASTRUCTURE: Deep Learning AMIs & Containers, GPUs & CPUs, Elastic Inference, Trainium, Inferentia, FPGA, DeepGraphLibrary.
  • 13. Amazon SageMaker: End-to-End Machine Learning Platform. Zero setup; Flexible Model Training; Pay by the second ($). A fully managed service for easily building, training, and deploying machine learning models
  • 15. Build fast and collaborate
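  • 16. Amazon SageMaker Studio: the first fully integrated development environment (IDE) for developing and deploying machine learning models. Collaboration at scale: share scalable notebooks without tracking code dependencies. Easy experiment management: organize, track, and compare thousands of model experiments. Automatic model generation: generate models automatically from data without writing code. Higher quality ML models: automatic error debugging and real-time error alerts; model monitoring to maintain high quality. Increased productivity: build fully automated machine learning workflows.
  • 17. Amazon SageMaker Studio: the first fully integrated development environment (IDE) for developing and deploying machine learning models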
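  • 23. Amazon SageMaker Notebooks: a new development environment in which developers can start an ML notebook in seconds and share it with a single click. Direct access to the development environment with employee credentials; administrators can easily control permissions and access; highly secure, fully managed service; easy collaboration; URL-based sharing with a single click; easy access through single sign-on (SSO); serverless environment with no compute resources to manage; no separate setup or startup required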
  • 24. • Jupyter notebooks • Support for Jupyter Lab • Multiple built-in kernels • Install external libraries and kernels • Integrate with Git • Sample notebooks • VPC Integration for integrated security
  • 25. Train and tune models
  • 26. Amazon SageMaker Training. Steps: 1. model.fit(); 2. Sends your data; 3. Download algorithm image; 4. Write trained model to S3. Components shown: Docker container, EC2 instances, S3 bucket, Elastic Container Registry
  • 27. From Amazon SageMaker Notebooks training
  • 28. Amazon SageMaker Training Specify Training Infrastructure
  • 29. Amazon SageMaker Training Use Algorithm and Start training Execution Role SageMaker Estimator
  • 30. Amazon SageMaker Training: How does training happen. Components shown: SageMaker Notebook, SageMaker Training Job, ML instance (ml.m4.xlarge), Model. S3 data channels: train, validation (optional), test (optional). Algorithm images in ECR (XGBoost shown as the example): xgboost, linear-learner, PCA, DeepAR, BlazingText, Image classification, Object Detection, …
  • 32. SageMaker training supports Spot Instances EC2 Instance Spot Pricing • Specify a maximum wait time • SageMaker will default to giving you the lowest possible cost • Store model checkpoints in Amazon S3 in case your job is interrupted for BYOM • Many built-in algorithms automatically revert to a training job • We have examples • Save up to 90%!
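The training-job settings described on the slides above (algorithm image, execution role, S3 input and output, instance type, and the Spot options of a maximum wait time and S3 checkpoints) can be sketched as a request body for the SageMaker `CreateTrainingJob` API. The sketch below only builds the dictionary and makes no AWS call. Every name, ARN, and S3 path in it is a hypothetical placeholder, and the time limits and volume size are arbitrary illustration values.

```python
def build_training_job_request(job_name, image_uri, role_arn, train_s3_uri,
                               output_s3_uri, checkpoint_s3_uri, use_spot=True):
    """Assemble a CreateTrainingJob-style request body (no AWS call is made here)."""
    request = {
        "TrainingJobName": job_name,
        "AlgorithmSpecification": {
            "TrainingImage": image_uri,      # algorithm image stored in ECR
            "TrainingInputMode": "File",
        },
        "RoleArn": role_arn,                  # execution role used by the job
        "InputDataConfig": [{
            "ChannelName": "train",
            "DataSource": {"S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": train_s3_uri,
                "S3DataDistributionType": "FullyReplicated",
            }},
        }],
        "OutputDataConfig": {"S3OutputPath": output_s3_uri},  # trained model goes here
        "ResourceConfig": {
            "InstanceType": "ml.m4.xlarge",   # instance type shown on the slide
            "InstanceCount": 1,
            "VolumeSizeInGB": 10,             # arbitrary illustration value
        },
        "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
    }
    if use_spot:
        # Managed Spot Training: set a maximum wait time (it must be at least the
        # maximum runtime) and checkpoint to S3 so an interrupted job can resume.
        request["EnableManagedSpotTraining"] = True
        request["StoppingCondition"]["MaxWaitTimeInSeconds"] = 7200
        request["CheckpointConfig"] = {"S3Uri": checkpoint_s3_uri}
    return request


# All identifiers below are hypothetical placeholders.
request = build_training_job_request(
    job_name="example-xgboost-job",
    image_uri="<account>.dkr.ecr.<region>.amazonaws.com/<algorithm-image>:<tag>",
    role_arn="arn:aws:iam::<account>:role/<execution-role>",
    train_s3_uri="s3://<bucket>/train/",
    output_s3_uri="s3://<bucket>/output/",
    checkpoint_s3_uri="s3://<bucket>/checkpoints/",
)
print(sorted(request))
```

With real values filled in, a dictionary of this shape could be passed to boto3 as `create_training_job(**request)`; the SageMaker Python SDK's Estimator exposes the same choices through constructor arguments.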
  • 33. Amazon SageMaker Training Algorithm Options Built-in algorithms Script mode Docker container (BYOC) AWS ML marketplace 4 1 2 3
  • 34. Train with a built-in algorithm xgboost linear-learner PCA DeepAR BlazingText Image classification … Object Detection Built-in Algorithm Images Elastic Container Registry
  • 36. Bring Your Own Container (BYOC) Elastic Container Registry
  • 37. Training code • Matrix factorization • Regression • Principal component analysis • K-means clustering • Gradient boosted trees • And more! 17 Built-in algorithms Bring your own script (Amazon SageMaker managed container) Bring your own algorithm (you build the Docker container) Subscribe to Algorithms and Model Packages on AWS Marketplace Many ways to train a model on SageMaker Algorithm Options
  • 38. Neural Networks Number of layers Hidden layer width Learning rate Embedding dimensions Dropout … Decision Trees Tree depth Max leaf nodes Gamma Eta Lambda Alpha … “Hyperparameters” (algorithm parameters that significantly affect model quality) Amazon SageMaker Automatic Model Tuning Hyperparameter Tuning
  • 39. Amazon SageMaker Automatic Model Tuning Setting up a hyperparameter tuning job 1 2 3
  • 40. Automatic Model Tuning Training Job 1 Training Job 2 Training Job N Best Model Selector Best Model • Define Metrics • Hyperparameter ranges/scaling • Stop tuning job early • Use warm start • Bayesian ~OR~ Random Search Amazon SageMaker Automatic Model Tuning Hyperparameter Tuning
  • 41. Amazon SageMaker Automatic Model Tuning Hyperparameter Search Strategy Bayesian Search Random Search
  • 42. Amazon SageMaker Automatic Model Tuning Bayesian vs. Random Search Bayesian Search Random Search
  • 43. Amazon SageMaker Automatic Model Tuning What if I need all my jobs tuned at the same time? Bayesian Search Random Search
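The random-search strategy from the tuning slides can be sketched in plain Python: define hyperparameter ranges, run N independent trials, and keep the best one. This is a local toy, not the SageMaker tuning API. The objective function is a hypothetical stand-in for the metric a real training job would report, and the ranges are invented. Because random-search trials do not depend on each other's results, they could all run at the same time, whereas Bayesian search chooses new trials based on earlier results.

```python
import random


def validation_error(eta, max_depth):
    """Hypothetical stand-in for the objective metric reported by one training job."""
    return (eta - 0.3) ** 2 + 0.01 * (max_depth - 6) ** 2


def random_search(ranges, n_jobs, seed=0):
    """Sample each hyperparameter independently and return the best (score, params)."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_jobs):
        params = {
            "eta": rng.uniform(*ranges["eta"]),              # continuous range
            "max_depth": rng.randint(*ranges["max_depth"]),  # integer range, inclusive
        }
        score = validation_error(**params)
        # "Best model selector": keep the trial with the lowest objective value.
        if best is None or score < best[0]:
            best = (score, params)
    return best


# Hypothetical hyperparameter ranges for a tree-based model.
ranges = {"eta": (0.01, 1.0), "max_depth": (2, 10)}
best_score, best_params = random_search(ranges, n_jobs=20)
print(best_score, best_params)
```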
  • 44. Deploy models
  • 45. Amazon SageMaker Deployment Hosting Services Inference Image Training Image Training Data Model artifacts Endpoint Amazon SageMaker Amazon S3 Amazon ECR Model artifacts Inference Image Model artifacts Inference Image
  • 46. Amazon SageMaker Deployment SageMaker Endpoints (Private API) Auto Scaling group Availability Zone 1 Availability Zone 2 Availability Zone 3 Elastic Load Balancing Model Endpoint Client Deployment / Hosting Amazon SageMaker ML Compute Instances Input Data (Request) Prediction (Response)
  • 47. Amazon SageMaker Deployment SageMaker Endpoints (Public API) Auto Scaling group Availability Zone 1 Availability Zone 2 Availability Zone 3 Elastic Load Balancing Model Endpoint Amazon API Gateway Client Deployment / Hosting Amazon SageMaker ML Compute Instances Input Data (Request) Prediction (Response)
  • 48. Amazon SageMaker Deployment Updating Endpoints Blue-green deployments mean no scheduled downtime Deploy one or more models behind the same endpoint
  • 49. Amazon SageMaker Deployment: A/B Testing • 1-10 Production Variants (Model Versions) • All models must have the same I/O schema • Endpoint Modification w/o service disruption. A client application sends inference requests to a secure endpoint and receives inference results. The endpoint hosts Model-1 and Model-2, each with inference code, helper code, model artifacts, and inference code images. ProductionVariants: { … 'InitialVariantWeight': 2 } and { … 'InitialVariantWeight': 1 }
  • 50. Amazon SageMaker Deployment: A/B Testing. Configure model versions, Deploy, Invoke
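The variant weights on the A/B testing slide translate into traffic shares: each production variant receives its weight divided by the sum of all weights. The sketch below shows a `ProductionVariants` list in the shape used by an endpoint configuration and computes those shares. The variant names, model names, and instance type are hypothetical, and assigning weight 2 to Model-1 is an assumption, since the slide does not make clear which model carries which weight.

```python
# ProductionVariants as they might appear in an endpoint configuration.
# Names and instance type are hypothetical placeholders.
production_variants = [
    {
        "VariantName": "Model-1",
        "ModelName": "example-model-1",
        "InstanceType": "ml.c5.xlarge",
        "InitialInstanceCount": 1,
        "InitialVariantWeight": 2,   # assumed assignment of the slide's weight 2
    },
    {
        "VariantName": "Model-2",
        "ModelName": "example-model-2",
        "InstanceType": "ml.c5.xlarge",
        "InitialInstanceCount": 1,
        "InitialVariantWeight": 1,
    },
]


def traffic_shares(variants):
    """Return each variant's share of requests: its weight over the sum of weights."""
    total = sum(v["InitialVariantWeight"] for v in variants)
    return {v["VariantName"]: v["InitialVariantWeight"] / total for v in variants}


shares = traffic_shares(production_variants)
for name, share in shares.items():
    print(f"{name}: {share:.1%} of inference requests")
```

With weights 2 and 1, roughly two thirds of requests go to the first variant and one third to the second.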
  • 51. Amazon SageMaker Deployment: Multi-Model Endpoints • Scalable/Cost Effective for large number of models • Works best when models are of similar size and latency • Automatic memory handling. A client application sends an inference request to a secure endpoint and receives an inference result. The endpoint serves Model-1 and Model-2 (each shown with model artifacts, inference code, helper code, and a container). Invoke Endpoint: TargetModel = Model-1; Prefix = SalesForecast/
  • 52. Multi-model endpoints Significant savings for large-scale deployments EP-1 Model 1 EP-2 Model 2 EP-10 Model 10 … EP Model 1 Model 2 … Model 10 Sample scenario: ml.c5.xlarge, $0.238/hour, 2 instances running 24/7 10 separate endpoints $3,430/month 1 multi-model endpoint $343/month
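The figures in the sample scenario can be reproduced with a short calculation. The hourly rate and instance count come from the slide; the 30-day month is an assumption, chosen because it reproduces the slide's rounded totals.

```python
HOURLY_RATE = 0.238            # USD per hour for ml.c5.xlarge, from the slide
INSTANCES_PER_ENDPOINT = 2     # 2 instances running 24/7, from the slide
HOURS_PER_MONTH = 24 * 30      # assumption: a 30-day month


def monthly_cost(endpoints):
    """Monthly cost in USD of running the given number of endpoints around the clock."""
    return endpoints * INSTANCES_PER_ENDPOINT * HOURLY_RATE * HOURS_PER_MONTH


separate = monthly_cost(10)     # 10 models on 10 separate endpoints
multi_model = monthly_cost(1)   # the same 10 models behind 1 multi-model endpoint

print(f"10 separate endpoints:  ${separate:,.2f}/month")
print(f"1 multi-model endpoint: ${multi_model:,.2f}/month")
```

The single endpoint comes to about $342.72 per month, which the slide rounds to $343; ten times that rounded figure gives the slide's $3,430.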
  • 53. Multi-model endpoints nevada.tar.gz Mode: MultiModel Artifact location: s3://bucket/your-endpoint-models predict predict(‘nevada.tar.gz’, features) s3://bucket/your-endpoint-models/ new_york.tar.gz florida.tar.gz texas.tar.gz load new_york.tar.gz texas.tar.gz florida.tar.gz nevada.tar.gz Amazon SageMaker Multi-model endpoint Amazon S3 model storage
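The load-on-demand behavior on this slide can be illustrated with a small in-memory cache: a model is loaded the first time it is named in a request and reused afterwards. This is a toy sketch, not SageMaker's implementation. The `ModelCache` class, the fake loader, and the least-recently-used eviction policy are all assumptions made for illustration; the slides only say that memory is handled automatically.

```python
from collections import OrderedDict


class ModelCache:
    """Toy multi-model host: loads models on first use, evicts the least recently used."""

    def __init__(self, capacity, loader):
        self.capacity = capacity        # maximum number of models kept in memory
        self.loader = loader            # callable that "downloads and loads" an artifact
        self.loaded = OrderedDict()     # artifact name -> loaded model, oldest first

    def predict(self, target_model, features):
        if target_model in self.loaded:
            self.loaded.move_to_end(target_model)      # mark as most recently used
        else:
            if len(self.loaded) >= self.capacity:
                self.loaded.popitem(last=False)        # evict the least recently used
            self.loaded[target_model] = self.loader(target_model)
        return self.loaded[target_model](features)


load_log = []


def fake_loader(name):
    """Hypothetical stand-in for fetching an artifact from S3 and loading it."""
    load_log.append(name)
    return lambda features: f"{name}:{sum(features)}"


cache = ModelCache(capacity=2, loader=fake_loader)
cache.predict("nevada.tar.gz", [1, 2])    # first request loads nevada
cache.predict("texas.tar.gz", [3])        # loads texas
cache.predict("nevada.tar.gz", [4])       # already in memory, no load
result = cache.predict("florida.tar.gz", [5])   # cache is full, so texas is evicted
print(load_log, list(cache.loaded), result)
```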
  • 55. Manage Workflow for ML Lifecycle
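  • 56. Challenges with creating a complete workflow for the ML lifecycle • Taking a model from concept to production involves multiple steps: creating a standard code package for each stage of the ML lifecycle, connecting the stages into a structure called a workflow, managing dependencies between steps, and running the workflow as an orchestrated sequence • Building, training, and deploying models is an iterative process • Tracking artifacts for each step of the workflow • Deploying and managing the right model version among thousands of models • Automating and scaling the entire workflow as part of MLOps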
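  • 57. Amazon SageMaker Pipelines: build fully automated machine learning workflows at scale. Create and manage ML workflows: build detailed workflows with an easy-to-use Python SDK and manage them visually. Track model lineage for governance and audit: track code, datasets, and versions for each stage of the ML lifecycle. Replay and rerun workflows: rerun every step on a custom schedule to keep models up to date. Compare, select, and deploy models visually: deploy and manage models through the visual interface in SageMaker Studio. Centralized ML model management with the registry: use the model registry to select the best model for production deployment. Fully managed MLOps with built-in CI/CD support: build fully automated machine learning workflows using CI/CD practices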
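The ideas of managing dependencies between steps and running a workflow as an orchestrated sequence can be sketched with Python's standard `graphlib` module (Python 3.9 or later). This is a local toy, not the SageMaker Pipelines SDK. The step names and the `run_workflow` helper are hypothetical, and each step's action simply returns a string standing in for its artifact.

```python
from graphlib import TopologicalSorter

# Hypothetical workflow: each step maps to the set of steps it depends on.
dependencies = {
    "preprocess": set(),
    "train": {"preprocess"},
    "evaluate": {"train"},
    "register_model": {"evaluate"},
    "deploy": {"register_model"},
}


def run_workflow(dependencies, actions):
    """Run every step after its dependencies and record each step's artifact."""
    artifacts = {}
    for step in TopologicalSorter(dependencies).static_order():
        # Hand each step the artifacts produced by the steps it depends on.
        inputs = {dep: artifacts[dep] for dep in dependencies[step]}
        artifacts[step] = actions[step](inputs)
    return artifacts


# Placeholder actions: a real step would run a processing or training job here.
actions = {
    name: (lambda inputs, name=name: f"{name}-output") for name in dependencies
}

artifacts = run_workflow(dependencies, actions)
execution_order = list(artifacts)   # dicts keep insertion order, i.e. execution order
print(execution_order)
```

Keeping the artifact of every step in one mapping is also the simplest form of the artifact tracking mentioned among the workflow challenges.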
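  • 58. CI/CD Pipeline example (1): 1. Edit code & git add; 2. Git commit & push; 3. Automatic pipelining
  • 59. CI/CD Pipeline example (2): approve or reject production deployment; compare metrics across model versions. The UI makes it easy to compare performance across model versions, and changing a model's status enables one-click model deployment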
  • 61. Machine Learning Workflow
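  • 62. Amazon SageMaker Studio: the first fully integrated development environment (IDE) for developing and deploying machine learning models. SageMaker Notebooks: build models and collaborate. SageMaker Training Job: train and validate models. SageMaker HPO: optimize models and tune multiple algorithms. SageMaker Endpoints: one-click deployment, model monitoring, and maintaining high quality. SageMaker Pipelines: build fully automated machine learning workflows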
  • 63. If You Still Find Machine Learning Difficult…
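  • 64. Amazon SageMaker Autopilot: a service that generates and manages models automatically while keeping model control and visibility, to overcome the shortcomings of conventional AutoML. Automatic model generation: create ML models automatically through automatic model tuning. Recommendations and optimization: obtain a leaderboard and keep improving models. Visibility and data control: notebook source code matching each model. Quick to get started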
  • 65. How Amazon SageMaker Autopilot Works https://github.com/aws/amazon-sagemaker-examples/tree/master/autopilot
  • 67. Use Amazon SageMaker Autopilot to automatically train and tune the best machine learning models
  • 69. Generates the code and notebooks for you: Amazon SageMaker Autopilot Data Exploration notebook; Amazon SageMaker Autopilot Candidate Definition Notebook
  • 70. Built-in algorithms provided by Amazon SageMaker. Classification: Linear Learner, XGBoost, KNN. Regression: Linear Learner, XGBoost, KNN. Working with Text: BlazingText (Supervised, Unsupervised*). Recommendation: Factorization Machines. Forecasting: DeepAR. Topic Modeling: LDA, NTM. Sequence Translation: Seq2Seq*. Clustering: KMeans. Feature Reduction: PCA, Object2Vec. Anomaly Detection: Random Cut Forests, IP Insights. Computer Vision: Image Classification, Object Detection, Semantic Segmentation. https://docs.aws.amazon.com/sagemaker/latest/dg/algos.html
  • 71. Amazon SageMaker overview. SageMaker Studio: integrated development environment (IDE) for ML. PREPARE: SageMaker Ground Truth (label training data for machine learning); SageMaker Data Wrangler (aggregate and prepare data for machine learning); SageMaker Processing (built-in Python, BYO R/Spark); SageMaker Feature Store (store, update, retrieve, and share features); SageMaker Clarify (detect bias and understand model predictions). BUILD: SageMaker Studio Notebooks (Jupyter notebooks with elastic compute and sharing); Built-in and Bring-your-own Algorithms (dozens of optimized algorithms or bring your own); Local Mode (test and prototype on your local machine); SageMaker Autopilot (automatically create machine learning models with full visibility); SageMaker JumpStart (pre-built solutions for common use cases). TRAIN & TUNE: Managed Training (distributed infrastructure management); SageMaker Experiments (capture, organize, and compare every step); Automatic Model Tuning (hyperparameter optimization); Distributed Training Libraries (training for large datasets and models); SageMaker Debugger (debug and profile training runs); Managed Spot Training (reduce training cost by up to 90%). DEPLOY & MANAGE: Managed Deployment (fully managed, ultra low latency, high throughput); Kubernetes & Kubeflow Integration (simplify Kubernetes-based machine learning); Multi-Model Endpoints (reduce cost by hosting multiple models per instance); SageMaker Model Monitor (maintain accuracy of deployed models); SageMaker Edge Manager (manage and monitor models on edge devices); SageMaker Pipelines (workflow orchestration and automation)
  • 73. Build Train Deploy ML infrastructure Operations Security & Compliance Machine Learning in the cloud SageMaker offers up to 96% lower TCO and 10x more developer productivity
  • 74. Capability comparison (Amazon SageMaker / compared to self-managed Amazon EC2 / compared to self-managed Kubernetes on EKS). Provision & manage instances: Fully managed / Self-managed / Managed by AWS. Manage security & compliance: Built-in / Self-managed / Self-managed. Infrastructure performance optimization: Scales automatically / Self-managed / Self-managed. Infrastructure management for high-availability: Optimizes automatically / Self-managed / Self-managed. Source of cost-savings
  • 75. Getting started with • SageMaker Immersion Day Workshop ✯✯✯ • SageMaker Examples (100+) ✯✯✯ • SageMaker Workshop (Korean) • Amazon SageMaker Overview (2020-03-25) • [Video] Amazon SageMaker Overview (2020-03-25) • [Video] Amazon SageMaker demo (2020-03-25) ✯✯✯ • AI/ML Resources - videos, presentation materials, and more
  • 76. Put machine learning in the hands of every developer Our mission at