AI Stack on AWS: Amazon
SageMaker and Beyond
Presented by:
Stepan Pushkarev, CTO @ Provectus
Chris Burns, Senior AI/ML Solutions Architect @ AWS
Pritpal Sahota, Technical Account Executive @ Provectus
Introductions
This webinar is brought to you by Provectus & AWS
Pritpal Sahota, Technical Account Executive, Provectus
Chris Burns, Senior AI/ML Solutions Architect, AWS
Stepan Pushkarev, Chief Technology Officer, Provectus
Provectus: AI consultancy and Solutions provider
Established in 2010, headquartered in Palo Alto
450 engineers and growing
Offices across the US, Canada, and Europe
Clients: fast-growing startups and large enterprises
AWS Competency Partner in DevOps, Data & Analytics, and Machine Learning
SageMaker and Beyond prerequisites
1. Intermediate-to-proficient level in machine learning
a. or proficiency in system / cloud architecture
2. Familiarity with the AWS ecosystem
3. Familiarity with SageMaker fundamentals (notebooks, training, hosting)
SageMaker and Beyond outcomes
1. Deep understanding of Amazon SageMaker capabilities, limitations, and opportunities
2. Best practices for using Amazon SageMaker with open-source tools for a better experience and productivity
3. Holistic understanding of how the ML process integrates into the rest of the AWS architecture
[Diagram: the AWS AI/ML stack with Provectus additions]
Provectus Value-adding AI Solutions: Supply Chain Optimization, Customer Support Automation, Disease Screening & Diagnosis, Worker Health Safety, Customer Retention Optimization, Claims & Document Processing
AWS AI Services (vision, speech, text, search, chatbots, personalization, forecasting, fraud, development, contact centers): Amazon Rekognition, Amazon Polly, Amazon Transcribe (+Medical), Amazon Comprehend (+Medical), Amazon Translate, Amazon Textract, Amazon Kendra, Amazon Lex, Amazon Personalize, Amazon Forecast, Amazon Fraud Detector, Amazon CodeGuru, Contact Lens for Amazon Connect
AWS ML Services (Amazon SageMaker, with the Amazon SageMaker Studio IDE): Ground Truth, Amazon A2I, Neo, built-in algorithms, Notebooks (new), Experiments (new), model tuning, Debugger (new), Autopilot (new), model hosting, Model Monitor (new)
Provectus Foundation Solutions: Feature Store, Kubeflow Orchestration, MLOps, Advanced Monitoring
AWS ML Frameworks & Infrastructure: Deep Learning AMIs & Containers, GPUs & CPUs, Elastic Inference, Inferentia, FPGA
SageMaker is Awesome
Feature Store: store and reuse features to build ML models faster
ML Workflow Orchestrator: reproduce and track the whole ML workflow
Athena ML: run ML model inference from SQL
Dataset Versioning: track and govern training datasets
Data Sampling: sample from production streams
Elastic Inference: save GPU costs
Amazon SageMaker Processing: data processing & model evaluation
ML Infrastructure - Nice to Have or Must-Have?
Must-Have Use Case:
FDA Compliant Disease Screening
Screening at birth for potential pathologies helps find an expert ophthalmologist who can evaluate, treat and prevent disease.
Pr3vent
4 million babies are neither screened nor treated
[Timeline figure: Premature, Infancy, 1-5 years, Kindergarten; annotations: "Screened", "Best time for treatment", "Too late?"]
FDA Guidelines
ML infrastructure to comply with FDA Guidelines
Auditable and trusted environment
Data annotation
Raw data
Experimentation
Model catalogue
Testing
Production inferencing
Monitoring
Maintenance
Start with Data: Data Lake for ML
Enterprise Machine Learning starts with Data
1. Machine Learning Datasets Reproducibility
2. Models Datasets Versioning
3. Machine Learning Datasets Bias detection and Fairness
4. Machine Learning Datasets Auditability
5. Model Data Lake Governance
6. Model Data Monitoring
Data Lake Characteristics
1. Powered by data pipelines
2. Infinite dataset
3. Cheap storage
4. Decoupled from compute
5. Columnar Access
a. Optimized Parquet file size
6. Append only
7. Partitioned
8. Exposes Metadata for each column:
a. Type
b. Description
c. Source (Lineage)
d. SLA
Adding ML Awareness into Data Lake
1. Includes Model Metadata:
a. Prediction, confidence
b. Other model output
c. Model name & version
d. Model Monitoring checks
2. Includes Annotation Metadata
a. Labeling job ID
b. Judgements
c. Agreements
3. Has Governance Metadata for each column:
a. Owner
b. Description
c. Last updated, SLA
d. Upstream ML models (used_by)
e. Statistics (min, max, uniques, nulls)
4. Supports higher level operations
a. Subsample
b. Take a Snapshot
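As a rough illustration of the per-column governance metadata listed above, the sketch below models it as plain Python dataclasses. The class names, field names, and the example column are hypothetical and chosen only for illustration; the deck suggests building this on AWS tools rather than in application code.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ColumnStats:
    """Statistics named on the slide: min, max, uniques, nulls."""
    minimum: Optional[float]
    maximum: Optional[float]
    uniques: int
    nulls: int


@dataclass
class ColumnMetadata:
    """Hypothetical governance record for one data lake column."""
    name: str
    owner: str
    description: str
    sla: str
    used_by: List[str] = field(default_factory=list)  # upstream ML models
    stats: Optional[ColumnStats] = None


def compute_stats(values):
    """Compute the slide's statistics, treating None as a null value."""
    present = [v for v in values if v is not None]
    return ColumnStats(
        minimum=min(present) if present else None,
        maximum=max(present) if present else None,
        uniques=len(set(present)),
        nulls=len(values) - len(present),
    )


# Example column; every value here is made up for the sketch.
age = ColumnMetadata(
    name="age_days",
    owner="data-platform",
    description="Patient age in days at screening",
    sla="daily",
    used_by=["screening-model:1.2"],
)
age.stats = compute_stats([3, 10, None, 10, 42])
```

In practice such records would more likely live in a metadata catalog than in memory; the point is only that each column carries an owner, an SLA, its consumers, and fresh statistics.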
Sampling - generating a versioned dataset
ML Dataset Characteristics
1. Immutable
2. Finite
3. Versioned
4. Could be downloaded locally (DVC)
5. Could be compared with other datasets
6. Exposes Metadata:
a. Dataset Owner
b. Subsample pipeline version
c. Subsample pipeline parameters
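One way to read "immutable, finite, versioned" is that a dataset version is derived from its content and from the sampling parameters. The sketch below assumes a seeded subsample and a content hash as the version identifier; the function name and manifest fields are hypothetical and are not taken from DVC or any AWS service.

```python
import hashlib
import json
import random


def subsample(records, fraction, seed):
    """Deterministically sample records and return (sample, manifest)."""
    rng = random.Random(seed)  # seeded, so the same inputs give the same sample
    sample = [r for r in records if rng.random() < fraction]
    payload = json.dumps(sample, sort_keys=True).encode("utf-8")
    manifest = {
        # Content-derived version: identical samples share an identifier.
        "version": hashlib.sha256(payload).hexdigest()[:12],
        "pipeline_parameters": {"fraction": fraction, "seed": seed},
        "row_count": len(sample),
    }
    return sample, manifest


records = [{"id": i, "label": i % 2} for i in range(1000)]
sample_a, manifest_a = subsample(records, fraction=0.1, seed=7)
sample_b, manifest_b = subsample(records, fraction=0.1, seed=7)
```

Re-running the pipeline with the same parameters reproduces the same version, which is what makes two datasets comparable and auditable later.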
ML Featurization
Feature Store Characteristics
1. Where ML Training job starts
2. Where ML adoption is accelerated
3. Immutable
4. Versioned
5. Each version could be downloaded locally
6. Could be compared with other versions
7. Exposes Metadata:
a. Owner
b. Subsample pipeline version
c. Subsample pipeline parameters
d. Upstream models
e. Feature descriptions
f. Feature versions
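The characteristics above (immutable, versioned, comparable, with owner metadata) can be sketched with a small in-memory store. This is a toy under stated assumptions: the `FeatureStore` class and its methods are hypothetical and do not correspond to any AWS or open-source feature store API.

```python
from types import MappingProxyType


class FeatureStore:
    """Hypothetical in-memory store: each publish creates a read-only version."""

    def __init__(self):
        self._versions = {}  # (feature_set_name, version) -> read-only record

    def publish(self, name, features, owner):
        latest = max((v for (n, v) in self._versions if n == name), default=0)
        version = latest + 1
        self._versions[(name, version)] = MappingProxyType({
            "owner": owner,
            "features": MappingProxyType(dict(features)),
        })
        return version

    def get(self, name, version):
        return self._versions[(name, version)]

    def diff(self, name, old, new):
        """Compare two versions of a feature set by feature name and type."""
        a = self.get(name, old)["features"]
        b = self.get(name, new)["features"]
        return {
            "added": sorted(set(b) - set(a)),
            "removed": sorted(set(a) - set(b)),
            "changed": sorted(k for k in set(a) & set(b) if a[k] != b[k]),
        }


store = FeatureStore()
v1 = store.publish("patient", {"age_days": "int"}, owner="ml-platform")
v2 = store.publish(
    "patient", {"age_days": "float", "birth_weight_g": "int"}, owner="ml-platform"
)
changes = store.diff("patient", v1, v2)
```

Published versions are wrapped in read-only mappings, so a training job that pins a version keeps seeing the same feature definitions.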
Data Layer for ML: Summary
1. Add ML awareness to the data lake by enriching it with ML-specific metadata
2. Invest in reusable sampling, featurization, and other pipeline steps
3. Build it yourself with AWS tools like Amazon EMR, Amazon Athena, Amazon DynamoDB, and the AWS Glue Data Catalog
4. Amplify the adoption of ML by introducing a centralized feature store
Build: SageMaker Experiments
Experimentation Flow
Data Preprocessing
Model Training
Model Evaluation
TensorBoard is good for tracking training
● Log training metrics and other scalars
● Examine execution graph
● TensorFlow, PyTorch
● Hyperparameter tuning
● What-If Tool
● Evaluate model with fairness indicators
● Profiling tool
… but has its flaws
● Tracks training step logs only
● Doesn’t track run parameters
● Comparing runs is not as straightforward as it could be
● TensorFlow, PyTorch only
● Do it Yourself on AWS
Amazon SageMaker Experiments
● Offers seamless integration into the existing ML workflow
● Offers a structured organization scheme to help users group and organize
their machine learning iterations
● Provides tracking and analytics of experiments
● Facilitates decomposition of monolithic workflow into multiple steps
Tracking Capabilities
● Parameters
● Inputs
● Outputs
● Artifacts
● Metrics
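To make the five tracked categories concrete, here is a minimal local sketch of a trial record and a flattening step that produces one comparable row per trial. It is not the SageMaker Experiments SDK; `TrialRecord`, `to_rows`, and all example values are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class TrialRecord:
    """Hypothetical record holding the five tracked categories for one trial."""
    trial_name: str
    parameters: dict = field(default_factory=dict)
    inputs: dict = field(default_factory=dict)
    outputs: dict = field(default_factory=dict)
    artifacts: dict = field(default_factory=dict)
    metrics: list = field(default_factory=list)

    def log_metric(self, name, value, step):
        self.metrics.append({"name": name, "value": value, "step": step})

    def latest(self, name):
        values = [m["value"] for m in self.metrics if m["name"] == name]
        return values[-1] if values else None


def to_rows(trials):
    """Flatten trials into one row each: parameters plus last metric values."""
    rows = []
    for trial in trials:
        row = {"trial_name": trial.trial_name, **trial.parameters}
        for name in sorted({m["name"] for m in trial.metrics}):
            row[name + "_last"] = trial.latest(name)
        rows.append(row)
    return rows


baseline = TrialRecord("lr-0.1", parameters={"learning_rate": 0.1})
baseline.log_metric("val_loss", 0.90, step=1)
baseline.log_metric("val_loss", 0.42, step=2)

candidate = TrialRecord("lr-0.01", parameters={"learning_rate": 0.01})
candidate.log_metric("val_loss", 0.95, step=1)
candidate.log_metric("val_loss", 0.37, step=2)

rows = to_rows([baseline, candidate])
```

Rows of this shape are what make runs easy to compare side by side, which is the gap the deck points out in TensorBoard's step-level logs.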
Analyzing experiments in Studio
● Visualize information about experiments and their trials in real time with predefined widgets in Amazon SageMaker Studio
Analyzing experiments using SDK
● All logged information about an experiment can be easily exported to a pandas DataFrame
Amazon SageMaker Experiments: Summary
Pros
○ Fully managed
○ Ability to track a rich set of parameters
○ Ability to build complex plots from Studio
○ Ability to extract all logged information for custom analysis
○ Native integration with Amazon SageMaker Autopilot, Amazon SageMaker Endpoints
Current limitations / things to be aware of
○ Does not allow building complex DAGs, i.e. sequential execution only
○ Lack of instruments for configuring robust pipelines
○ Available within Amazon SageMaker Studio only; per-user context, cannot compare runs by different users
○ Cannot compare trials from different runs
Build & Train: Orchestration
Beyond SageMaker Experiments
Kubeflow: Orchestrator of Choice
Orchestrate it all with Kubeflow Pipelines
Kubeflow on AWS
Best Practices:
● Invest in a library of reusable components
● Use SageMaker Operators for Kubernetes
● Deploy on EKS
● Use separate on-demand/spot node groups for CPU/GPU-bound ML tasks
● Use Amazon FSx for Lustre to avoid data transfer from Amazon S3
● Integrate with Amazon Cognito
Kubeflow on AWS
Challenges:
● Under rapid development
● Still needs Ops support even on EKS
● Resource management between service and ML workloads
● Poor support from AWS community
Kubeflow Pipelines: Summary
● Extends beyond SageMaker ecosystem
● Built on top of Argo Workflows, facilitates GitOps
● Allows building complex processing DAGs
● Rich, purpose-built UI
● Growing open-source community
● Requires deep Kubernetes/Ops expertise
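The difference between "sequential execution only" and "complex processing DAGs" can be shown without any orchestrator. The sketch below uses Python's standard-library `graphlib` (Python 3.9+) rather than the Kubeflow Pipelines SDK; the step names are hypothetical.

```python
from graphlib import TopologicalSorter

# Each step maps to the set of steps it depends on (hypothetical pipeline).
pipeline = {
    "sample_data": set(),
    "featurize": {"sample_data"},
    "train_xgboost": {"featurize"},
    "train_dnn": {"featurize"},
    "evaluate": {"train_xgboost", "train_dnn"},
    "deploy": {"evaluate"},
}


def execution_waves(graph):
    """Group steps into waves; steps within a wave could run in parallel."""
    sorter = TopologicalSorter(graph)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


waves = execution_waves(pipeline)
```

Here the two training steps land in the same wave and `evaluate` waits for both, which a purely sequential runner cannot express.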
Build: SageMaker Debugger
How to debug models?
Code
● Unit tests
● Logging
● Peer review
Experiments
● Assert model parameters
● Track loss curves / metrics during training
● Check model outputs
Can we go beyond curves?
SageMaker Debugger: Logging + Statistics + Alerts
Out of the box Rules
● Vanishing gradients
● Overfitting
● Poor weight initialization
● Saturated activations
● Overpruned trees
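To give a feel for what a rule such as "vanishing gradients" might check, the sketch below flags training steps where any layer's gradient norm drops below a threshold. This is an illustrative stand-in, not SageMaker Debugger's implementation; the function names, the threshold of 1e-7, and the gradient values are all assumptions.

```python
import math


def l2_norm(values):
    """Euclidean norm of a flat list of gradient values."""
    return math.sqrt(sum(v * v for v in values))


def vanishing_gradient_steps(gradients_by_step, threshold=1e-7):
    """Return (step, layer_names) pairs where a gradient norm is below threshold."""
    flagged = []
    for step, layers in sorted(gradients_by_step.items()):
        small = sorted(
            name for name, grad in layers.items() if l2_norm(grad) < threshold
        )
        if small:
            flagged.append((step, small))
    return flagged


# Saved gradients per training step and layer (made-up numbers).
history = {
    0: {"dense_1": [0.2, -0.1], "dense_2": [0.05, 0.03]},
    100: {"dense_1": [1e-9, -2e-9], "dense_2": [0.04, 0.02]},
}
alerts = vanishing_gradient_steps(history)
```

A rule like this works on saved tensors rather than on the loss curve alone, which is the sense in which the deck says Debugger goes beyond curves.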
SageMaker Debugger: Summary
Pros
● Flowing through the graph: goes beyond watching scalars (losses) during training and provides full visibility into the history of all tensors
● Early stopping & near-real-time alerts
● Requires minimal instrumentation of the model code
● Growing set of out-of-the-box Rules
Current limitations / things to be aware of
● No warnings, errors only
● Not available for built-in algorithms
Deploy: SageMaker Model Monitor
SageMaker Model Monitoring Goal
Monitoring production data quality
Alerts when issues appear
[Figure labels: Training Data, Production Data]
[Diagram, built up over several slides: requests go to a SageMaker endpoint, which returns predictions; production requests are written to storage; a SageMaker Processing Job produces baseline statistics from the training data; a Scheduled Monitoring Job generates reports: statistics and violations.]
What is REALLY SageMaker Model Monitoring?
Scheduled Monitoring Job with a pre-built container, in a nutshell:
➔ Min
➔ Max
➔ Sum
➔ Sample Count
➔ Average
➔ Completeness
➔ Baseline Drift == two sample KS test
➔ Missing Columns
➔ Excessive columns
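The checks above are simple enough to sketch in plain Python. The block below computes the listed per-column statistics and a two-sample Kolmogorov-Smirnov statistic between baseline and production values. It is not the pre-built container's code; the function names and the drift threshold of 0.1 are assumptions made for the sketch.

```python
from bisect import bisect_right


def ks_statistic(baseline, production):
    """Two-sample KS statistic: the largest gap between the empirical CDFs."""
    a, b = sorted(baseline), sorted(production)
    gap = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # share of baseline values <= x
        cdf_b = bisect_right(b, x) / len(b)  # share of production values <= x
        gap = max(gap, abs(cdf_a - cdf_b))
    return gap


def column_report(baseline, production, drift_threshold=0.1):
    """Statistics for one numeric column; None marks a missing value."""
    present = [v for v in production if v is not None]
    distance = ks_statistic(baseline, present)
    return {
        "min": min(present),
        "max": max(present),
        "sum": sum(present),
        "sample_count": len(production),
        "average": sum(present) / len(present),
        "completeness": len(present) / len(production),
        "ks_statistic": distance,
        "drift_violation": distance > drift_threshold,  # assumed threshold
    }


report = column_report(baseline=[1, 2, 3, 4], production=[3, 4, 5, 6, None])
```

The statistic only measures how far apart the two distributions are; turning it into an alert still requires choosing a threshold that suits the feature.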
[Diagram: the same monitoring flow (SageMaker endpoint, production request storage, training data, baseline statistics, SageMaker Processing Job, generated reports: statistics and violations), with "ANYTHING YOU WANT" in place of the Scheduled Monitoring Job]
Provectus Value Add Model Monitoring Features
1. Real-time processing and alerts
2. Image Data Drift
3. Text Data Drift
4. Anomaly Detection
5. Interpretability of drift
SageMaker Monitor: Summary
1. Built-in container with schema extractor from training data
2. Built-in container with Min/Max/Mean and KS test
3. Fully managed data wrangling, traffic shadowing, job scheduling, pushing metrics to CloudWatch and retrieving latest job results
[Diagram label: ANYTHING YOU WANT]
Webinar Takeaways
● Modern ML infrastructure accelerates time to value for ML initiatives and increases trust from the business
● Amazon SageMaker has the broadest and deepest set of fully managed tools for building and managing AI applications at scale
● Complement it with the rest of AWS tools for data processing, storage & metadata management
● Complement it with mature open-source tools to go beyond main offerings
125 University Avenue
Suite 290, Palo Alto
California, 94301
hello@provectus.com
Questions, details?
We would be happy to answer!

 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019Provectus
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019Provectus
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...Provectus
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019Provectus
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMProvectus
 
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC Meetup
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC MeetupYurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC Meetup
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC MeetupProvectus
 
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC Meetup
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC MeetupAndrei Grigoriev | Version Control in Data Science | Kazan ODSC Meetup
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC MeetupProvectus
 

Plus de Provectus (20)

Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
 
Choosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare OrganizationsChoosing the Right Document Processing Solution for Healthcare Organizations
Choosing the Right Document Processing Solution for Healthcare Organizations
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMRCost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
Cost Optimization for Apache Hadoop/Spark Workloads with Amazon EMR
 
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
ODSC webinar "Kubeflow, MLFlow and Beyond — augmenting ML delivery" Stepan Pu...
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K..."Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
 
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ..."How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
"How to build a global serverless service", Alex Casalboni, AWS Dev Day Kyiv ...
 
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky..."Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
"Automating AWS Infrastructure with PowerShell", Martin Beeby, AWS Dev Day Ky...
 
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2..."Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
"Analyzing your web and application logs", Javier Ramirez, AWS Dev Day Kyiv 2...
 
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma..."Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
"Resiliency and Availability Design Patterns for the Cloud", Sebastien Storma...
 
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ..."Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
"Architecting SaaS solutions on AWS", Oleksandr Mykhalchuk, AWS Dev Day Kyiv ...
 
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
"Developing with .NET Core on AWS", Martin Beeby, AWS Dev Day Kyiv 2019
 
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
"How to build real-time backends", Martin Beeby, AWS Dev Day Kyiv 2019
 
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti..."Integrate your front end apps with serverless backend in the cloud", Sebasti...
"Integrate your front end apps with serverless backend in the cloud", Sebasti...
 
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
"Scaling ML from 0 to millions of users", Julien Simon, AWS Dev Day Kyiv 2019
 
How to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAMHow to implement authorization in your backend with AWS IAM
How to implement authorization in your backend with AWS IAM
 
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC Meetup
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC MeetupYurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC Meetup
Yurii Gavrilin | ML Interpretability: From A to Z | Kazan ODSC Meetup
 
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC Meetup
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC MeetupAndrei Grigoriev | Version Control in Data Science | Kazan ODSC Meetup
Andrei Grigoriev | Version Control in Data Science | Kazan ODSC Meetup
 

Dernier

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 

Dernier (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 

AI Stack on AWS: Amazon SageMaker and Beyond

  • 1. AI Stack on AWS: Amazon SageMaker and Beyond Presented by: Stepan Pushkarev, CTO @ Provectus Chris Burns, Senior AI/ML Solutions Architect @ AWS Pritpal Sahota, Technical Account Executive @ Provectus
  • 2. Introductions This webinar is brought to you by Provectus & AWS Pritpal Sahota Technical Account Executive, Provectus Chris Burns Senior AI/ML Solutions Architect, AWS Stepan Pushkarev Chief Technology Officer, Provectus
  • 3. Provectus: AI consultancy and solutions provider. Established in 2010, headquartered in Palo Alto. 450 engineers and growing. Offices across the US, Canada, and Europe. Clients: fast-growing startups and large enterprises. AWS Competency Partner in DevOps, Data & Analytics, and Machine Learning
  • 4. SageMaker and Beyond prerequisites 1. Mid-to-proficiency level in Machine Learning a. or proficiency level in system / cloud architecture 2. Familiarity with AWS ecosystem 3. Familiarity with SageMaker fundamentals (notebooks, training, hosting)
  • 5. SageMaker and Beyond outcomes 1. Deep understanding of Amazon SageMaker capabilities, limitations, and opportunities 2. Best practices for using Amazon SageMaker with open-source tools for better experience and productivity 3. Holistic understanding of integration of ML process into the rest of AWS architecture
  • 6. VISION SPEECH TEXT SEARCH NEW CHATBOTS PERSONALIZATION FORECASTING FRAUD NEW DEVELOPMENT NEW CONTACT CENTERS Amazon SageMaker Amazon SageMaker Ground Truth Amazon A2I Amazon SageMaker Neo Built-in algorithms SageMaker Notebooks NEW SageMaker Experiments NEW Model tuning SageMaker Debugger NEW SageMaker Autopilot NEW Model hosting SageMaker Model Monitor NEW Deep Learning AMIs & Containers GPUs & CPUs Elastic Inference Inferentia FPGA Amazon Rekognition Amazon Polly Amazon Transcribe +Medical Amazon Comprehend +Medical Amazon Translate Amazon Lex Amazon Personalize Amazon Forecast Amazon Fraud Detector Amazon CodeGuru AWS AI Services AWS ML Services + Provectus Foundation Solutions AWS ML Frameworks & Infrastructure Amazon Textract Amazon Kendra Contact Lens For Amazon Connect Amazon SageMaker Studio IDE NEW NEW Supply Chain Optimization Customer Support Automation Disease Screening & Diagnosis Worker Health Safety Customer Retention Optimization Claims & Document Processing Provectus Value-adding AI Solutions Feature Store Kubeflow Orchestration MLOps Advanced Monitoring NEW
  • 8. Feature Store Store and reuse features to build ML models faster ML Workflow Orchestrator Reproduce and track the whole ML Workflow Athena ML Inference ML models from SQL Dataset Versioning Track and govern training datasets Data Sampling Sample from production streams Elastic Inference Save GPU costs Amazon SageMaker Processing Data Processing & Model Evaluation
  • 9. ML Infrastructure - Nice to Have or Must-Have?
  • 10. Must-Have Use Case: FDA Compliant Disease Screening
  • 11. Screening at birth for potential pathologies helps find an expert ophthalmologist who can evaluate, treat and prevent disease. Pr3vent
  • 12. Pr3vent Best time for treatment Screened Too late? 4 million babies are neither screened nor treated Infancy, 1-5 years Kindergarten Premature
  • 14. ML infrastructure to comply with FDA Guidelines Auditable and trusted environment Data annotation Raw data Experimentation Model catalogue Testing Production inferencing Monitoring Maintenance
  • 15. Start with Data: Data Lake for ML
  • 16. Enterprise Machine Learning starts with Data 1. Machine Learning Datasets Reproducibility 2. Models Datasets Versioning 3. Machine Learning Datasets Bias detection and Fairness 4. Machine Learning Datasets Auditability 5. Model Data Lake Governance 6. Model Data Monitoring
  • 17. Data Lake Characteristics 1. Powered by data pipelines 2. Infinite dataset 3. Cheap storage 4. Decoupled from compute 5. Columnar Access a. Optimized Parquet file size 6. Append only 7. Partitioned 8. Exposes Metadata for each column: a. Type b. Description c. Source (Lineage) d. SLA
  • 18. 1. Includes Model Metadata: a. Prediction, confidence b. Other model output c. Model name & version d. Model Monitoring checks 2. Includes Annotation Metadata a. Labeling job ID b. Judgements c. Agreements 3. Has Governance Metadata for each column: a. Owner b. Description c. Last updated, SLA d. Upstream ML models (used_by) e. Statistics (min, max, uniques, nulls) 4. Supports higher level operations a. Subsample b. Take a Snapshot Adding ML Awareness into Data Lake
  • 19. Sampling - generating a versioned dataset
  • 20. ML Dataset Characteristics 1. Immutable 2. Finite 3. Versioned 4. Could be downloaded locally (DVC) 5. Could be compared with other datasets 6. Exposes Metadata: a. Dataset Owner b. Subsample pipeline version c. Subsample pipeline parameters
  • 22. Feature Store Characteristics 1. Where ML Training job starts 2. Where ML adoption is accelerated 3. Immutable 4. Versioned 5. Each version could be downloaded locally 6. Could be compared with other versions 7. Exposes Metadata: a. Owner b. Subsample pipeline version c. Subsample pipeline parameters d. Upstream models e. Feature descriptions f. Feature versions
  • 23. Data Layer for ML: Summary 1. Add ML Awareness into Data Lake by enriching it with ML specific metadata 2. Invest into reusable sampling, featurization and other steps of the pipeline 3. Build it yourself with AWS tools like Amazon EMR, Athena, DynamoDB, AWS Glue Data Catalog 4. Amplify the adoption of ML by introducing a centralized feature store
  • 26. Tensorboard is good to track Training ● Log training metrics and other scalars ● Examine execution graph ● TensorFlow, PyTorch ● Hyperparameter tuning ● What-IF tool ● Evaluate model with fairness indicators ● Profiling tool
  • 27. … but has its flaws ● Tracks training step logs only ● Doesn’t track run parameters ● Comparing runs is not as straightforward as it could be ● TensorFlow, PyTorch only ● Do it Yourself on AWS
  • 28. Amazon SageMaker Experiments ● Offers seamless integration into the existing ML workflow ● Offers a structured organization scheme to help users group and organize their machine learning iterations ● Provides tracking and analytics of experiments ● Facilitates decomposition of monolithic workflow into multiple steps
  • 29. Tracking Capabilities ● Parameters ● Inputs ● Outputs ● Artifacts ● Metrics
  • 30. Analyzing experiments in Studio ● Visualize information about experiments and their trials in real-time with predefined widgets using Amazon SageMaker Studio
  • 31. Analyzing experiments using SDK ● All logged information about an experiment can be easily exported to a Pandas DataFrame
  • 32. Amazon SageMaker Experiments: Summary Pros ○ Fully managed ○ Ability to track a rich set of parameters ○ Ability to build complex plots from Studio ○ Ability to extract all logged information for custom analysis ○ Native integration with Amazon SageMaker Autopilot, Amazon SageMaker Endpoints Current limitations / things to be aware of ○ Does not allow building complex DAGs, i.e. sequential execution only ○ Lack of instruments for configuring robust pipelines ○ Available within Amazon SageMaker Studio only - per user context, cannot compare runs by different users ○ Cannot compare trials from different runs
  • 33. Build & Train: Orchestration Beyond SageMaker Experiments
  • 35. Orchestrate it all with Kubeflow Pipelines
  • 36. Kubeflow on AWS Best Practices: ● Invest into a library of reusable components ● Use SageMaker Operators for Kubernetes ● Deploy on EKS ● Use separate on-demand/spot nodegroups for CPU/GPU bound ML tasks ● Use Amazon FSx for Lustre to avoid data transfer from Amazon S3 ● Integrate with Amazon Cognito
  • 37. Kubeflow on AWS Challenges: ● Under rapid development ● Still needs Ops support even on EKS ● Resource management between service and ML workloads ● Poor support from AWS community Best Practices: ● Invest into a library of reusable steps ● Use SageMaker Operators for Kubernetes ● Deploy on EKS ● Use separate on-demand/spot nodegroups for CPU/GPU bound ML tasks ● Use Amazon FSx for Lustre to avoid data transfer from Amazon S3 ● Integrate with Amazon Cognito
  • 38. Kubeflow Pipelines: Summary ● Extends beyond SageMaker ecosystem ● Built on top of Argo Workflows, facilitates GitOps ● Allows building complex processing DAGs ● Rich purpose-built UI ● Growing open-source community ● Requires deep Kubernetes/Ops expertise
  • 40. Code ● Unit tests ● Logging ● Peer review How to debug models? Experiments ● Assert model parameters ● Track loss curves / metrics during training ● Check model outputs
  • 41.
  • 42. Can we go beyond curves?
  • 43. SageMaker Debugger — Logging + Statistics + Alerts
  • 44. ● Vanishing gradients ● Overfitting ● Poor weight initialization ● Saturated activations ● Overpruned trees Out of the box Rules
  • 45. SageMaker Debugger: Summary Pros ● Flowing through the graph: goes beyond watching scalars (losses) during training and provides full visibility into history of all tensors ● Early stopping & near real time alerts ● Requires minimal instrumentation of the model code ● Growing set of out-of-the-box Rules Current limitations / things to be aware of ● No warnings, errors only ● Not available for built-in algorithms
  • 48. SageMaker Model Monitoring Goal Training Data Production Data
  • 50. SageMaker endpoint requests predictions production request storage training data
  • 51. SageMaker endpoint requests predictions production request storage training data baseline statistics SageMaker Processing Job
  • 52. SageMaker endpoint requests predictions training data baseline statistics SageMaker Processing Job Scheduled Monitoring Job generated reports: statistics and violations production request storage
  • 53. SageMaker endpoint requests predictions SageMaker Processing Job Scheduled Monitoring Job generated reports: statistics and violations training data baseline statistics production request storage
  • 54. SageMaker endpoint requests predictions training data baseline statistics SageMaker Processing Job Scheduled Monitoring Job generated reports: statistics and violations production request storage
  • 56. Scheduled Monitoring Job Pre Built Container in a nutshell
  • 57. Scheduled Monitoring Job ➔ Min ➔ Max ➔ Sum ➔ Sample Count ➔ Average ➔ Completeness ➔ Baseline Drift == two sample KS test ➔ Missing Columns ➔ Excessive columns
  • 58.
  • 59. SageMaker endpoint requests predictions production request storage training data baseline statistics SageMaker Processing Job generated reports: statistics and violations ANYTHING YOU WANT
  • 60. Provectus Value Add Model Monitoring Features (ANYTHING YOU WANT) 1. Realtime processing and alerts 2. Image Data Drift 3. Text Data Drift 4. Anomaly Detection 5. Interpretability of drift
  • 61. SageMaker Monitor: Summary 1. Built-in container with schema extractor from training data 2. Built-in container with Min/Max/Mean and KS test 3. Fully managed data wrangling, traffic shadowing, job scheduling, pushing metrics to CloudWatch and retrieving latest job results
  • 62. Webinar Takeaways ● Modern ML infrastructure accelerates time to value for ML initiatives and increases trust from the business ● Amazon SageMaker has the broadest and deepest set of fully managed tools for building and managing AI applications at scale ● Complement it with the rest of AWS tools for data processing, storage & metadata management ● Complement it with mature open-source tools to go beyond main offerings
  • 63. 125 University Avenue Suite 290, Palo Alto California, 94301 hello@provectus.com Questions, details? We would be happy to answer!
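
Slide 57 describes baseline drift as a two-sample KS test between training (baseline) data and captured production data. The sketch below illustrates only the statistic itself, in plain Python; it is not the code of the pre-built monitoring container. The function names and the 0.1 threshold are assumptions chosen for illustration.

```python
from bisect import bisect_right


def ks_statistic(baseline, production):
    """Two-sample Kolmogorov-Smirnov statistic.

    Returns the largest absolute gap between the empirical CDFs of the
    two samples (0.0 = identical distributions, 1.0 = fully separated).
    """
    if not baseline or not production:
        raise ValueError("both samples must be non-empty")
    a, b = sorted(baseline), sorted(production)
    # Both empirical CDFs are step functions, so the gap between them
    # can only change at an observed value; checking those is enough.
    return max(
        abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
        for x in a + b
    )


def has_drifted(baseline, production, threshold=0.1):
    """Flag drift when the KS statistic exceeds a (hypothetical) threshold."""
    return ks_statistic(baseline, production) > threshold
```

A scheduled job could call has_drifted once per feature column, with the baseline column taken from training data and the production column taken from stored endpoint requests. A real deployment would choose the threshold per feature, or use a p-value instead of a fixed cutoff.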
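
Slide 20 lists the properties of an ML dataset: immutable, versioned, comparable with other datasets, and carrying metadata (owner, subsample pipeline version, subsample pipeline parameters). A minimal sketch of those properties follows, assuming rows are JSON-serializable dicts and that a content hash is an acceptable version identifier. The class and function names are hypothetical and not taken from the webinar or from DVC.

```python
import hashlib
import json
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: attributes cannot be reassigned
class DatasetVersion:
    version: str            # content hash of rows + sampling settings
    owner: str
    pipeline_version: str
    pipeline_params: tuple  # sorted (key, value) pairs
    rows: tuple             # rows serialized to canonical JSON strings


def snapshot(rows, owner, pipeline_version, pipeline_params):
    """Freeze a sampled dataset and derive its version from its content."""
    payload = json.dumps(
        {
            "rows": rows,
            "pipeline_version": pipeline_version,
            "params": pipeline_params,
        },
        sort_keys=True,
    )
    digest = hashlib.sha256(payload.encode("utf-8")).hexdigest()[:12]
    return DatasetVersion(
        version=digest,
        owner=owner,
        pipeline_version=pipeline_version,
        pipeline_params=tuple(sorted(pipeline_params.items())),
        rows=tuple(json.dumps(r, sort_keys=True) for r in rows),
    )


def diff(old, new):
    """Compare two versions: rows added and rows removed."""
    return set(new.rows) - set(old.rows), set(old.rows) - set(new.rows)
```

Because the version is derived from the rows and the sampling settings, the same sampling run produces the same version, and any change to either produces a new one.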