SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Accelerate ML Lifecycle with
Kubernetes and Containerized
Data Science Tools
April 16th, 2020
1
Abhinav Joshi & Tushar Katarki
Red Hat
2
Abhinav Joshi
Senior Manager, Red Hat OpenShift Product Marketing
19+ yrs IT experience, 2 yrs at Red Hat, ex VMware, NetApp, Cisco
Email: abhjoshi@redhat.com
LinkedIn: https://www.linkedin.com/in/abhinavjoshi/
Tushar Katarki
Senior Manager, Red Hat OpenShift Product Management
20 yrs IT experience, 8 yrs at Red Hat, Ex Oracle/Sun, Polycom, etc
Email: tkatarki@redhat.com
LinkedIn: https://www.linkedin.com/in/katarki/
What we’ll
discuss today
3
● Desired AI/ML architecture & execution challenges
● Why containers, Kubernetes, and DevOps for AI/ML
● Enterprise Kubernetes Platform examples
● Real world deployment use cases
4
Desired Architecture and
execution challenges
5
AI/ML lifecycle and key personas
App developer
IT operations
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML
models in app
dev process
Implement
Apps &
Inference
ML models
Monitoring &
Management
Data engineer
Business
leadership
Data scientists
ML Engineer
ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.)
Desired Conceptual Architecture
6
ML/DL data pipeline and sources (databases, data lake, etc.)
Compute acceleration (GPU, FPGA, TPU)
Hybrid, multi cloud platform with self service capabilities
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML models
in app dev process
Implement
Apps & inference
ML models
monitoring &
management
Infrastructure
Virtual Private Public Hybrid EdgePhysical
AI/ML execution challenges
7
Lots of data is
collected, but finding
and preparing the right
data is difficult.
Readily usable data
lacking
Lack of key skills make it
difficult to find and
secure talent to
maintain operations.
Talent
shortage
No rapid availability of
infrastructure and
software tools slows data
scientists and developers
Unavailability of
infrastructure & software
Unable to implement
quickly due to slow,
manual and siloed
operations.
Lack of collaboration
across teams
Containers, Kubernetes, and DevOps can help!
What does a Data Scientist care about?
As a Data Scientist, I want a
“self-service cloud like” experience
for my Machine Learning projects,
where I can access a rich set of
modelling tools, data, and
computational resources, share and
collaborate with colleagues, and
deliver my work into production
with speed, agility and repeatability
to drive business value!
Self service portal to access ML
tools and access sources
ML model deployment
in app dev process
Data Scientists care less about infrastructure platform unless it integrates with their
ML tooling, and provides them the agility, flexibility, portability, & scalability.
8
ML Modelling / hardware
acceleration
Inferencing w/ hardware
acceleration
ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.)
Containers, Kubernetes, and DevOps as part of the Hybrid, Multi Cloud Platform
9
ML/DL data sources - databases (SQL, NoSQL, etc.), data lake, etc.
Compute acceleration (GPU, FPGA, TPU)
Hybrid, multi cloud platform with self service capabilities
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML models
in app dev process
Implement
Apps & inference
ML models
monitoring &
management
Infrastructure
Virtual Private Public Hybrid EdgePhysical
● Containers
● Kubernetes
● DevOps
10
Containers,
Kubernetes, and
DevOps help
accelerate your
AI/ML initiatives.
Why containers,
Kubernetes, and DevOps
for AI/ML?
Containers
Basic units that make AI/ML
programs shareable and portable
across hybrid cloud
Choice: Containers contain all your ML
frameworks and tools
Sharing: Container images can be shared and
iterated in flexible ways
Immutable & Portable: Contain once and run
them anywhere with integrity
Versioning: Incremental changes are tracked
Fast & Efficient: They are Linux processes!
Security: Process isolation and resource
control
Container Host Operating System
Container
App
Supporting Files &
Runtime
Container
App
...
Container
App
...
App
Kubernetes
▸ Centralizes compute resources and
provides a consistent experience across the
data center, cloud, and edge
▸ Resource management for compute
resources (including GPUs and FPGA)
▸ Workload scheduling and management
▸ Multi tenancy and quotas enforcement
▸ Networking and storage abstractions
Kubernetes is the de facto container
management platform for the hybrid, multi
cloudFoundation for the Hybrid, Multi Cloud
Platform w/ self service capabilities for
Data Scientists, Developers, etc.
Self-service,
Automation, CI/CD
Boosts speed, efficiency and
productivity
▸ Jupyter Notebooks running on Kubernetes form
the basis for self-service
▸ Source-2-image automatically converts a
notebook into a container image that is ready to
be deployed
▸ Kubernetes Operators provide automation and
lifecycle management for the containers
▸ CI/CD makes rapid, incremental and iterative
change possible; Open source technologies such
as Argo, Tekton, Jenkins and Spinnaker in
conjunction with Kubernetes make this happen
▸ ‘Serverless’ technologies such as Knative will
enable AI/ML users to spend more time
developing their models
Image source: https://www.brainvire.com/devops/
Data
Engineering
Easy, self-service and repeatable
Data sources: Kubernetes Persistent Volumes and
S3 object store makes access to storage easy and
standardized
Data pipes: Kubernetes Networking and
ServiceMesh provides the data connectivity - high
bandwidth, low latency that is secure
Data streaming and manipulation: Tools such as
Apache Spark, Kafka, Presto etc. can run natively and
can be accessed as a service
Data governance: With open source technologies
like Open Policy Agent (OPA)
Deploying into
production
To deliver business value and
redeem the promise of AI in the
enterprise
Containerize models and expose the service
with an REST API using the microservices
pattern - ServiceMesh (such as ISTIO) makes
this easy !
Models are incorporated in a data pipeline
Jobs (batch or real-time) with tools such as
Spark, Kafka and Argo
Models are delivered into existing
application workflow as binaries: PMML,
ONNX, Pickle
Monitoring model performance and drift
with open source tools native to Kubernetes i.e.
Prometheus and Grafana
CI/CD to drive continuous change and
improvement in production
15
Stitching all of this together into an…...
“Enterprise Kubernetes and Container Platform for AI/ML”
DatacenterLaptop
ANY
INFRASTRUCTURE
Intelligent
Applications
ENTERPRISE CONTAINER HOST(S)
CONTAINER ORCHESTRATION AND MANAGEMENT
(KUBERNETES)
AI/ML/DL & DevOps tool chain
Machine Learning
Modeling
Data Pipeline
Open Source community ML
toolkit for Kubernetes
16
17
Examples of Enterprise
Kubernetes and
Container Platforms
Red Hat OpenShift Kubernetes Platform
EXISTING
AUTOMATION
TOOLSETS
SCM
(GIT)
CI/CD
DATA SCIENTIST
Deploy ML on
any cloud
18
Developer Productivity
Cluster Services
Automated Ops ⠇Over-The-Air Updates ⠇Monitoring ⠇Logging ⠇Registry ⠇Networking ⠇Router ⠇KubeVirt ⠇OLM ⠇Helm
Red Hat Enterprise Linux & RHEL CoreOS
Kubernetes
Developer CLI ⠇VS Code
extensions ⠇IDE Plugins
Code Ready Workspaces
CodeReady Containers
Service Mesh ⠇Serverless
Builds ⠇CI/CD Pipelines
Full Stack Logging
Chargeback
Databases ⠇Languages
Runtimes ⠇Integration
Business Automation
100+ ISV Services
Platform Services Application Services Developer Services
Physical
Virtual Private cloud Public cloud
Build Cloud-Native AppsManage Workloads
Multi-cluster Management
Discovery ⠇Policy ⠇Compliance ⠇Configuration ⠇Workloads
Managed cloud
(Azure, AWS, IBM, Red Hat)
Windows Server
Nodes
Expose ML as
services, load
balanced and
scalable
Compute
Resources
on-demand
Best of SDLC
ML in
Production
Open source community project
● Open Source AI/ML Tooling
● Open source Red Hat
technologies e.g. OpenShift
● Automated deployment of open
source AI/ML tooling with
Kubernetes Operators
● https://www.opendatahub.io
Open Data Hub - “Data and AI Platform for the Hybrid Cloud”
Relationship between Kubeflow and Open Data Hub project
ML-as-a-service platform based on OpenShift,
Ceph storage, Kafka, JupyterHub and Spark
Home for k8s community to share
operators for various apps/tools
20
21
Examples of containerized data science on Red Hat OpenShift
Connected Drive &
Autonomous Driving
Data driven diagnosis
Data driven diagnosis
Democratize data science for oil
and gas exploration
Containerized Apache
Spark
Healthcare and public sector Automotive Financial Oil and gas
Discover Financial
ServicesJupyter notebooks as a service
Ministry of Defence (Israel)
RBC Bank
ML/DL with Jupyter Notebook on Enterprise Kubernetes
Platform
22
DATA SCIENTIST
ML/DL
Model
test &
iteration
Model deployed
into production via
Inference server
Data Sources Access to
multiple data
sources
Kubernetes Platform
Kubernetes Platform
Integrated
GPU access
Integrated
GPU access
Example Data Science Delivery Model on OpenShift
23
Source: https://assets.openshift.com/hubfs/OpenShift-Commons-SF-Agile-Data-Science-ExxonMobil.pdf
● kubeflow.org
● openshift.com/ai-ml
● opendatahub.io
24
RESOURCES
Summary
25
Containers, Kubernetes, &
DevOps can help
Agility, self-service, hybrid
cloud portability, scalability,
flexibility, automation
AI/ML benefits businesses
AI-powered intelligent
applications help achieve
key business goals, but
execution challenges
exists
Enterprise Kubernetes &
Container Platforms make it real
Allows leveraging the
benefits of containers,
Kubernetes, DevOps, and
accelerate delivery of
AI-powered apps
Abhinav Joshi: abhjoshi@redhat.com
Tushar Katarki: tkatarki@redhat.com
Thank you
26

Contenu connexe

Tendances

Leveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformationLeveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformationJohn Archer
 
Big data + cloud computing glossary for community
Big data + cloud computing glossary for communityBig data + cloud computing glossary for community
Big data + cloud computing glossary for communityKumar Chinnakali
 
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformNatalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformmatteo mazzeri
 
Azure Digital Twins 2.0
Azure Digital Twins 2.0Azure Digital Twins 2.0
Azure Digital Twins 2.0Marco Parenzan
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsVMware Tanzu
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builderTimothy Spann
 
Introducción al Machine Learning Automático
Introducción al Machine Learning AutomáticoIntroducción al Machine Learning Automático
Introducción al Machine Learning AutomáticoSri Ambati
 
Quantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computingQuantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computingData Con LA
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLDESMOND YUEN
 
What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3DataWorks Summit
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes John Archer
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservicesconfluent
 
Big data on google cloud
Big data on google cloudBig data on google cloud
Big data on google cloudTu Pham
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersMesosphere Inc.
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)DataWorks Summit
 
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with KubernetesWhen HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with KubernetesYong Feng
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - HighlightsKubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - HighlightsKrishna-Kumar
 

Tendances (20)

Leveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformationLeveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformation
 
Big data + cloud computing glossary for community
Big data + cloud computing glossary for communityBig data + cloud computing glossary for community
Big data + cloud computing glossary for community
 
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformNatalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
 
Azure Digital Twins 2.0
Azure Digital Twins 2.0Azure Digital Twins 2.0
Azure Digital Twins 2.0
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive Applications
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builder
 
CNCF and Cloud Native Intro
CNCF and Cloud Native IntroCNCF and Cloud Native Intro
CNCF and Cloud Native Intro
 
Introducción al Machine Learning Automático
Introducción al Machine Learning AutomáticoIntroducción al Machine Learning Automático
Introducción al Machine Learning Automático
 
Quantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computingQuantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computing
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDL
 
What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservices
 
Big data on google cloud
Big data on google cloudBig data on google cloud
Big data on google cloud
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containers
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
 
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with KubernetesWhen HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - HighlightsKubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 

Similaire à ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools

DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to GreenJohn Archer
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyOrgad Kimchi
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentDatabricks
 
IBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptxIBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptxssuser666667
 
Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.Oracle Developers
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Animesh Singh
 
From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture Claude Riousset
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Akash Tandon
 
Simplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptxSimplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptxssuser5faa791
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningProvectus
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
The Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingTimothy Spann
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSDaniel Krook
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big DataDataWorks Summit
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...ScyllaDB
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaData Science Milan
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018Krishna-Kumar
 

Similaire à ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools (20)

DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to Green
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload Deployment
 
NextGenML
NextGenML NextGenML
NextGenML
 
IBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptxIBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptx
 
Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.
 
03_aiops-1.pptx
03_aiops-1.pptx03_aiops-1.pptx
03_aiops-1.pptx
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
 
From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
 
Simplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptxSimplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptx
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)
 
The Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and Streaming
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
 

Dernier

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Dernier (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools

  • 1. Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools April 16th, 2020 1 Abhinav Joshi & Tushar Katarki Red Hat
  • 2. 2 Abhinav Joshi Senior Manager, Red Hat OpenShift Product Marketing 19+ yrs IT experience, 2 yrs at Red Hat, ex VMware, NetApp, Cisco Email: abhjoshi@redhat.com LinkedIn: https://www.linkedin.com/in/abhinavjoshi/ Tushar Katarki Senior Manager, Red Hat OpenShift Product Management 20 yrs IT experience, 8 yrs at Red Hat, Ex Oracle/Sun, Polycom, etc Email: tkatarki@redhat.com LinkedIn: https://www.linkedin.com/in/katarki/
  • 3. What we’ll discuss today 3 ● Desired AI/ML architecture & execution challenges ● Why containers, Kubernetes, and DevOps for AI/ML ● Enterprise Kubernetes Platform examples ● Real world deployment use cases
  • 5. 5 AI/ML lifecycle and key personas App developer IT operations Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & Inference ML models Monitoring & Management Data engineer Business leadership Data scientists ML Engineer
  • 6. ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.) Desired Conceptual Architecture 6 ML/DL data pipeline and sources (databases, data lake, etc.) Compute acceleration (GPU, FPGA, TPU) Hybrid, multi cloud platform with self service capabilities Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & inference ML models monitoring & management Infrastructure Virtual Private Public Hybrid EdgePhysical
  • 7. AI/ML execution challenges 7 Lots of data is collected, but finding and preparing the right data is difficult. Readily usable data lacking Lack of key skills make it difficult to find and secure talent to maintain operations. Talent shortage No rapid availability of infrastructure and software tools slows data scientists and developers Unavailability of infrastructure & software Unable to implement quickly due to slow, manual and siloed operations. Lack of collaboration across teams Containers, Kubernetes, and DevOps can help!
  • 8. What does a Data Scientist care about? As a Data Scientist, I want a “self-service cloud like” experience for my Machine Learning projects, where I can access a rich set of modelling tools, data, and computational resources, share and collaborate with colleagues, and deliver my work into production with speed, agility and repeatability to drive business value! Self service portal to access ML tools and access sources ML model deployment in app dev process Data Scientists care less about infrastructure platform unless it integrates with their ML tooling, and provides them the agility, flexibility, portability, & scalability. 8 ML Modelling / hardware acceleration Inferencing w/ hardware acceleration
  • 9. ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.) Containers, Kubernetes, and DevOps as part of the Hybrid, Multi Cloud Platform 9 ML/DL data sources - databases (SQL, NoSQL, etc.), data lake, etc. Compute acceleration (GPU, FPGA, TPU) Hybrid, multi cloud platform with self service capabilities Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & inference ML models monitoring & management Infrastructure Virtual Private Public Hybrid EdgePhysical ● Containers ● Kubernetes ● DevOps
  • 10. 10 Containers, Kubernetes, and DevOps help accelerate your AI/ML initiatives. Why containers, Kubernetes, and DevOps for AI/ML?
  • 11. Containers Basic units that make AI/ML programs shareable and portable across hybrid cloud Choice: Containers contain all your ML frameworks and tools Sharing: Container images can be shared and iterated in flexible ways Immutable & Portable: Contain once and run them anywhere with integrity Versioning: Incremental changes are tracked Fast & Efficient: They are Linux processes! Security: Process isolation and resource control Container Host Operating System Container App Supporting Files & Runtime Container App ... Container App ... App
  • 12. Kubernetes ▸ Centralizes compute resources and provides a consistent experience across the data center, cloud, and edge ▸ Resource management for compute resources (including GPUs and FPGA) ▸ Workload scheduling and management ▸ Multi tenancy and quotas enforcement ▸ Networking and storage abstractions Kubernetes is the de facto container management platform for the hybrid, multi cloudFoundation for the Hybrid, Multi Cloud Platform w/ self service capabilities for Data Scientists, Developers, etc.
  • 13. Self-service, Automation, CI/CD Boosts speed, efficiency and productivity ▸ Jupyter Notebooks running on Kubernetes form the basis for self-service ▸ Source-2-image automatically converts a notebook into a container image that is ready to be deployed ▸ Kubernetes Operators provide automation and lifecycle management for the containers ▸ CI/CD makes rapid, incremental and iterative change possible; Open source technologies such as Argo, Tekton, Jenkins and Spinnaker in conjunction with Kubernetes make this happen ▸ ‘Serverless’ technologies such as Knative will enable AI/ML users to spend more time developing their models Image source: https://www.brainvire.com/devops/
  • 14. Data Engineering Easy, self-service and repeatable Data sources: Kubernetes Persistent Volumes and S3 object store makes access to storage easy and standardized Data pipes: Kubernetes Networking and ServiceMesh provides the data connectivity - high bandwidth, low latency that is secure Data streaming and manipulation: Tools such as Apache Spark, Kafka, Presto etc. can run natively and can be accessed as a service Data governance: With open source technologies like Open Policy Agent (OPA)
  • 15. Deploying into production To deliver business value and redeem the promise of AI in the enterprise Containerize models and expose the service with an REST API using the microservices pattern - ServiceMesh (such as ISTIO) makes this easy ! Models are incorporated in a data pipeline Jobs (batch or real-time) with tools such as Spark, Kafka and Argo Models are delivered into existing application workflow as binaries: PMML, ONNX, Pickle Monitoring model performance and drift with open source tools native to Kubernetes i.e. Prometheus and Grafana CI/CD to drive continuous change and improvement in production 15
  • 16. Stitching all of this together into an…... “Enterprise Kubernetes and Container Platform for AI/ML” DatacenterLaptop ANY INFRASTRUCTURE Intelligent Applications ENTERPRISE CONTAINER HOST(S) CONTAINER ORCHESTRATION AND MANAGEMENT (KUBERNETES) AI/ML/DL & DevOps tool chain Machine Learning Modeling Data Pipeline Open Source community ML toolkit for Kubernetes 16
  • 17. 17 Examples of Enterprise Kubernetes and Container Platforms
  • 18. Red Hat OpenShift Kubernetes Platform EXISTING AUTOMATION TOOLSETS SCM (GIT) CI/CD DATA SCIENTIST Deploy ML on any cloud 18 Developer Productivity Cluster Services Automated Ops ⠇Over-The-Air Updates ⠇Monitoring ⠇Logging ⠇Registry ⠇Networking ⠇Router ⠇KubeVirt ⠇OLM ⠇Helm Red Hat Enterprise Linux & RHEL CoreOS Kubernetes Developer CLI ⠇VS Code extensions ⠇IDE Plugins Code Ready Workspaces CodeReady Containers Service Mesh ⠇Serverless Builds ⠇CI/CD Pipelines Full Stack Logging Chargeback Databases ⠇Languages Runtimes ⠇Integration Business Automation 100+ ISV Services Platform Services Application Services Developer Services Physical Virtual Private cloud Public cloud Build Cloud-Native AppsManage Workloads Multi-cluster Management Discovery ⠇Policy ⠇Compliance ⠇Configuration ⠇Workloads Managed cloud (Azure, AWS, IBM, Red Hat) Windows Server Nodes Expose ML as services, load balanced and scalable Compute Resources on-demand Best of SDLC ML in Production
  • 19. Open source community project ● Open Source AI/ML Tooling ● Open source Red Hat technologies e.g. OpenShift ● Automated deployment of open source AI/ML tooling with Kubernetes Operators ● https://www.opendatahub.io Open Data Hub - “Data and AI Platform for the Hybrid Cloud”
  • 20. Relationship between Kubeflow and Open Data Hub project ML-as-a-service platform based on OpenShift, Ceph storage, Kafka, JupyterHub and Spark Home for k8s community to share operators for various apps/tools 20
  • 21. 21 Examples of containerized data science on Red Hat OpenShift Connected Drive & Autonomous Driving Data driven diagnosis Data driven diagnosis Democratize data science for oil and gas exploration Containerized Apache Spark Healthcare and public sector Automotive Financial Oil and gas Discover Financial ServicesJupyter notebooks as a service Ministry of Defence (Israel) RBC Bank
  • 22. ML/DL with Jupyter Notebook on Enterprise Kubernetes Platform 22 DATA SCIENTIST ML/DL Model test & iteration Model deployed into production via Inference server Data Sources Access to multiple data sources Kubernetes Platform Kubernetes Platform Integrated GPU access Integrated GPU access
  • 23. Example Data Science Delivery Model on OpenShift 23 Source: https://assets.openshift.com/hubfs/OpenShift-Commons-SF-Agile-Data-Science-ExxonMobil.pdf
  • 24. ● kubeflow.org ● openshift.com/ai-ml ● opendatahub.io 24 RESOURCES
  • 25. Summary 25 Containers, Kubernetes, & DevOps can help Agility, self-service, hybrid cloud portability, scalability, flexibility, automation AI/ML benefits businesses AI-powered intelligent applications help achieve key business goals, but execution challenges exists Enterprise Kubernetes & Container Platforms make it real Allows leveraging the benefits of containers, Kubernetes, DevOps, and accelerate delivery of AI-powered apps
  • 26. Abhinav Joshi: abhjoshi@redhat.com Tushar Katarki: tkatarki@redhat.com Thank you 26