SlideShare une entreprise Scribd logo
1  sur  12
AI Pipeline Optimization
… using Kubeflow
© 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Muneer Ahmad (muneer.ahmad@netapp.com) Steve Guhr (steve.guhr@netapp.com)
AI Solutions Architect Solutions Engineer
Agenda
1) AI Pipeline Optimization & Architecture
2) Demo
3) Q&A
What’s it all about?
2 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
What it is and why did we do that?
AI Pipeline Optimization
3 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
AI Pipeline in general
… workflow across different sites
4 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Data
Ingestion
Data
Analysis
Data
Transformation
Data
Validation
Training
Data
Splitting
Monitoring
Training
Model
Validation
Training At
Scale
ServingRoll-Out Monitoring Logging
Site 1
Site 2Site 3Site 4
Site 5 Site 6
Model
AI Pipeline Portability
... working with multiple premises
5 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Laptop Training Rig Cloud
Model
AI Pipeline Optimization
… using Kubernetes & Kubeflow
6 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Model
UX
Tooling
Framework
Storage
Runtime
Drivers
OS
Accelerator
HW
Laptop Training Rig Cloud
Architectural Overview
What did we do (in a nutshell)?
7 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Kubeflow
Kubernetes
JupyterHub
…
Trident
Pipeline Katib
How did we do it?
Everyone loves demos, right?!
8 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Explaining the pipeline
 Pre-processing
 Training (classification, CIFAR10 dataset)
 Deploying and serving trained models
 TensorRT Inference engine
 Web-application
… using actual data and training
9 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
 What about „Machine Learning Version Control“?
 How do you „Lift and Shift“ the whole AI application stack across hybrid clouds?
 How to manage (c)old trained models and data?
 […]
10 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Closing Thoughts – What‘s next?
11 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
Resources
 “Trident” for persistent volumes inside of containers:
 https://github.com/NetApp/trident
 https://netapp-trident.readthedocs.io/en/latest/
 Kubernetes for container orchestration:
 https://kubernetes.io/de/
 Kubeflow as a „Data Science Toolchest“
 https://www.kubeflow.org/
 Articles about „DataScience as a Service“ and „Machine Learning Version Control“:
 https://www.linkedin.com/pulse/simplify-machine-learning-version-control-muneer-ahmad-dedmari/
 https://www.linkedin.com/pulse/part-2-simplifying-dataops-datascience-service-jupyter-steve-guhr/
12
May the Data be with you!
© 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —

Contenu connexe

Tendances

Tendances (20)

Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud MLScaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
Scaling TensorFlow Models for Training using multi-GPUs & Google Cloud ML
 
Hadoop + GPU
Hadoop + GPUHadoop + GPU
Hadoop + GPU
 
Metaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at NetflixMetaflow: The ML Infrastructure at Netflix
Metaflow: The ML Infrastructure at Netflix
 
IBM BOA for POWER
IBM BOA for POWER IBM BOA for POWER
IBM BOA for POWER
 
JMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AIJMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AI
 
Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)
 
SigOpt at GTC - Tuning the Untunable
SigOpt at GTC - Tuning the UntunableSigOpt at GTC - Tuning the Untunable
SigOpt at GTC - Tuning the Untunable
 
Large-Scale Training with GPUs at Facebook
Large-Scale Training with GPUs at FacebookLarge-Scale Training with GPUs at Facebook
Large-Scale Training with GPUs at Facebook
 
Exploiting GPUs in Spark
Exploiting GPUs in SparkExploiting GPUs in Spark
Exploiting GPUs in Spark
 
TinyML as-a-Service
TinyML as-a-ServiceTinyML as-a-Service
TinyML as-a-Service
 
Distributed Deep Learning on Spark
Distributed Deep Learning on SparkDistributed Deep Learning on Spark
Distributed Deep Learning on Spark
 
TFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platformTFX: A tensor flow-based production-scale machine learning platform
TFX: A tensor flow-based production-scale machine learning platform
 
Deep Dive on Amazon EC2 Accelerated Computing
Deep Dive on Amazon EC2 Accelerated ComputingDeep Dive on Amazon EC2 Accelerated Computing
Deep Dive on Amazon EC2 Accelerated Computing
 
AI Bridging Cloud Infrastructure (ABCI) and its communication performance
AI Bridging Cloud Infrastructure (ABCI) and its communication performanceAI Bridging Cloud Infrastructure (ABCI) and its communication performance
AI Bridging Cloud Infrastructure (ABCI) and its communication performance
 
Deploying deep learning models with Docker and Kubernetes
Deploying deep learning models with Docker and KubernetesDeploying deep learning models with Docker and Kubernetes
Deploying deep learning models with Docker and Kubernetes
 
GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用GTC Taiwan 2017 企業端深度學習與人工智慧應用
GTC Taiwan 2017 企業端深度學習與人工智慧應用
 
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated ComputingAWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
AWS Compute Evolved Week: Deep Dive on Amazon EC2 Accelerated Computing
 
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
BigDL: Bringing Ease of Use of Deep Learning for Apache Spark with Jason Dai ...
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDL
 
Kaz Sato, Evangelist, Google at MLconf ATL 2016
Kaz Sato, Evangelist, Google at MLconf ATL 2016Kaz Sato, Evangelist, Google at MLconf ATL 2016
Kaz Sato, Evangelist, Google at MLconf ATL 2016
 

Similaire à AI Pipeline Optimization using Kubeflow

Similaire à AI Pipeline Optimization using Kubeflow (20)

DevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
DevOps@Scale- IBM Cloud and NetAp-Insight-BerlinDevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
DevOps@Scale- IBM Cloud and NetAp-Insight-Berlin
 
NetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebulaNetApp Hybrid Cloud with OpenNebula
NetApp Hybrid Cloud with OpenNebula
 
Leveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation PlatformLeveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation Platform
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...
 
NetApp IT Data Center Strategies to Enable Digital Transformation
NetApp IT Data Center Strategies to Enable Digital TransformationNetApp IT Data Center Strategies to Enable Digital Transformation
NetApp IT Data Center Strategies to Enable Digital Transformation
 
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
Recipe for Success: The Right Ingredients for Enterprise-Class Cloud Data Man...
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019
IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019IoT  Edge Data Processing with NVidia Jetson Nano oct 3 2019
IoT Edge Data Processing with NVidia Jetson Nano oct 3 2019
 
The rise of microservices
The rise of microservicesThe rise of microservices
The rise of microservices
 
End User Computing with NetApp
End User Computing with NetAppEnd User Computing with NetApp
End User Computing with NetApp
 
TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform TensorFlow 16: Building a Data Science Platform
TensorFlow 16: Building a Data Science Platform
 
Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
 Facilitez votre transition DevOps grâce à l'automatisation de votre infras... Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
Facilitez votre transition DevOps grâce à l'automatisation de votre infras...
 
CWIN16 UK Event - The Future of Infrastructure
CWIN16 UK Event - The Future of Infrastructure CWIN16 UK Event - The Future of Infrastructure
CWIN16 UK Event - The Future of Infrastructure
 
Instantaneous Replication of Build Artifacts with NetApp
Instantaneous Replication of Build Artifacts with NetAppInstantaneous Replication of Build Artifacts with NetApp
Instantaneous Replication of Build Artifacts with NetApp
 
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from NetappTransform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
 
Sydney mule soft meetup #8 1 August 2019 - all slides
Sydney mule soft meetup #8   1 August 2019 - all slidesSydney mule soft meetup #8   1 August 2019 - all slides
Sydney mule soft meetup #8 1 August 2019 - all slides
 
apidays LIVE Australia 2020 - Data with a Mission by Matt McLarty
apidays LIVE Australia 2020 -  Data with a Mission by Matt McLarty apidays LIVE Australia 2020 -  Data with a Mission by Matt McLarty
apidays LIVE Australia 2020 - Data with a Mission by Matt McLarty
 
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
apidays LIVE Paris - Data with a mission: a COVID-19 API case study by Matt M...
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into production
 
End To End Machine Learning With Google Cloud
End To End Machine Learning With Google Cloud End To End Machine Learning With Google Cloud
End To End Machine Learning With Google Cloud
 

Dernier

AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 

Dernier (20)

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 

AI Pipeline Optimization using Kubeflow

  • 1. AI Pipeline Optimization … using Kubeflow © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Muneer Ahmad (muneer.ahmad@netapp.com) Steve Guhr (steve.guhr@netapp.com) AI Solutions Architect Solutions Engineer
  • 2. Agenda 1) AI Pipeline Optimization & Architecture 2) Demo 3) Q&A What’s it all about? 2 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 3. What it is and why did we do that? AI Pipeline Optimization 3 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 4. AI Pipeline in general … workflow across different sites 4 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Data Ingestion Data Analysis Data Transformation Data Validation Training Data Splitting Monitoring Training Model Validation Training At Scale ServingRoll-Out Monitoring Logging Site 1 Site 2Site 3Site 4 Site 5 Site 6
  • 5. Model AI Pipeline Portability ... working with multiple premises 5 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Laptop Training Rig Cloud
  • 6. Model AI Pipeline Optimization … using Kubernetes & Kubeflow 6 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Model UX Tooling Framework Storage Runtime Drivers OS Accelerator HW Laptop Training Rig Cloud
  • 7. Architectural Overview What did we do (in a nutshell)? 7 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Kubeflow Kubernetes JupyterHub … Trident Pipeline Katib
  • 8. How did we do it? Everyone loves demos, right?! 8 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 9. Explaining the pipeline  Pre-processing  Training (classification, CIFAR10 dataset)  Deploying and serving trained models  TensorRT Inference engine  Web-application … using actual data and training 9 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —
  • 10.  What about „Machine Learning Version Control“?  How do you „Lift and Shift“ the whole AI application stack across hybrid clouds?  How to manage (c)old trained models and data?  […] 10 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Closing Thoughts – What‘s next?
  • 11. 11 © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL — Resources  “Trident” for persistent volumes inside of containers:  https://github.com/NetApp/trident  https://netapp-trident.readthedocs.io/en/latest/  Kubernetes for container orchestration:  https://kubernetes.io/de/  Kubeflow as a „Data Science Toolchest“  https://www.kubeflow.org/  Articles about „DataScience as a Service“ and „Machine Learning Version Control“:  https://www.linkedin.com/pulse/simplify-machine-learning-version-control-muneer-ahmad-dedmari/  https://www.linkedin.com/pulse/part-2-simplifying-dataops-datascience-service-jupyter-steve-guhr/
  • 12. 12 May the Data be with you! © 2019 NetApp, Inc. All rights reserved. — NETAPP CONFIDENTIAL —