SlideShare une entreprise Scribd logo
1  sur  37
Télécharger pour lire hors ligne
Prof. Dr. Jan Kirenz
Machine Learning Operations (MLOps)
Usage of Pipelines in the ML Lifecycle with
Tensor Flow Extended (TFX) and Kubeflow
Prof. Dr. Jan Kirenz
HdM Stuttgart
Prof. Dr. Jan Kirenz
80-85% PoC Factory
The Proof of Concept Factory
Most companies...
● … conduct AI experiments and pilots
but achieve a low scaling success
rate
● … have significant under investment,
yielding low returns
Source: Accenture (2019) https://www.accenture.com/us-en/insights/artificial-intelligence/ai-investments
Prof. Dr. Jan Kirenz
https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021/
Scalable AI
Prof. Dr. Jan Kirenz
ML Project Code
The problem with scaling AI: ML code is only a
fraction of a production-ready ML project code
ML
Code 5-10%
Prof. Dr. Jan Kirenz
Monitoring
Hidden technical debt in machine learning systems
Sculley, D. et al. (2015). Hidden technical debt in machine learning systems. Advances in neural information processing systems, 28, pp. 2503-2511
Data Collection
Configuration
Feature Engineering
Data
Verification
Metadata Management
Model Analysis
Serving
Infra-
structure
Automation
Process Management
Machine Resource
Management
Testing and Debugging
ML
Code
Prof. Dr. Jan Kirenz
Machine learning operations (MLOps)
● ML Engineering culture and practice that
aims at unifying ML System development
(Dev) and ML system operations (Ops)
● Tools and principles to support workflow
standardization and automation through
the ML system lifecycle (e.g. with pipelines)
Prof. Dr. Jan Kirenz
Prof. Dr. Jan Kirenz
Machine learning
lifecycle
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Data splitting
Anomaly
detection
Data
preprocessing
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Evaluate
model
Model Training
& tuning
Select
algorithm
Data splitting
Anomaly
detection
Data
preprocessing
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Retrain
triggers
Evaluate
model
Model Training
& tuning
Monitor model
Select
algorithm
Data splitting
Anomaly
detection
Data
preprocessing
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Retrain
triggers
Evaluate
model
Model Training
& tuning
Monitor model
Select
algorithm
Data splitting
Anomaly
detection
Data
preprocessing
Lifecycle
of an ML System
Plan | Data | Model | Deployment
Common issues which lead to a PoC to production gap
● Lack of reuse and duplication
● Inconsistency (data, code, models)
● Manual and slow transition from PoC to production
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Retrain
triggers
Evaluate
model
Model Training
& tuning
Monitor model
Select
algorithm
Data splitting
Anomaly
detection
Data
preprocessing
Model
management
Model registry
Data and feature
management
Feature store
Pipeline
management
Pipeline orchestration
Metadata
management
Metadata store
Lifecycle
of an ML System
Plan | Data | Model | Deployment
MLOps components
Prof. Dr. Jan Kirenz
What is a pipeline?
● Description of an ML workflow
● A pipeline component is a self-contained
set of user code that performs one step in
the pipeline
● Includes the definition of the configuration
and inputs required to run the pipeline (e.g.
model hyperparameters)
… do this
… than that
Start
...
… the end
The workflow is
also called directed
acyclic graph (DAG)
This is a component
Complete workflow of the ML
system lifecycle
Prof. Dr. Jan Kirenz
Source: Baer & Ngahane (2019)
… do this
… than that
Start
… the end
Prof. Dr. Jan Kirenz
TensorFlow Extended (TFX)
● Google-production-scale machine learning
(ML) platform based on TensorFlow
● Portable to multiple environments (Azure,
AWS, Google Cloud, IBM, ...)
● Python based toolkit; can be used with
notebooks
● Helps you orchestrate your ML process:
Apache Airflow, Apache Beam or Kubeflow
pipelines
Source: TensorFlow (2021)
Prof. Dr. Jan Kirenz
TFX 1.0 (19.05.21)
● Enterprise-grade support
● Security patches and select bug fixes for
up to three years
● Guaranteed API & Artifact backward
compatibility
Source: Google (2021)
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
KerasTuner
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store: ML Metadata (MLMD)
TFX Options for Pipeline
Orchestration
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
KerasTuner
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Data
preprocessing
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
KerasTuner
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
KerasTuner
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Prof. Dr. Jan Kirenz
Tuner
Evaluator
InfraValidator
ExampleGen
StatisticsGen
SchemaGen
Example
Validator
Transform
Trainer
Pusher
HUB / JS / LITE / SERVING
Model Server
KerasTuner
BulkInferrer
Metadata Store (ML Metadata)
TF
Data
Validation
(TFDV)
TFT
TF
TFX Options for Pipeline
Orchestration
TensorFlow
Model Analysis
(TFMA)
TensorFlow Lite is a set of tools that enables on-device
machine learning by helping developers run their models
on mobile, embedded, and IoT devices.
Prof. Dr. Jan Kirenz
Plan
Model
Deployment Data
Identify use
case
Frame
problem
Identify
variables
Define metrics
Business Analyst
Data
Engineer
Software
Developer
Data Scientist
Data ingestion
Analyze &
clean data
Define schema
Feature
engineering
Validate model
Deploy model
Serve model
Retrain model
Evaluate
model
Model Training
& tuning
Monitor model
ExampleGen
Select
algorithm
StatisticsGen
SchemaGen
Example
Validator
Transform
Data splitting
Trainer
Tuner
Evaluator
InfraValidator
Anomaly
detection
Pusher
HUB / JS / LITE / SERVING
Model Server
TF
Data
Validation
(TFDV)
TFT
TF
KerasTuner
TensorFlow
Model Analysis
(TFMA)
BulkInferrer
Data
preprocessing
Metadata Store (ML Metadata)
TFX Options for Pipeline
Orchestration
Production phase:
automate the execution
of the ML pipeline based
on a schedule or certain
triggering conditions.
Development phase: run the ML experiment, instead of
manually executing each step.
Data preparation
phase:
automatically
ingest, validate
and transform
data and provide
features to models
Prof. Dr. Jan Kirenz
Prof. Dr. Jan Kirenz
Pipeline orchestration
Prof. Dr. Jan Kirenz
TFX & Apache Airflow
● Programmatically author, schedule and
monitor workflows with Python code.
● User interface to visualize pipelines
running in production, monitor progress,
and troubleshoot issues.
Prof. Dr. Jan Kirenz
TFX & Apache Beam
● Provides a framework for running batch
and streaming data processing jobs that
run on a variety of runners (Spark, Flink, ...).
● Beam provides an abstraction layer which
enables TFX to run on any supported
runner without code modifications
● TFX only uses the Beam Python API
Prof. Dr. Jan Kirenz
TFX & Kubeflow pipelines
The Kubeflow Pipelines platform consists of:
● An engine for scheduling multi-step ML
workflows (using Kubernetes).
● User interface (UI) for managing and
tracking experiments, jobs, and runs.
● Python SDK for defining and manipulating
pipelines and components.
● Notebooks for interacting with the system
using the SDK
Kubeflow Pipelines is available as a core component of Kubeflow or as
a standalone installation.
Prof. Dr. Jan Kirenz
Prof. Dr. Jan Kirenz
Prof. Dr. Jan Kirenz
Prof. Dr. Jan Kirenz
KubeFlow
Prof. Dr. Jan Kirenz
ML toolkit for Kubernetes
Prof. Dr. Jan Kirenz
Google’s Vertex AI
Launched in May 2021
Prof. Dr. Jan Kirenz
ML Pipelines | wrap-up
Source: TensorFlow (2021)
By using a ML pipeline, you can:
● Automate your ML process, which lets you
regularly retrain, evaluate, and deploy your
model.
● Utilize distributed compute resources for
processing large datasets and workloads.
● Increase the velocity of experimentation by
running a pipeline with different sets of
hyperparameters.
To learn more visit the following tutorials @:
https://kirenz.github.io/
MLOps tutorials on how to:
● Install TF and TFX
● Build your first TFX pipeline
● Install Kubeflow
● Build your first Kubeflow pipeline
Jan Kirenz
www.kirenz.com

Contenu connexe

Tendances

Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...Databricks
 
Managing and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in PythonManaging and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in PythonSimon Frid
 
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens... ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...Databricks
 
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...Databricks
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Databricks
 
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Luciano Resende
 
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...Databricks
 
Whats new in_mlflow
Whats new in_mlflowWhats new in_mlflow
Whats new in_mlflowDatabricks
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youMárton Kodok
 
Model versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 WalkthroughModel versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 WalkthroughManasi Vartak
 
Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)Josh Baer
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Animesh Singh
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDatabricks
 
Apply MLOps at Scale by H&M
Apply MLOps at Scale by H&MApply MLOps at Scale by H&M
Apply MLOps at Scale by H&MDatabricks
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleDatabricks
 
Reproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorchReproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorchDatabricks
 
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...Databricks
 
Accelerate Your AI Today
Accelerate Your AI TodayAccelerate Your AI Today
Accelerate Your AI TodayDESMOND YUEN
 
Automating machine learning lifecycle with kubeflow
Automating machine learning lifecycle with kubeflowAutomating machine learning lifecycle with kubeflow
Automating machine learning lifecycle with kubeflowStepan Pushkarev
 

Tendances (20)

Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep... Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
Cloud Computing Was Built for Web Developers—What Does v2 Look Like for Deep...
 
Managing and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in PythonManaging and Versioning Machine Learning Models in Python
Managing and Versioning Machine Learning Models in Python
 
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens... ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
ML at the Edge: Building Your Production Pipeline with Apache Spark and Tens...
 
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
Deep Learning for Natural Language Processing Using Apache Spark and TensorFl...
 
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
Deploying and Monitoring Heterogeneous Machine Learning Applications with Cli...
 
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
 
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
 
MLOps in action
MLOps in actionMLOps in action
MLOps in action
 
Whats new in_mlflow
Whats new in_mlflowWhats new in_mlflow
Whats new in_mlflow
 
Next18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to youNext18 Extended Targu Mures - Bringing the Cloud to you
Next18 Extended Targu Mures - Bringing the Cloud to you
 
Model versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 WalkthroughModel versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 Walkthrough
 
Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)Kubeflow at Spotify (For the Kubeflow Summit)
Kubeflow at Spotify (For the Kubeflow Summit)
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
 
Deploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNXDeploying End-to-End Deep Learning Pipelines with ONNX
Deploying End-to-End Deep Learning Pipelines with ONNX
 
Apply MLOps at Scale by H&M
Apply MLOps at Scale by H&MApply MLOps at Scale by H&M
Apply MLOps at Scale by H&M
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
 
Reproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorchReproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorch
 
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
Continuous Delivery of Deep Transformer-Based NLP Models Using MLflow and AWS...
 
Accelerate Your AI Today
Accelerate Your AI TodayAccelerate Your AI Today
Accelerate Your AI Today
 
Automating machine learning lifecycle with kubeflow
Automating machine learning lifecycle with kubeflowAutomating machine learning lifecycle with kubeflow
Automating machine learning lifecycle with kubeflow
 

Similaire à MLOps - Build pipelines with Tensor Flow Extended & Kubeflow

Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software EngineeringMiroslaw Staron
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AIHui Lei
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedRobert Grossman
 
The adoption of machine learning techniques for software defect prediction: A...
The adoption of machine learning techniques for software defect prediction: A...The adoption of machine learning techniques for software defect prediction: A...
The adoption of machine learning techniques for software defect prediction: A...RAKESH RANA
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLJordan Birdsell
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaData Science Milan
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...Robert Grossman
 
How to re-use existing system models to generate test cases
How to re-use existing system models to generate test casesHow to re-use existing system models to generate test cases
How to re-use existing system models to generate test casesTransWare AG
 
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5gdgsurrey
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflowDatabricks
 
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...Codemotion
 
Software Analytics = Sharing Information
Software Analytics = Sharing InformationSoftware Analytics = Sharing Information
Software Analytics = Sharing InformationThomas Zimmermann
 
The Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs PublicThe Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs PublicDavid Solivan
 
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
End-to-End Machine learning pipelines for Python driven organizations - Nick ...End-to-End Machine learning pipelines for Python driven organizations - Nick ...
End-to-End Machine learning pipelines for Python driven organizations - Nick ...PyData
 
Demystifying Cognitive Approaches to Predictive Maintenance Part 1
Demystifying Cognitive Approaches to Predictive Maintenance Part 1Demystifying Cognitive Approaches to Predictive Maintenance Part 1
Demystifying Cognitive Approaches to Predictive Maintenance Part 1Anita Raj
 

Similaire à MLOps - Build pipelines with Tensor Flow Extended & Kubeflow (20)

Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
AI for Software Engineering
AI for Software EngineeringAI for Software Engineering
AI for Software Engineering
 
Towards the Industrialization of AI
Towards the Industrialization of AITowards the Industrialization of AI
Towards the Industrialization of AI
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
The adoption of machine learning techniques for software defect prediction: A...
The adoption of machine learning techniques for software defect prediction: A...The adoption of machine learning techniques for software defect prediction: A...
The adoption of machine learning techniques for software defect prediction: A...
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
How to re-use existing system models to generate test cases
How to re-use existing system models to generate test casesHow to re-use existing system models to generate test cases
How to re-use existing system models to generate test cases
 
AI at Scale in Enterprises
AI at Scale in Enterprises AI at Scale in Enterprises
AI at Scale in Enterprises
 
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflow
 
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...
Denis Jannot - Towards Data Science Engineering Principles - Codemotion Milan...
 
Software Analytics = Sharing Information
Software Analytics = Sharing InformationSoftware Analytics = Sharing Information
Software Analytics = Sharing Information
 
Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...Deep learning in manufacturing predicting and preventing manufacturing defect...
Deep learning in manufacturing predicting and preventing manufacturing defect...
 
The Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs PublicThe Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs Public
 
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
End-to-End Machine learning pipelines for Python driven organizations - Nick ...End-to-End Machine learning pipelines for Python driven organizations - Nick ...
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
 
Demystifying Cognitive Approaches to Predictive Maintenance Part 1
Demystifying Cognitive Approaches to Predictive Maintenance Part 1Demystifying Cognitive Approaches to Predictive Maintenance Part 1
Demystifying Cognitive Approaches to Predictive Maintenance Part 1
 

Dernier

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numberssuginr1
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...gajnagarg
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...gajnagarg
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...gajnagarg
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfSayantanBiswas37
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...kumargunjan9515
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...HyderabadDolls
 

Dernier (20)

5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Latur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 

MLOps - Build pipelines with Tensor Flow Extended & Kubeflow

  • 1. Prof. Dr. Jan Kirenz Machine Learning Operations (MLOps) Usage of Pipelines in the ML Lifecycle with Tensor Flow Extended (TFX) and Kubeflow Prof. Dr. Jan Kirenz HdM Stuttgart
  • 2. Prof. Dr. Jan Kirenz 80-85% PoC Factory The Proof of Concept Factory Most companies... ● … conduct AI experiments and pilots but achieve a low scaling success rate ● … have significant under investment, yielding low returns Source: Accenture (2019) https://www.accenture.com/us-en/insights/artificial-intelligence/ai-investments
  • 3. Prof. Dr. Jan Kirenz https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021/ Scalable AI
  • 4. Prof. Dr. Jan Kirenz ML Project Code The problem with scaling AI: ML code is only a fraction of a production-ready ML project code ML Code 5-10%
  • 5. Prof. Dr. Jan Kirenz Monitoring Hidden technical debt in machine learning systems Sculley, D. et al. (2015). Hidden technical debt in machine learning systems. Advances in neural information processing systems, 28, pp. 2503-2511 Data Collection Configuration Feature Engineering Data Verification Metadata Management Model Analysis Serving Infra- structure Automation Process Management Machine Resource Management Testing and Debugging ML Code
  • 6. Prof. Dr. Jan Kirenz Machine learning operations (MLOps) ● ML Engineering culture and practice that aims at unifying ML System development (Dev) and ML system operations (Ops) ● Tools and principles to support workflow standardization and automation through the ML system lifecycle (e.g. with pipelines)
  • 7. Prof. Dr. Jan Kirenz Prof. Dr. Jan Kirenz Machine learning lifecycle
  • 8. Prof. Dr. Jan Kirenz Plan Model Deployment Data Business Analyst Data Engineer Software Developer Data Scientist Lifecycle of an ML System Plan | Data | Model | Deployment
  • 9. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Lifecycle of an ML System Plan | Data | Model | Deployment
  • 10. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Data splitting Anomaly detection Data preprocessing Lifecycle of an ML System Plan | Data | Model | Deployment
  • 11. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Evaluate model Model Training & tuning Select algorithm Data splitting Anomaly detection Data preprocessing Lifecycle of an ML System Plan | Data | Model | Deployment
  • 12. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Retrain triggers Evaluate model Model Training & tuning Monitor model Select algorithm Data splitting Anomaly detection Data preprocessing Lifecycle of an ML System Plan | Data | Model | Deployment
  • 13. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Retrain triggers Evaluate model Model Training & tuning Monitor model Select algorithm Data splitting Anomaly detection Data preprocessing Lifecycle of an ML System Plan | Data | Model | Deployment Common issues which lead to a PoC to production gap ● Lack of reuse and duplication ● Inconsistency (data, code, models) ● Manual and slow transition from PoC to production
  • 14. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Retrain triggers Evaluate model Model Training & tuning Monitor model Select algorithm Data splitting Anomaly detection Data preprocessing Model management Model registry Data and feature management Feature store Pipeline management Pipeline orchestration Metadata management Metadata store Lifecycle of an ML System Plan | Data | Model | Deployment MLOps components
  • 15. Prof. Dr. Jan Kirenz What is a pipeline? ● Description of an ML workflow ● A pipeline component is a self-contained set of user code that performs one step in the pipeline ● Includes the definition of the configuration and inputs required to run the pipeline (e.g. model hyperparameters) … do this … than that Start ... … the end The workflow is also called directed acyclic graph (DAG) This is a component Complete workflow of the ML system lifecycle
  • 16. Prof. Dr. Jan Kirenz Source: Baer & Ngahane (2019) … do this … than that Start … the end
  • 17. Prof. Dr. Jan Kirenz TensorFlow Extended (TFX) ● Google-production-scale machine learning (ML) platform based on TensorFlow ● Portable to multiple environments (Azure, AWS, Google Cloud, IBM, ...) ● Python based toolkit; can be used with notebooks ● Helps you orchestrate your ML process: Apache Airflow, Apache Beam or Kubeflow pipelines Source: TensorFlow (2021)
  • 18. Prof. Dr. Jan Kirenz TFX 1.0 (19.05.21) ● Enterprise-grade support ● Security patches and select bug fixes for up to three years ● Guaranteed API & Artifact backward compatibility Source: Google (2021)
  • 19. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF KerasTuner TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store: ML Metadata (MLMD) TFX Options for Pipeline Orchestration
  • 20. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF KerasTuner TensorFlow Model Analysis (TFMA) BulkInferrer Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration Data preprocessing
  • 21. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF KerasTuner TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration
  • 22. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration
  • 23. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration
  • 24. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF KerasTuner TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration
  • 25. Prof. Dr. Jan Kirenz Tuner Evaluator InfraValidator ExampleGen StatisticsGen SchemaGen Example Validator Transform Trainer Pusher HUB / JS / LITE / SERVING Model Server KerasTuner BulkInferrer Metadata Store (ML Metadata) TF Data Validation (TFDV) TFT TF TFX Options for Pipeline Orchestration TensorFlow Model Analysis (TFMA) TensorFlow Lite is a set of tools that enables on-device machine learning by helping developers run their models on mobile, embedded, and IoT devices.
  • 26. Prof. Dr. Jan Kirenz Plan Model Deployment Data Identify use case Frame problem Identify variables Define metrics Business Analyst Data Engineer Software Developer Data Scientist Data ingestion Analyze & clean data Define schema Feature engineering Validate model Deploy model Serve model Retrain model Evaluate model Model Training & tuning Monitor model ExampleGen Select algorithm StatisticsGen SchemaGen Example Validator Transform Data splitting Trainer Tuner Evaluator InfraValidator Anomaly detection Pusher HUB / JS / LITE / SERVING Model Server TF Data Validation (TFDV) TFT TF KerasTuner TensorFlow Model Analysis (TFMA) BulkInferrer Data preprocessing Metadata Store (ML Metadata) TFX Options for Pipeline Orchestration Production phase: automate the execution of the ML pipeline based on a schedule or certain triggering conditions. Development phase: run the ML experiment, instead of manually executing each step. Data preparation phase: automatically ingest, validate and transform data and provide features to models
  • 27. Prof. Dr. Jan Kirenz Prof. Dr. Jan Kirenz Pipeline orchestration
  • 28. Prof. Dr. Jan Kirenz TFX & Apache Airflow ● Programmatically author, schedule and monitor workflows with Python code. ● User interface to visualize pipelines running in production, monitor progress, and troubleshoot issues.
  • 29. Prof. Dr. Jan Kirenz TFX & Apache Beam ● Provides a framework for running batch and streaming data processing jobs that run on a variety of runners (Spark, Flink, ...). ● Beam provides an abstraction layer which enables TFX to run on any supported runner without code modifications ● TFX only uses the Beam Python API
  • 30. Prof. Dr. Jan Kirenz TFX & Kubeflow pipelines The Kubeflow Pipelines platform consists of: ● An engine for scheduling multi-step ML workflows (using Kubernetes). ● User interface (UI) for managing and tracking experiments, jobs, and runs. ● Python SDK for defining and manipulating pipelines and components. ● Notebooks for interacting with the system using the SDK Kubeflow Pipelines is available as a core component of Kubeflow or as a standalone installation.
  • 31. Prof. Dr. Jan Kirenz
  • 32. Prof. Dr. Jan Kirenz
  • 33. Prof. Dr. Jan Kirenz
  • 34. Prof. Dr. Jan Kirenz KubeFlow
  • 35. Prof. Dr. Jan Kirenz ML toolkit for Kubernetes
  • 36. Prof. Dr. Jan Kirenz Google’s Vertex AI Launched in May 2021
  • 37. Prof. Dr. Jan Kirenz ML Pipelines | wrap-up Source: TensorFlow (2021) By using a ML pipeline, you can: ● Automate your ML process, which lets you regularly retrain, evaluate, and deploy your model. ● Utilize distributed compute resources for processing large datasets and workloads. ● Increase the velocity of experimentation by running a pipeline with different sets of hyperparameters. To learn more visit the following tutorials @: https://kirenz.github.io/ MLOps tutorials on how to: ● Install TF and TFX ● Build your first TFX pipeline ● Install Kubeflow ● Build your first Kubeflow pipeline Jan Kirenz www.kirenz.com