Machine Learning at Work (仕事ではじめる機械学習)
Aki Ariga
How to run a project that builds a data-driven product, and the difficulties specific to building machine learning systems.
1.
Aki Ariga | Field Data Scientist
2018.05.17
2.
© Cloudera, Inc. All rights reserved.
● Field Data Scientist at Cloudera
● Previously research engineer at Toshiba, Rails developer at Cookpad
● Co-author of “ ”
● Founder of kawasaki.rb & MLCT
● Twitter: @chezou
● GitHub: https://github.com/chezou/
3.
Hidden technical debt in machine learning systems [2] + Project procedure + Culture
4.
Building a Data-driven product ≠ Research
5.
A journey for Data-driven product
1. 2. 3. A/B 4. A/B 5. 6. 7.
Culture, BI, Statistics, ML
Source: http://tjo.hatenablog.com/entry/2016/01/18/080000
6.
Procedure in a Machine Learning project
1. 2. 3. 4. 5. 6. 7. 8.
Step.4 7
7.
Typical project member recommendation for ML project
• / Web
8.
What’s the difference between academia and industry for ML?
9.
[Image: Production by Nick Youngson, CC BY-SA 3.0, Alpha Stock Images]
10.
Sample data science/machine learning workflow
From data to exploration to action
[Diagram labels: Data Engineering; Data Science (Exploratory); Production (Operational); Acquisition; Curation; Data Wrangling; Data Exploration; Model Training & Testing; Production Data Pipelines; Batch Scoring; Online Scoring; Serving; Data Governance; Reports, Dashboards; Data Models; Predictions; Business value]
11.
1. 2. 3. Production MLOps
12.
1. 2. 3. Production MLOps
13.
1. Train by batch, predict on the fly, serve via REST API
2. Train by batch, predict by batch, serve through the shared DB
3. Train, predict, serve by streaming
4. Train by batch, predict on mobile app
14.
Pattern 1: Train by batch, predict on the fly, serve via REST API
[Diagram labels: Web Application; DB; ML System (Batch System, API Server); Activity log/Contents data; Extract feature; Feature; Execute training; Training result; Trained Model; REST API; User ID/Item ID; Prediction result]
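A minimal sketch of Pattern 1, assuming a toy popularity model stored as JSON and Python's standard `http.server` in place of a production API server. The names `train_batch`, `handle_predict`, `make_handler`, and `serve`, and the `item_id` request field, are hypothetical and do not come from the slides.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def train_batch(activity_log):
    """Batch job: turn item IDs from the activity log into popularity scores."""
    counts = {}
    for item_id in activity_log:
        counts[item_id] = counts.get(item_id, 0) + 1
    total = len(activity_log)
    return {item_id: n / total for item_id, n in counts.items()}


def save_model(model, path):
    """Persist the training result where the API server can load it."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(model, f)


def load_model(path):
    """Load the trained model on the API server side."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def handle_predict(model, raw_body):
    """Predict on the fly for one request; returns (status, response bytes)."""
    try:
        item_id = json.loads(raw_body)["item_id"]
    except (ValueError, KeyError, TypeError):
        return 400, json.dumps({"error": "expected JSON with item_id"}).encode()
    if not isinstance(item_id, str):
        return 400, json.dumps({"error": "item_id must be a string"}).encode()
    score = model.get(item_id, 0.0)  # unseen items get a neutral score
    return 200, json.dumps({"item_id": item_id, "score": score}).encode()


def make_handler(model):
    """Wrap handle_predict in an http.server handler for POST requests."""

    class PredictHandler(BaseHTTPRequestHandler):
        def do_POST(self):
            length = int(self.headers.get("Content-Length", 0))
            status, body = handle_predict(model, self.rfile.read(length))
            self.send_response(status)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    return PredictHandler


def serve(model_path, port=8000):
    """Run the API server; blocks until interrupted."""
    handler = make_handler(load_model(model_path))
    HTTPServer(("127.0.0.1", port), handler).serve_forever()
```

The web application only sees the REST endpoint, so the batch job can replace the model file without any change on the application side.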
15.
Example architecture: PMML + OpenScoring
[Diagram labels: Model building layer; Predicting & serving layer; CDSW; HDFS; Activity log; Extract feature & Train/update model; Export model as PMML; Trained Model; Updated model; Load model; Request to predict; Extract feature & Predict; Prediction results]
16.
Example architecture: Docker based API Server
[Diagram labels: Model building layer; Predicting & serving layer; CDSW; HDFS; Object storage; Activity log; Extract feature & Train/update model; Save model on object storage; Pack the runtime env with Docker; Trained Model; Updated model; Load model; Request to predict; Extract feature & Predict; Prediction results]
17.
Pattern 2: Train by batch, predict by batch, serve through the shared DB
[Diagram labels: Web Application; DB; Batch System (Training Batch, Prediction Batch); Activity log/Contents data; Extract feature; Feature; Execute training; Training result; Trained Model; Prediction result; Serve prediction]
18.
Example architecture: Serving by HBase/Kudu
[Diagram labels: Model building & predicting layer; Serving layer; CDSW; HDFS; Kudu/HBase; Historical data; Activity log; Extract feature & Train/update model; Updated model; Trained Model; Load trained model; Extract feature & Predict; Prediction results]
19.
Pattern 3: Train, predict, serve by streaming
[Diagram labels: Web Application; Message queue (e.g. Kafka); Stream-based ML System (e.g. Spark Streaming); Log data; Recent log data; Extract feature; Feature; Train & Predict; Model updates; Model; Trained Model; Prediction results]
Notes:
- Querying for prediction
- Showing or sending alerts
- This component may work with a message queue like Kafka
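A minimal sketch of Pattern 3, assuming an in-process `queue.Queue` stands in for the message queue and a running mean/variance outlier detector stands in for the streaming model. The class `OnlineAnomalyModel`, the function `run_stream`, and the event field `latency_ms` are hypothetical. The slides name Kafka and Spark Streaming only as examples, and neither is used here.

```python
import math
import queue


class OnlineAnomalyModel:
    """Running mean/variance (Welford's algorithm), updated one event at a time."""

    def __init__(self, threshold=3.0, warmup=5):
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean
        self.threshold = threshold
        self.warmup = warmup

    def predict(self, value):
        """Return True if value is an outlier relative to the events seen so far."""
        if self.n < self.warmup:
            return False  # not enough data yet to judge
        std = math.sqrt(self.m2 / (self.n - 1))
        if std == 0.0:
            return value != self.mean
        return abs(value - self.mean) / std > self.threshold

    def update(self, value):
        """Train incrementally on a single event."""
        self.n += 1
        delta = value - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (value - self.mean)


def run_stream(log_queue, result_queue, model):
    """Consume events until a None sentinel; predict first, then train on each event."""
    while True:
        event = log_queue.get()
        if event is None:
            break
        is_alert = model.predict(event["latency_ms"])
        result_queue.put({"id": event["id"], "alert": is_alert})
        model.update(event["latency_ms"])
```

Predicting before updating keeps an outlier from influencing its own score, which is one possible ordering rather than the only one.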
20.
Pattern 4: Train by batch, predict on a mobile app
[Diagram labels: Mobile Application; DB; Batch System; Activity log/Contents data; Activity logs/Contents data; Extract feature; Feature; Execute training; Training result; Trained Model; Convert model; Request for prediction; Prediction result]
21.
Example architecture: Serving on a mobile app
[Diagram labels: Model building layer; Predicting & serving layer; CDSW; HDFS; Activity log; Extract feature & Train/update model; Convert model to TFLite/CoreML; Trained Model; Updated model; Storage in a smart phone; Load model; Request to predict; Extract feature & Predict; Prediction results]
22.
Pattern 4’: Federated learning
https://research.googleblog.com/2017/04/federated-learning-collaborative.html
23.
4 patterns: comparison

| | Pattern 1 (REST API) | Pattern 2 (Shared DB) | Pattern 3 (Streaming) | Pattern 4 (Mobile app) |
|---|---|---|---|---|
| Training | by batch | by batch | NRT (by streaming) | by batch |
| Prediction | NRT (on the fly) | by batch | NRT (by streaming) | NRT (on the fly) |
| Prediction result delivery | NRT (via REST API) | NRT (through the shared DB) | NRT (by streaming via MQ) | NRT (via in-process API on mobile) |
| Latency from new data to prediction | Moderate | Moderate to long | Very low | Low |
| Required time to predict | Short | Long | Short | Short |
| Coupling with the app | Loose | Loose | Loose | Tight |
| Language dependency | Independent | Independent | Independent | Depends on frameworks |
| System management difficulty | Moderate | Easy | Very hard | Moderate |

NRT: Near real time
24.
CI, CD and Blue Green deployment
https://www.slideshare.net/hiroakikudo77/ss-84593653/14
25.
1. 2. 3. Production MLOps
26.
• /Feedback loop
27.
• MeCab
• /Feedback loop
https://twitter.com/hagino3000/status/986257856730034177
28.
• “safe to serve” & “desired prediction quality” [4]
• (offline) (online)
• “Silent failures” [3]
• Join
• serving
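One possible reading of the “safe to serve” and “desired prediction quality” checks is a gate that a candidate model must pass before it replaces the model currently serving. The sketch below is an assumption for illustration, not the validation logic of TFX [4]. The function names, the accuracy metric, and the `max_regression` tolerance are all hypothetical.

```python
def accuracy(model, examples):
    """Fraction of (features, label) examples the model labels correctly."""
    correct = sum(1 for features, label in examples if model(features) == label)
    return correct / len(examples)


def validate_candidate(candidate, baseline, examples, max_regression=0.01):
    """Return (ok, reason) for a candidate model checked against held-out examples."""
    # "Safe to serve": the candidate must not crash and must emit a known label.
    known_labels = {label for _, label in examples}
    for features, _ in examples:
        try:
            prediction = candidate(features)
        except Exception as exc:
            return False, f"not safe to serve: {exc!r}"
        if prediction not in known_labels:
            return False, f"not safe to serve: unexpected output {prediction!r}"
    # "Desired prediction quality": compare against the model currently serving.
    candidate_acc = accuracy(candidate, examples)
    baseline_acc = accuracy(baseline, examples)
    if candidate_acc < baseline_acc - max_regression:
        return False, f"quality regression: {candidate_acc:.2f} < {baseline_acc:.2f}"
    return True, "ok"
```

This is an offline check only. Online monitoring is still needed for failures that raise no error.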
29.
• [1]
• DVC, Bitemporal Modeling
• [4]
• [2,4]
• [4]
30.
1. 2. 3. Production MLOps
31.
• [7]
• Google, Facebook [4, 9]
• Researcher, Dev, Ops: https://www.slideshare.net/syou6162/ss-88255142
32.
• IoT [8]
• (GDPR)
33.
• Data-driven product
• ML systems Production
34.
• [1] “My model has higher BLEU, can I ship it? The Joel Test for machine learning systems”, L. Park, 2017, ACML-AIMLP Workshop
• [2] “Hidden Technical Debt in Machine Learning Systems”, D. Sculley et al., NIPS 2015
• [3] “Rules of Machine Learning: Best Practices for ML Engineering”, M. Zinkevich
• [4] “TFX: A TensorFlow-Based Production-Scale Machine Learning Platform”, D. Baylor et al., KDD 2017
• [5] “What’s your ML test score? A rubric for ML production systems”, E. Breck et al., Reliable Machine Learning in the Wild, NIPS 2016 Workshop
• [6] , 2017, ML Ops Study #1
• [7] , , 2018, HACKER TACKLE 2018
• [8] “DevOps for models: How to manage millions of models in production—and at the edge”, T. Tung et al., Strata Data Singapore, 2017
• [9] “Applied Machine Learning at Facebook: A Datacenter Infrastructure Perspective”, K. Hazelwood et al., IEEE HPCA, 2018
35.
THANK YOU