# AI Track - Technology and Tools
Google Cloud Platform lets you build, train, and serve ML models serverlessly through a dedicated service: AI Platform. This service can do it all, but for the prediction-serving part, other Google Cloud Platform services offer different characteristics and can be more interesting for some use cases:
- BigQuery, for batch prediction on structured data
- Cloud Run, for online prediction, which brings Knative compatibility and therefore portability across different operators, and even on premises!

During this talk, use cases, demos, and implementation examples are presented, based on an already trained TensorFlow model. Comparisons are made in terms of deployment process, serving performance, team organisation and required skills, pricing, and the trade-off between portability and efficiency.
5. What is Serverless?
Operational model:
- No server management
- Fully managed security
- Pay only for usage
Programming model:
- Service-based
- Event-driven
- Portable
7. What is AI Platform on GCP?
A service covering the whole ML pipeline - explore data, prepare data, build model, train model, serve model - alongside the other GCP data services: Cloud Datalab, Cloud Dataproc, Cloud Dataprep, Cloud Dataflow, BigQuery, Cloud Data Fusion, and Datastudio.
10. Portable solution: containers
- Any language (.js, .rb, .go, .py, .sh, …)
- Any library
- Any binary
- Ecosystem of base images
- Industry standard
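The "any language, any binary" point can be illustrated with a minimal prediction server that such a container could run. This is only a sketch: the one real constraint it reflects is Cloud Run's container contract (listen on the port given by the `PORT` environment variable); the "model" is a hypothetical stand-in that just sums the inputs.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(instance):
    # Hypothetical stand-in for a real trained model: sum the inputs.
    return sum(instance["input"])


class Handler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON body: {"instances": [...]}
        length = int(self.headers["Content-Length"])
        payload = json.loads(self.rfile.read(length))
        result = [predict(i) for i in payload["instances"]]
        body = json.dumps({"predictions": result}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the example quiet


if os.environ.get("RUN_SERVER"):
    # Cloud Run injects the port to listen on via the PORT env variable.
    port = int(os.environ.get("PORT", 8080))
    HTTPServer(("", port), Handler).serve_forever()
```

Because the container only needs to speak HTTP on `$PORT`, the same image runs unchanged on fully managed Cloud Run, on Anthos, or on any Kubernetes cluster with Knative.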
17. Portability based on Knative
- Common API and runtime environment for serving workloads
- Implements learnings from Google and over 50 contributing companies
- Portability of experiences, tooling, and workloads between Knative environments - you can even run serverless on-prem
https://knative.dev
18. One experience, where you want it
- Cloud Run: fully managed; deploy your workloads and don't see the cluster.
- Cloud Run on Anthos: deploy into Anthos and run serverless side-by-side with your existing workloads.
- Knative Everywhere: use the same APIs and tooling anywhere you run Kubernetes with Knative.
19. Cloud Run summary
- 1 vCPU, 2 GB of memory
- Scale to 0
- Pay-as-you-use
- Portable serverless container
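The pay-as-you-use and scale-to-zero points can be made concrete with a rough cost model. The rates below are hypothetical placeholders, not real GCP prices; the only real mechanic reflected is that Cloud Run bills CPU and memory per 100 ms while a request is being handled, so with zero traffic the bill is zero.

```python
import math

# Hypothetical unit prices - check current GCP pricing, these are placeholders.
VCPU_SECOND = 0.000024        # $ per vCPU-second
GIB_SECOND = 0.0000025        # $ per GiB-second
PER_MILLION_REQUESTS = 0.40   # $ per million requests


def monthly_cost(requests: int, avg_latency_s: float,
                 vcpus: float = 1.0, memory_gib: float = 2.0) -> float:
    """Estimate a monthly Cloud Run bill under the hypothetical rates above.

    CPU and memory are billed in 100 ms increments, so each request's
    latency is rounded up to the next 0.1 s before multiplying.
    """
    billable_s = math.ceil(avg_latency_s * 10) / 10 * requests
    return (billable_s * vcpus * VCPU_SECOND
            + billable_s * memory_gib * GIB_SECOND
            + requests / 1_000_000 * PER_MILLION_REQUESTS)


# 1M predictions/month at 250 ms each on a 1 vCPU / 2 GiB instance.
print(round(monthly_cost(1_000_000, 0.25), 2))
# With scale to 0, no traffic means no cost at all:
print(monthly_cost(0, 0.25))
```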
21. What is the batch prediction process?
1. Extract data to files (BigQuery → Cloud Storage)
2. Run the batch model (AI Platform)
3. Store the results (Cloud Storage)
4. Load the results into the database (BigQuery)
Each step is billed ($ $ $ $).
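The extract step produces files for the model to read: AI Platform batch prediction jobs consume newline-delimited JSON instances from Cloud Storage. A minimal sketch of producing that file (the `{"input": [...]}` schema is a hypothetical model input, not from the talk):

```python
import json


def write_batch_input(instances, path):
    """Write prediction instances as newline-delimited JSON, the text
    format AI Platform batch prediction reads from Cloud Storage.
    Each line is one JSON-encoded instance for the model."""
    with open(path, "w") as f:
        for instance in instances:
            f.write(json.dumps(instance) + "\n")


# Rows extracted from BigQuery, ready to upload to a gs:// bucket.
rows = [{"input": [1.0, 2.0]}, {"input": [3.0, 4.0]}]
write_batch_input(rows, "batch_input.json")
```

The file would then be uploaded to a bucket and referenced when submitting the job, e.g. via the `--input-paths` and `--output-path` flags of `gcloud ai-platform jobs submit prediction`.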
23. Use a model in a query
INSERT INTO result.my_results
SELECT * FROM ML.PREDICT(
  MODEL `model.my_model`, (
    SELECT data AS input FROM `dataset.my_data`
  )
)
24. Batch prediction with BQML
The same four billed steps - extract data to files, run the batch model, store the results, load the results into the database - collapse into a single BigQuery query, with no round trip through Cloud Storage and AI Platform.
27. Batch prediction: AI Platform or BigQuery?
AI Platform:
- (Un)structured data
- Compute intensive - GPU
- Not only TensorFlow models
BigQuery:
- Structured data
- Data reachable with BQ
- No data engineering
Focus on the last part: serving an already trained model
- Prediction on data
- Inference on image/video/NLP
TensorFlow: made and used by Google; v2.0 released in September. TensorFlow Lite and TensorFlow.js run models on mobile and in the browser.
AI Platform online prediction machine types:
- Small: cheaper, scales to 0, quick deployment (about 60 s)
- Beta: powerful and customizable but expensive; bigger machines that don't scale to 0
- Sticky to GCP, and loads only your model: no binary allowed
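Whichever machine type serves the model, AI Platform online prediction takes a JSON request body of the form `{"instances": [...]}`. A minimal sketch of building that body (the feature names and values are hypothetical):

```python
import json


def prediction_request(instances):
    """Build the JSON body for a POST to the AI Platform online
    prediction endpoint:
    https://ml.googleapis.com/v1/projects/PROJECT/models/MODEL:predict
    """
    return json.dumps({"instances": instances})


body = prediction_request([{"input": [1.0, 2.0, 3.0]}])
print(body)
```

The same `{"instances": [...]}` shape is what TensorFlow Serving's REST API accepts, which is one reason a model container on Cloud Run can expose a near-identical contract.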