Soumettre la recherche
Mettre en ligne
Container and Kubernetes without limits
•
0 j'aime
•
286 vues
Antje Barth
Suivre
Talk at Advanced Spark and Tensorflow Meetup @ O'Reilly AI London, October 2018
Lire moins
Lire la suite
Logiciels
Signaler
Partager
Signaler
Partager
1 sur 65
Recommandé
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Mathieu Dumoulin
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Mathieu Dumoulin
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
MapR Technologies
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
MapR Technologies
MapR and Machine Learning Primer
MapR and Machine Learning Primer
Mathieu Dumoulin
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
MapR Technologies
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Matt Stubbs
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Mathieu Dumoulin
Recommandé
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Mathieu Dumoulin
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Mathieu Dumoulin
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
MapR Technologies
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
MapR Technologies
MapR and Machine Learning Primer
MapR and Machine Learning Primer
Mathieu Dumoulin
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
MapR Technologies
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Big Data LDN 2018: PROGRESS FOR BIG DATA IN KUBERNETES
Matt Stubbs
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Mathieu Dumoulin
How to Get Going with Kubernetes
How to Get Going with Kubernetes
Ted Dunning
Progress for big data in Kubernetes
Progress for big data in Kubernetes
Ted Dunning
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
MapR Technologies
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
MapR Technologies
Meruvian - Introduction to MapR
Meruvian - Introduction to MapR
The World Bank
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
LEGATO project
2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup
Ganesan Narayanasamy
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
MapR Technologies
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
MapR Technologies
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
VMware Tanzu
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
Carol McDonald
Applying Machine Learning to Live Patient Data
Applying Machine Learning to Live Patient Data
Carol McDonald
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Carol McDonald
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Technologies
Containers and Kubernetes without limits
Containers and Kubernetes without limits
Antje Barth
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Matt Stubbs
Contenu connexe
Tendances
How to Get Going with Kubernetes
How to Get Going with Kubernetes
Ted Dunning
Progress for big data in Kubernetes
Progress for big data in Kubernetes
Ted Dunning
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
MapR Technologies
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
MapR Technologies
Meruvian - Introduction to MapR
Meruvian - Introduction to MapR
The World Bank
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
LEGATO project
2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup
Ganesan Narayanasamy
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
MapR Technologies
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
MapR Technologies
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
VMware Tanzu
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
Carol McDonald
Applying Machine Learning to Live Patient Data
Applying Machine Learning to Live Patient Data
Carol McDonald
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Carol McDonald
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Technologies
Tendances
(20)
How to Get Going with Kubernetes
How to Get Going with Kubernetes
Progress for big data in Kubernetes
Progress for big data in Kubernetes
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
Meruvian - Introduction to MapR
Meruvian - Introduction to MapR
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
Applying Machine Learning to Live Patient Data
Applying Machine Learning to Live Patient Data
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
Similaire à Container and Kubernetes without limits
Containers and Kubernetes without limits
Containers and Kubernetes without limits
Antje Barth
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Matt Stubbs
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018
Ellen Friedman
DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven Organizations
Ellen Friedman
The rise of microservices
The rise of microservices
Cloud Technology Experts
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
Ian Downard
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Chris Fregly
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR Technologies
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
John Archer
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
Ted Dunning
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
Ellen Friedman
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
Justin Brandenburg
MapR and Cisco Make IT Better
MapR and Cisco Make IT Better
MapR Technologies
Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption
Michelle Holley
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR Technologies
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
DataWorks Summit
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Matt Stubbs
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Mathieu Dumoulin
Similaire à Container and Kubernetes without limits
(20)
Containers and Kubernetes without limits
Containers and Kubernetes without limits
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018
DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven Organizations
The rise of microservices
The rise of microservices
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community Edition
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
MapR and Cisco Make IT Better
MapR and Cisco Make IT Better
Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Dernier
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
Willy Marroquin (WillyDevNET)
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
MyIntelliSource, Inc.
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
aagamshah0812
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
kellynguyen01
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
ComplianceQuest1
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
Wave PLM
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
Fatema Valibhai
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
panagenda
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
MyIntelliSource, Inc.
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
ABDERRAOUF MEHENNI
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Alberto González Trastoy
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
shikhaohhpro
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
kalichargn70th171
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
Jhone kinadey
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Steffen Staab
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
OnePlan Solutions
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
harshavardhanraghave
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
ThousandEyes
Dernier
(20)
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
Container and Kubernetes without limits
1.
CONTAINER AND KUBERNETES WITHOUT
LIMITS ANTJE BARTH Advanced Spark and TensorFlow Meetup O‘Reilly AI Conference, London October 9th, 2018
2.
2 © 2018
MapR Technologies, Inc. // MapR Confidential Learn how a MODERN DATA PLATFORM can help to support stateful applications in large containerized environments, and how to handle persistent data across multiple data centers or geographic locations. #Kubernetes4Data Today‘s Session
3.
3 © 2018
MapR Technologies, Inc. // MapR Confidential Who ANTJE BARTH Partner Engineer, MapR abarth@mapr.com antje-barth-413258bb @anbarth Chapter Lead Duesseldorf (Germany) https://www.meetup.com/Women-in-Big-Data-Dusseldorf/ BIG DATA ML/AI CONTAINER K8S ADV. ANALYTICS
4.
4 © 2018
MapR Technologies, Inc. // MapR Confidential Agenda QUICK INTRO / RECAP MODERN DATA PLATFORM • Data Persistence across data centers / geographic regions • #Kubernetes4Data AI BONUS TRACK - Kubernetes plays Cupid for Data Scientists and IT CONTAINER ORCHESTRATION • Kubernetes • Challenges for stateful applications CONTAINERS • Architectural concepts • Container challenges
5.
5 © 2018
MapR Technologies, Inc. // MapR Confidential Virtual Machines are Computers in a Box Containers are Applications in a Box
6.
6 © 2018
MapR Technologies, Inc. // MapR Confidential hardware os hypervisor vm os libs app vm os libs app hardware os container libs app container libs app container libs app VM vs Container
7.
7 © 2018
MapR Technologies, Inc. // MapR Confidential Pets vs Cattle - long lived - name them - care for them - ephemeral - brand them with #’s - well.. vets are expensive
8.
8 © 2018
MapR Technologies, Inc. // MapR Confidential Containers • Are lightweight • Are stateless • Are portable • Targeted for developing applications • Surely moving towards production • Docker made it popular … and added a whole lot of jargon for us to learn! J
9.
9 © 2018
MapR Technologies, Inc. // MapR Confidential But… Containers have a problem
10.
10 © 2018
MapR Technologies, Inc. // MapR Confidential Challenges in using / deploying containers Source: CNCF Survey, 2018. https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/ • Cultural Changes with Development Team • Complexity • Lack of Training • Security • Monitoring • Storage • Networking
11.
11 © 2018
MapR Technologies, Inc. // MapR Confidential Some of the things Docker can’t do • Monitor running containers • Handle dead containers • Move containers so utilization improves • Auto-scale container instances to handle load • Solve port mapping hell • …
12.
12 © 2018
MapR Technologies, Inc. // MapR Confidential You can never get away from pets unless: • You have an environment to support cattle • You handle the problem of container state
13.
Kubernetes kubernetes (n.) -
greek word for pilot or helm
14.
14 © 2018
MapR Technologies, Inc. // MapR Confidential Now home at the CNCF! Large-scale cluster management at Google with Borg, 2015. https://ai.google/research/pubs/pub43438 Kubernetes started life as a successor to Google’s Borg project... https://www.cncf.io/ https://kubernetes.io/
15.
15 © 2018
MapR Technologies, Inc. // MapR Confidential Kubernetes is an API and agents The Kubernetes API provides containers with a scheduling, configuration, network, and storage The Kubernetes runtime manages the containers
16.
16 © 2018
MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes Kubernetes
17.
17 © 2018
MapR Technologies, Inc. // MapR Confidential App 1 Kubernetes Magical View of Kubernetes Kubernetes starts application containers “somewhere”
18.
18 © 2018
MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 3 Kubernetes Later containers may be started elsewhere due to “affinities”
19.
19 © 2018
MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 2 App 3 Kubernetes Kubernetes provides super fast naming via DNS so containers can find each other
20.
20 © 2018
MapR Technologies, Inc. // MapR Confidential Note that you don’t think about which machine at all No more names from The Hobbit Just cattle!
21.
21 © 2018
MapR Technologies, Inc. // MapR Confidential Kubernetes – Why is it so popular? • There are many management software solutions to create, manage & delete containers with newer vendors emerging everyday • Kubernetes remains the leader with 83% (up from 77%) • The ecosystem and developer community augmented by Google’s support gave Kubernetes the edge over others Source: CNCF Survey, 2018. https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/
22.
22 © 2018
MapR Technologies, Inc. // MapR Confidential Kubernetes – an open, pluggable framework
23.
23 © 2018
MapR Technologies, Inc. // MapR Confidential Kubernetes – an open, pluggable framework Source: CNCF landscape, see https://github.com/cncf/landscape and http://l.cncf.io
24.
24 © 2018
MapR Technologies, Inc. // MapR Confidential We still have a problem
25.
25 © 2018
MapR Technologies, Inc. // MapR Confidential State!
26.
26 © 2018
MapR Technologies, Inc. // MapR Confidential Problem with Containers and State • State in containers messes things up • Restarts lose the state • Replicating state makes services complex • Application developers just aren’t systems developers • State life-cycle doesn’t match app life-cycle • …
27.
27 © 2018
MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes
28.
28 © 2018
MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile
29.
29 © 2018
MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile We need multiple forms of persistence!
30.
30 © 2018
MapR Technologies, Inc. // MapR Confidential Data platform App 1 App 2 App 3 Kubernetes rpc
31.
31 © 2018
MapR Technologies, Inc. // MapR Confidential What Does This Data Platform Need to Have? Global namespace across entire Kubernetes cluster • Between clusters as well if possible All three forms of primitive persistence • Files, streams, tables Inherently scalable • Performance, cardinality, locality Uniform access and control • Path names for all objects, identical permission scheme
32.
32 © 2018
MapR Technologies, Inc. // MapR Confidential The Data Platform needs to be like Kubernetes. For Data.
33.
MapR Data Platform
34.
34 © 2018
MapR Technologies, Inc. // MapR Confidential MapR Data Platform MAPR DATA PLATFORM FILES / OBJECTS / TABLES / STREAMS APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA DATA CENTER CLOUD MULTI-CLOUD EDGE KUBERNETES COMMODITY SERVER VIRTUAL MACHINE IoT & Edge AI / ML ADV. ANALYTICS ENTERPRISE APPLICATIONS Pod Pod Pod Pod
35.
35 © 2018
MapR Technologies, Inc. // MapR Confidential Scale. It distributes data across the cluster and offers a global namespace for a unified view of data regardless of its physical location High Availability. Offers configurable levels of replication to ensure data durability. In event of a failure, all nodes participate to self-heal and reconstruct data automatically Data Protection. End-to-end security, per volume Access control expressions, space efficient snapshots, volume mirroring, offers several choices to build a data protection strategy Intelligent Data Placement. Offers three different storage tiers with automated storage policies to place data based on their SLAs Edge, on-premises, Cloud: Can be deployed in on-premises datacenters, edge and on the cloud MapR Data Platform
36.
36 © 2018
MapR Technologies, Inc. // MapR Confidential MapR Volumes Volumes are logical units of management, holding files, directories, tables, messages. WHAT CAN YOU DO WITH VOLUMES? • Schedule snapshots • Schedule mirrors • Control data placement • Access permissions • Enforce volume quotas • Manage performance • Specify replication factor Volumes: Shared MapR Cluster r : user:sally | (group:research & group:managers) MAPR ACCESS CONTROL EXPRESSIONS /mktg /finance /projectx
37.
37 © 2018
MapR Technologies, Inc. // MapR Confidential ü Global data view in a single namespace ü Distributed data processing ü Unified Security ü Global Replication For Data Distribution & DR ü Bandwidth-aware to manage global data flows ü Simplify cross cloud application development & deployment Global Namespace – common path to connect to any data Globally Protected Globally Accessible Globally Managed Globally Replicated Across Locations Across Clouds /mapr /us.mapr.com /eu_cloud.mapr.com /asia.mapr.com /us_cloud.mapr.com
38.
38 © 2018
MapR Technologies, Inc. // MapR Confidential Automatically Synchronized Globally Distributed Data Topic Topic Topic On-Premises S3 EDGE DATA PLATFORM DATA PLATFORM DATA PLATFORM DATA PLATFORM Multi-Cloud Data Movement & Application Portability Enabling Application and Data Portability #Kubernetes4Data
39.
39 © 2018
MapR Technologies, Inc. // MapR Confidential MapR Persistent Application Client Container (PACC) • Pre-built, certified container image for connecting to MapR services • Secure authentication at container level, secure connection • Extensible support for application layers • Available in Docker Hub, Dockerfile for customizability MapR POSIX Client for Containers MapR Converged Client for Containers Space for Customer Application MapR PACC MAPR DATA PLATFORM EVENT DATA STREAMS ANALYTICS & ML ENGINES OPERATIONAL DATABASE CLOUD-SCALE FILE AND OBJECT STORE
40.
40 © 2018
MapR Technologies, Inc. // MapR Confidential Containerized Microservices have real-time access to files/tables/streams Microservices Databases/files Microservices Databases/files Microservices Databases/files Microservices Microservices Microservices Microservices Microservices Microservices MAPR DATA PLATFORM Stream Stream
41.
MapR Data Fabric
for Kubernetes
42.
42 © 2018
MapR Technologies, Inc. // MapR Confidential MapR Data Fabric for Kubernetes node Pod Pod node Pod Pod node Pod Pod MAPR MAPR MAPR MAPR DATA PLATFORM • Integration with Kubernetes APIs, packaged and run as a POSIX client on each Kubernetes host • MapR Volumes are mounted for containers • Persist data for containerized applications • Scale data and performance as containers grow • Highly available by leveraging replicas, snapshots, mirroring of data • Benefit from MapR tickets, for end-to- end security • Multi-tenant deployment and access
43.
43 © 2018
MapR Technologies, Inc. // MapR Confidential There are two ways to provision a volume: 1. Kubernetes Volume with Static Provisioner This is used to mount an existing MapR Volume to Containers managed by Kubernetes. 2. Kubernetes Persistent Volumes with Dynamic Provisioner This is used to create and mount a new MapR Volume to Containers managed by Kubernetes. Kubernetes Integration via Volume Driver Plugin
44.
44 © 2018
MapR Technologies, Inc. // MapR Confidential pod kubelet docker plugin mapr fuse Example 1: You have a Postgres container that needs persistent storage. Plugin mounts MapR path via fuse Static Provisioning • Kdf volume plugin • Admin provisions • Fast, uses Posix drivers • Secured with MapR tickets • MapR cluster can be external to K8s
45.
45 © 2018
MapR Technologies, Inc. // MapR Confidential KUBERNETES (CLIENT HOST) YOUR CONTAINER MAPR VOLUME PLUGIN - POSIX CLIENT K8S PERSISTANT VOLUME MAPR DATA PLATFORM 1. Request Volume 2. Mount POSIX Volume Mounting An Existing MapR Volume (Static Provisioner)
46.
46 © 2018
MapR Technologies, Inc. // MapR Confidential Static provisioning
47.
47 © 2018
MapR Technologies, Inc. // MapR Confidential pod kubelet docker plugin mapr fuse provisioner rest Example 2: You are testing a new container. You want the storage in MapR automatically allocated for the container. Dynamic provisioning • Kdf provisioner • Uses MapR REST API’s to allocate/delete MapR volumes • Mounting is the same as static provisioning
48.
48 © 2018
MapR Technologies, Inc. // MapR Confidential KUBERNETES (CLIENT HOST) YOUR CONTAINER MAPR VOLUME PLUGIN - POSIX CLIENT K8S PERSISTANT VOLUME MAPR DATA PLATFORM 1. Request Volume 5. Mount POSIX Volume PERSISTENT VOLUME CLAIM STORAGE CLASS DYNAMIC PROVISIONER 2. Request Volume 3. Request Volume Creating A New MapR Volume (Dynamic Provisioner) Volume Claim binds the volume created to the container(s) Storage Classes used by Administrators express the type, size and other characteristics that the volume should contain
49.
49 © 2018
MapR Technologies, Inc. // MapR Confidential Dynamic provisioning
50.
50 © 2018
MapR Technologies, Inc. // MapR Confidential Consequences Installation of plugin is K8S level operation • No per-node attention required Use of plugin is overlay operation • No change needed for an container • Any Helm chart can use the plugin for conventional file access Can share storage/compute or isolate or scale independently State is no longer a dirty word for Kubernetes!J
51.
51 © 2018
MapR Technologies, Inc. // MapR Confidential Application GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud PodPod Pod ML/AI workloadsClassic ETL Scheduling & Scaling MapR Kubernetes Volume Driver Containers and Kubernetes without limits Creating an “Ubernetes” Platform with MapR
52.
AI BONUS TRACK Kubernetes
plays Cupid for Data Scientists and IT
53.
53 © 2018
MapR Technologies, Inc. // MapR Confidential Data Science Phases Exploration Training Deployment Production In this phase, the executable code that is used to train models is developed and some prototyping is done. • Typically uses data science notebooks • Output is code The executable training code is run on very large datasets. • Phase where compute powers matters • Output is a model Models are deployed into a framework that allows for the scoring of data. • Can be done in batch or real time • Output is a microservices framework Models are monitored and updated in production. • Requires CI/CD pipeline capability • Output is “insights”
54.
54 © 2018
MapR Technologies, Inc. // MapR Confidential Data science workflows benefit from containerization in every phase of the pipeline from exploration, training, and deploying models to production. • For Exploration: containerization enables isolated personalized development environments • For Training: containerization provides compute agility and the ability to iterate with varying parameters • For Deployment: containerization provides the ability to create a robust microservices architecture Containerization is good for Machine Learning
55.
55 © 2018
MapR Technologies, Inc. // MapR Confidential Everything on One Cluster ON-PREMISES, MULTI-CLOUD, IoT EDGE COMMODITY SERVER VIRTUAL MACHINE IoT & Edge MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Accessing Data In-Place /f1 MAPR DATA PLATFORM HDFS API MAPR POSIX CLIENT MAPR CLIENTS FOR CONTAINERS MapR Makes Doing Data Science Easier
56.
56 © 2018
MapR Technologies, Inc. // MapR Confidential An Open Approach to Tooling • Pre-built, certified container images connect to MapR platform services • Customizable using Volume Plugin and Dockerfile to support any POSIX-compliant library or tool • Provides a unified security model, enabling secure connection between container and cluster • High I/O throughput data connection to storage layer with POSIX client • Enables seamless multi-tenancy and job isolation MAPR DATA PLATFORM Model A Model B ….. For Tools MAPR KUBERNETES VOLUME DRIVER MAPR CLIENT FOR CONTAINERS MAPR CLIENT FOR CONTAINERS For Algorithms For Architectures
57.
57 © 2018
MapR Technologies, Inc. // MapR Confidential Kubernetes Namespace • Used to manage and isolate cluster resources • Provides a multi-tenant architecture for jobs, pods and deployments Storage Namespace • Can join data across architectural or geographical divides • Read/Write access to any dataset the user has access to as if it were a local resource • Data security and isolation at the user, team, and tenant level Kubernetes and Storage Namespaces
58.
58 © 2018
MapR Technologies, Inc. // MapR Confidential End to End Machine Learning on ALL of your Data MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Exploration Training Deployment A B Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools
59.
59 © 2018
MapR Technologies, Inc. // MapR Confidential End to End Machine Learning on ALL of your Data Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Exploration Training Deployment A B ?
60.
Summary
61.
61 © 2018
MapR Technologies, Inc. // MapR Confidential COMPUTE AGILITY MAPR DATA PLATFORM Containers & Kubernetes without limits! MAPR KUBERNETES VOLUME PLUGIN TENANT N Application….. TENANT 1 Application APP AGILITY DATA AGILITY DATA CENTER CLOUD MULTI-CLOUD KUBERNETES EDGE ACROSS INFRASTRUCTURES ENTERPRISE APPLICATIONS AI AND ML ADV. ANALYTICS Stateful app container MAPR POSIX CLIENT FOR CONTAINERS Application
62.
More information
63.
63 © 2018
MapR Technologies, Inc. // MapR Confidential MapR The Leading Data Platform for AI and Analytics https://mapr.com/solutions/ai-analytics/ Blog: Containers, Kubernetes, and MapR: The Time is Now https://mapr.com/blog/containers-kubernetes-and-mapr-the-time-is-now/ MapR Data Fabric for Kubernetes https://mapr.com/solutions/data-fabric/kubernetes/ MapR Data Fabric for Kubernetes - Documentation https://mapr.com/docs/60/PersistentStorage/kdf_overview.html MapR Data Platform
64.
64 © 2018
MapR Technologies, Inc. // MapR Confidential O’Reilly (e)books! Download the e-book here: https://mapr.com/ebook/ machine-learning-logistics/ by Ted Dunning and Ellen Friedman Just released at Strata New York, Sept 2018 Download the e-book here: https://mapr.com/ebook/ ai-and-analytics-in- production/
65.
THANK YOU! #MapR #Kubernetes4Data