SlideShare une entreprise Scribd logo
1  sur  65
CONTAINER AND KUBERNETES
WITHOUT LIMITS
ANTJE BARTH
Advanced Spark and TensorFlow Meetup
O‘Reilly AI Conference, London
October 9th, 2018
2 © 2018 MapR Technologies, Inc. // MapR Confidential
Learn how a MODERN DATA PLATFORM can help to support
stateful applications in large containerized environments,
and how to handle persistent data
across multiple data centers or geographic locations.
#Kubernetes4Data
Today‘s Session
3 © 2018 MapR Technologies, Inc. // MapR Confidential
Who
ANTJE BARTH
Partner Engineer, MapR
abarth@mapr.com
antje-barth-413258bb
@anbarth
Chapter Lead Duesseldorf (Germany)
https://www.meetup.com/Women-in-Big-Data-Dusseldorf/
BIG DATA
ML/AI
CONTAINER
K8S
ADV. ANALYTICS
4 © 2018 MapR Technologies, Inc. // MapR Confidential
Agenda
QUICK INTRO / RECAP
MODERN DATA PLATFORM
• Data Persistence across data centers / geographic regions
• #Kubernetes4Data
AI BONUS TRACK - Kubernetes plays Cupid for Data Scientists and IT
CONTAINER ORCHESTRATION
• Kubernetes
• Challenges for stateful applications
CONTAINERS
• Architectural concepts
• Container challenges
5 © 2018 MapR Technologies, Inc. // MapR Confidential
Virtual Machines are Computers
in a Box
Containers are Applications
in a Box
6 © 2018 MapR Technologies, Inc. // MapR Confidential
hardware
os
hypervisor
vm
os
libs
app
vm
os
libs
app
hardware
os
container
libs
app
container
libs
app
container
libs
app
VM vs Container
7 © 2018 MapR Technologies, Inc. // MapR Confidential
Pets vs Cattle
- long lived
- name them
- care for them
- ephemeral
- brand them with #’s
- well.. vets are expensive
8 © 2018 MapR Technologies, Inc. // MapR Confidential
Containers
• Are lightweight
• Are stateless
• Are portable
• Targeted for developing applications
• Surely moving towards production
• Docker made it popular
… and added a whole lot of jargon for us to learn! J
9 © 2018 MapR Technologies, Inc. // MapR Confidential
But…
Containers have a problem
10 © 2018 MapR Technologies, Inc. // MapR Confidential
Challenges in using / deploying containers
Source: CNCF Survey, 2018.
https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/
• Cultural Changes with
Development Team
• Complexity
• Lack of Training
• Security
• Monitoring
• Storage
• Networking
11 © 2018 MapR Technologies, Inc. // MapR Confidential
Some of the things Docker can’t do
• Monitor running containers
• Handle dead containers
• Move containers so utilization improves
• Auto-scale container instances to handle load
• Solve port mapping hell
• …
12 © 2018 MapR Technologies, Inc. // MapR Confidential
You can never get away from pets
unless:
• You have an environment to
support cattle
• You handle the problem of
container state
Kubernetes
kubernetes (n.) - greek word for pilot or helm
14 © 2018 MapR Technologies, Inc. // MapR Confidential
Now home
at the CNCF!
Large-scale cluster management at Google with Borg, 2015.
https://ai.google/research/pubs/pub43438
Kubernetes started life as
a successor to Google’s
Borg project...
https://www.cncf.io/ https://kubernetes.io/
15 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes is an API and agents
The Kubernetes API provides containers with a
scheduling, configuration, network, and
storage
The Kubernetes runtime manages the containers
16 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
Kubernetes
17 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1
Kubernetes
Magical View of Kubernetes
Kubernetes starts application
containers “somewhere”
18 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
App 1 App 3
Kubernetes
Later containers may be started
elsewhere due to “affinities”
19 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
App 1 App 2 App 3
Kubernetes
Kubernetes provides super fast
naming via DNS so containers
can find each other
20 © 2018 MapR Technologies, Inc. // MapR Confidential
Note that you don’t think about
which machine at all
No more names from The Hobbit
Just cattle!
21 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes – Why is it so popular?
• There are many management software
solutions to create, manage & delete
containers with newer vendors
emerging everyday
• Kubernetes remains the leader with
83% (up from 77%)
• The ecosystem and developer
community augmented by Google’s
support gave Kubernetes the edge over
others
Source: CNCF Survey, 2018.
https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/
22 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes – an open, pluggable framework
23 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes – an open, pluggable framework
Source: CNCF landscape, see https://github.com/cncf/landscape and http://l.cncf.io
24 © 2018 MapR Technologies, Inc. // MapR Confidential
We still have a problem
25 © 2018 MapR Technologies, Inc. // MapR Confidential
State!
26 © 2018 MapR Technologies, Inc. // MapR Confidential
Problem with Containers and State
• State in containers messes things up
• Restarts lose the state
• Replicating state makes services complex
• Application developers just aren’t systems developers
• State life-cycle doesn’t match app life-cycle
• …
27 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
28 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
rpc
stream
LogFile
29 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
Kubernetes
rpc
stream
LogFile
We need
multiple
forms of
persistence!
30 © 2018 MapR Technologies, Inc. // MapR Confidential
Data platform
App 1 App 2 App 3
Kubernetes
rpc
31 © 2018 MapR Technologies, Inc. // MapR Confidential
What Does This Data Platform Need to Have?
Global namespace across entire Kubernetes cluster
• Between clusters as well if possible
All three forms of primitive persistence
• Files, streams, tables
Inherently scalable
• Performance, cardinality, locality
Uniform access and control
• Path names for all objects, identical permission scheme
32 © 2018 MapR Technologies, Inc. // MapR Confidential
The Data Platform needs
to be like Kubernetes.
For Data.
MapR Data Platform
34 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Data Platform
MAPR DATA PLATFORM
FILES / OBJECTS / TABLES / STREAMS APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA
DATA CENTER CLOUD MULTI-CLOUD EDGE KUBERNETES
COMMODITY
SERVER
VIRTUAL
MACHINE
IoT & Edge
AI / ML
ADV. ANALYTICS
ENTERPRISE
APPLICATIONS
Pod Pod Pod Pod
35 © 2018 MapR Technologies, Inc. // MapR Confidential
Scale. It distributes data across the cluster and offers a global namespace for a unified view of data
regardless of its physical location
High Availability. Offers configurable levels of replication to ensure data durability. In event of a failure,
all nodes participate to self-heal and reconstruct data automatically
Data Protection. End-to-end security, per volume Access control expressions, space efficient
snapshots, volume mirroring, offers several choices to build a data protection strategy
Intelligent Data Placement. Offers three different storage tiers with automated storage policies to
place data based on their SLAs
Edge, on-premises, Cloud: Can be deployed in on-premises datacenters, edge and on the cloud
MapR Data Platform
36 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Volumes
Volumes are logical units of management, holding files, directories, tables, messages.
WHAT CAN YOU DO WITH VOLUMES?
• Schedule snapshots
• Schedule mirrors
• Control data placement
• Access permissions
• Enforce volume quotas
• Manage performance
• Specify replication factor
Volumes:
Shared MapR Cluster
r : user:sally |
(group:research & group:managers)
MAPR ACCESS CONTROL EXPRESSIONS
/mktg /finance /projectx
37 © 2018 MapR Technologies, Inc. // MapR Confidential
ü Global data view in a single
namespace
ü Distributed data processing
ü Unified Security
ü Global Replication For Data
Distribution & DR
ü Bandwidth-aware to manage
global data flows
ü Simplify cross cloud application
development & deployment
Global Namespace – common path to connect to any data
Globally Protected
Globally Accessible
Globally Managed
Globally Replicated
Across Locations Across Clouds
/mapr
/us.mapr.com
/eu_cloud.mapr.com
/asia.mapr.com
/us_cloud.mapr.com
38 © 2018 MapR Technologies, Inc. // MapR Confidential
Automatically Synchronized Globally Distributed Data
Topic
Topic
Topic
On-Premises
S3
EDGE
DATA
PLATFORM
DATA
PLATFORM
DATA
PLATFORM
DATA
PLATFORM
Multi-Cloud Data Movement & Application Portability
Enabling Application and Data Portability
#Kubernetes4Data
39 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Persistent Application Client Container (PACC)
• Pre-built, certified container image
for connecting to MapR services
• Secure authentication at
container level, secure connection
• Extensible support for application
layers
• Available in Docker Hub, Dockerfile
for customizability
MapR POSIX Client
for Containers
MapR Converged
Client for
Containers
Space for Customer Application
MapR PACC
MAPR DATA PLATFORM
EVENT DATA
STREAMS
ANALYTICS & ML
ENGINES
OPERATIONAL
DATABASE
CLOUD-SCALE
FILE AND OBJECT
STORE
40 © 2018 MapR Technologies, Inc. // MapR Confidential
Containerized Microservices have real-time access to
files/tables/streams
Microservices
Databases/files
Microservices
Databases/files
Microservices
Databases/files
Microservices Microservices Microservices
Microservices Microservices Microservices
MAPR DATA PLATFORM
Stream Stream
MapR Data Fabric for Kubernetes
42 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR Data Fabric for Kubernetes
node
Pod
Pod
node
Pod
Pod
node
Pod
Pod
MAPR MAPR MAPR
MAPR DATA PLATFORM
• Integration with Kubernetes APIs,
packaged and run as a POSIX client on
each Kubernetes host
• MapR Volumes are mounted for
containers
• Persist data for containerized
applications
• Scale data and performance as
containers grow
• Highly available by leveraging replicas,
snapshots, mirroring of data
• Benefit from MapR tickets, for end-to-
end security
• Multi-tenant deployment and access
43 © 2018 MapR Technologies, Inc. // MapR Confidential
There are two ways to provision a volume:
1. Kubernetes Volume with Static Provisioner
This is used to mount an existing MapR Volume to Containers
managed by Kubernetes.
2. Kubernetes Persistent Volumes with Dynamic Provisioner
This is used to create and mount a new MapR Volume to Containers
managed by Kubernetes.
Kubernetes Integration via Volume Driver Plugin
44 © 2018 MapR Technologies, Inc. // MapR Confidential
pod
kubelet
docker
plugin
mapr
fuse
Example 1: You have a Postgres container that needs persistent
storage. Plugin mounts MapR path via fuse
Static Provisioning
• Kdf volume plugin
• Admin provisions
• Fast, uses Posix drivers
• Secured with MapR tickets
• MapR cluster can be external
to K8s
45 © 2018 MapR Technologies, Inc. // MapR Confidential
KUBERNETES (CLIENT HOST)
YOUR CONTAINER
MAPR VOLUME PLUGIN - POSIX CLIENT
K8S PERSISTANT VOLUME
MAPR DATA PLATFORM
1. Request Volume
2. Mount POSIX Volume
Mounting An Existing MapR Volume (Static Provisioner)
46 © 2018 MapR Technologies, Inc. // MapR Confidential
Static provisioning
47 © 2018 MapR Technologies, Inc. // MapR Confidential
pod
kubelet
docker
plugin mapr
fuse
provisioner rest
Example 2: You are testing a new container. You want the
storage in MapR automatically allocated for the container.
Dynamic provisioning
• Kdf provisioner
• Uses MapR REST API’s to
allocate/delete MapR
volumes
• Mounting is the same as
static provisioning
48 © 2018 MapR Technologies, Inc. // MapR Confidential
KUBERNETES (CLIENT HOST)
YOUR CONTAINER
MAPR VOLUME PLUGIN - POSIX CLIENT
K8S PERSISTANT VOLUME
MAPR DATA PLATFORM
1. Request Volume
5. Mount POSIX Volume
PERSISTENT VOLUME CLAIM
STORAGE CLASS
DYNAMIC PROVISIONER
2. Request Volume
3. Request Volume
Creating A New MapR Volume (Dynamic Provisioner)
Volume Claim binds the
volume created to the
container(s)
Storage Classes used by
Administrators express
the type, size and other
characteristics that the
volume should contain
49 © 2018 MapR Technologies, Inc. // MapR Confidential
Dynamic
provisioning
50 © 2018 MapR Technologies, Inc. // MapR Confidential
Consequences
Installation of plugin is K8S level operation
• No per-node attention required
Use of plugin is overlay operation
• No change needed for an container
• Any Helm chart can use the plugin for conventional file access
Can share storage/compute or isolate or scale independently
State is no longer a dirty word for
Kubernetes!J
51 © 2018 MapR Technologies, Inc. // MapR Confidential
Application
GLOBAL DATA MANAGEMENT
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
PodPod Pod ML/AI workloadsClassic ETL
Scheduling & Scaling
MapR Kubernetes Volume Driver
Containers and Kubernetes without limits
Creating an “Ubernetes” Platform with MapR
AI BONUS TRACK
Kubernetes plays Cupid
for Data Scientists and IT
53 © 2018 MapR Technologies, Inc. // MapR Confidential
Data Science Phases
Exploration Training Deployment Production
In this phase, the
executable code that is
used to train models is
developed and some
prototyping is done.
• Typically uses data
science notebooks
• Output is code
The executable training
code is run on very large
datasets.
• Phase where
compute powers
matters
• Output is a model
Models are deployed
into a framework that
allows for the scoring
of data.
• Can be done in
batch or real time
• Output is a
microservices
framework
Models are monitored
and updated in
production.
• Requires CI/CD
pipeline capability
• Output is “insights”
54 © 2018 MapR Technologies, Inc. // MapR Confidential
Data science workflows benefit from containerization in every phase of the
pipeline from exploration, training, and deploying models to production.
• For Exploration: containerization enables isolated personalized development
environments
• For Training: containerization provides compute agility and the ability to
iterate with varying parameters
• For Deployment: containerization provides the ability to create a robust
microservices architecture
Containerization is good for Machine Learning
55 © 2018 MapR Technologies, Inc. // MapR Confidential
Everything on One Cluster
ON-PREMISES, MULTI-CLOUD, IoT EDGE
COMMODITY
SERVER
VIRTUAL
MACHINE
IoT & Edge
MAPR DATA PLATFORM
APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA
Accessing Data In-Place
/f1
MAPR DATA PLATFORM
HDFS API
MAPR POSIX
CLIENT
MAPR CLIENTS
FOR CONTAINERS
MapR Makes Doing Data Science Easier
56 © 2018 MapR Technologies, Inc. // MapR Confidential
An Open Approach to Tooling
• Pre-built, certified container images connect
to MapR platform services
• Customizable using Volume Plugin and Dockerfile
to support any POSIX-compliant library or tool
• Provides a unified security model, enabling
secure connection between container and cluster
• High I/O throughput data connection to storage
layer with POSIX client
• Enables seamless multi-tenancy and job isolation MAPR DATA PLATFORM
Model A Model B
…..
For Tools
MAPR KUBERNETES VOLUME DRIVER
MAPR CLIENT
FOR
CONTAINERS
MAPR CLIENT
FOR
CONTAINERS
For Algorithms For Architectures
57 © 2018 MapR Technologies, Inc. // MapR Confidential
Kubernetes Namespace
• Used to manage and isolate cluster
resources
• Provides a multi-tenant architecture for
jobs, pods and deployments
Storage Namespace
• Can join data across architectural or
geographical divides
• Read/Write access to any dataset the user
has access to as if it were a local resource
• Data security and isolation at the user,
team, and tenant level
Kubernetes and Storage Namespaces
58 © 2018 MapR Technologies, Inc. // MapR Confidential
End to End Machine Learning on ALL of your Data
MAPR DATA PLATFORM
APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA
Exploration Training Deployment
A
B
Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools
59 © 2018 MapR Technologies, Inc. // MapR Confidential
End to End Machine Learning on ALL of your Data
Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools
MAPR DATA PLATFORM
APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA
Exploration Training Deployment
A
B
?
Summary
61 © 2018 MapR Technologies, Inc. // MapR Confidential
COMPUTE AGILITY
MAPR DATA PLATFORM
Containers & Kubernetes without limits!
MAPR KUBERNETES VOLUME PLUGIN
TENANT N
Application…..
TENANT 1
Application APP AGILITY
DATA AGILITY
DATA CENTER CLOUD MULTI-CLOUD KUBERNETES EDGE ACROSS
INFRASTRUCTURES
ENTERPRISE
APPLICATIONS
AI AND ML
ADV. ANALYTICS
Stateful app
container
MAPR POSIX
CLIENT FOR
CONTAINERS
Application
More information
63 © 2018 MapR Technologies, Inc. // MapR Confidential
MapR The Leading Data Platform for AI and Analytics
https://mapr.com/solutions/ai-analytics/
Blog: Containers, Kubernetes, and MapR: The Time is Now
https://mapr.com/blog/containers-kubernetes-and-mapr-the-time-is-now/
MapR Data Fabric for Kubernetes
https://mapr.com/solutions/data-fabric/kubernetes/
MapR Data Fabric for Kubernetes - Documentation
https://mapr.com/docs/60/PersistentStorage/kdf_overview.html
MapR Data Platform
64 © 2018 MapR Technologies, Inc. // MapR Confidential
O’Reilly (e)books!
Download the e-book
here:
https://mapr.com/ebook/
machine-learning-logistics/
by Ted Dunning and
Ellen Friedman
Just released at Strata
New York, Sept 2018
Download the e-book
here:
https://mapr.com/ebook/
ai-and-analytics-in-
production/
THANK YOU!
#MapR
#Kubernetes4Data

Contenu connexe

Tendances

How to Get Going with Kubernetes
How to Get Going with KubernetesHow to Get Going with Kubernetes
How to Get Going with KubernetesTed Dunning
 
Progress for big data in Kubernetes
Progress for big data in KubernetesProgress for big data in Kubernetes
Progress for big data in KubernetesTed Dunning
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016Mathieu Dumoulin
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformMapR Technologies
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data AnalyticsMapR Technologies
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapRThe World Bank
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...LEGATO project
 
2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetupGanesan Narayanasamy
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019VMware Tanzu
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataAdvanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataCarol McDonald
 
Applying Machine Learning to Live Patient Data
Applying Machine Learning to  Live Patient DataApplying Machine Learning to  Live Patient Data
Applying Machine Learning to Live Patient DataCarol McDonald
 
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DBStructured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DBCarol McDonald
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 

Tendances (20)

How to Get Going with Kubernetes
How to Get Going with KubernetesHow to Get Going with Kubernetes
How to Get Going with Kubernetes
 
Progress for big data in Kubernetes
Progress for big data in KubernetesProgress for big data in Kubernetes
Progress for big data in Kubernetes
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapR
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
SAMOS 2018: LEGaTO: first steps towards energy-efficient toolset for heteroge...
 
2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup2018 03 25 system ml ai and openpower meetup
2018 03 25 system ml ai and openpower meetup
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
Distributing big astronomical catalogues with Greenplum - Greenplum Summit 2019
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming DataAdvanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
 
Applying Machine Learning to Live Patient Data
Applying Machine Learning to  Live Patient DataApplying Machine Learning to  Live Patient Data
Applying Machine Learning to Live Patient Data
 
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DBStructured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 

Similaire à Container and Kubernetes without limits

Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limitsAntje Barth
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Matt Stubbs
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Carol McDonald
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Ellen Friedman
 
DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsDataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsEllen Friedman
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleIan Downard
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Chris Fregly
 
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community EditionMapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community EditionMapR Technologies
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes John Archer
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningTed Dunning
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018Ellen Friedman
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksJustin Brandenburg
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR Technologies
 
Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption Michelle Holley
 
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionCodemotion
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR Technologies
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...DataWorks Summit
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Matt Stubbs
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Mathieu Dumoulin
 

Similaire à Container and Kubernetes without limits (20)

Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limits
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018
 
DataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven OrganizationsDataOps: An Agile Method for Data-Driven Organizations
DataOps: An Agile Method for Data-Driven Organizations
 
The rise of microservices
The rise of microservicesThe rise of microservices
The rise of microservices
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
 
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community EditionMapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR 5.2: Getting More Value from the MapR Converged Community Edition
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
 
Predictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural NetworksPredictive Maintenance Using Recurrent Neural Networks
Predictive Maintenance Using Recurrent Neural Networks
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption Accelerating Edge Computing Adoption
Accelerating Edge Computing Adoption
 
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
 

Dernier

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 

Dernier (20)

Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 

Container and Kubernetes without limits

  • 1. CONTAINER AND KUBERNETES WITHOUT LIMITS ANTJE BARTH Advanced Spark and TensorFlow Meetup O‘Reilly AI Conference, London October 9th, 2018
  • 2. 2 © 2018 MapR Technologies, Inc. // MapR Confidential Learn how a MODERN DATA PLATFORM can help to support stateful applications in large containerized environments, and how to handle persistent data across multiple data centers or geographic locations. #Kubernetes4Data Today‘s Session
  • 3. 3 © 2018 MapR Technologies, Inc. // MapR Confidential Who ANTJE BARTH Partner Engineer, MapR abarth@mapr.com antje-barth-413258bb @anbarth Chapter Lead Duesseldorf (Germany) https://www.meetup.com/Women-in-Big-Data-Dusseldorf/ BIG DATA ML/AI CONTAINER K8S ADV. ANALYTICS
  • 4. 4 © 2018 MapR Technologies, Inc. // MapR Confidential Agenda QUICK INTRO / RECAP MODERN DATA PLATFORM • Data Persistence across data centers / geographic regions • #Kubernetes4Data AI BONUS TRACK - Kubernetes plays Cupid for Data Scientists and IT CONTAINER ORCHESTRATION • Kubernetes • Challenges for stateful applications CONTAINERS • Architectural concepts • Container challenges
  • 5. 5 © 2018 MapR Technologies, Inc. // MapR Confidential Virtual Machines are Computers in a Box Containers are Applications in a Box
  • 6. 6 © 2018 MapR Technologies, Inc. // MapR Confidential hardware os hypervisor vm os libs app vm os libs app hardware os container libs app container libs app container libs app VM vs Container
  • 7. 7 © 2018 MapR Technologies, Inc. // MapR Confidential Pets vs Cattle - long lived - name them - care for them - ephemeral - brand them with #’s - well.. vets are expensive
  • 8. 8 © 2018 MapR Technologies, Inc. // MapR Confidential Containers • Are lightweight • Are stateless • Are portable • Targeted for developing applications • Surely moving towards production • Docker made it popular … and added a whole lot of jargon for us to learn! J
  • 9. 9 © 2018 MapR Technologies, Inc. // MapR Confidential But… Containers have a problem
  • 10. 10 © 2018 MapR Technologies, Inc. // MapR Confidential Challenges in using / deploying containers Source: CNCF Survey, 2018. https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/ • Cultural Changes with Development Team • Complexity • Lack of Training • Security • Monitoring • Storage • Networking
  • 11. 11 © 2018 MapR Technologies, Inc. // MapR Confidential Some of the things Docker can’t do • Monitor running containers • Handle dead containers • Move containers so utilization improves • Auto-scale container instances to handle load • Solve port mapping hell • …
  • 12. 12 © 2018 MapR Technologies, Inc. // MapR Confidential You can never get away from pets unless: • You have an environment to support cattle • You handle the problem of container state
  • 13. Kubernetes kubernetes (n.) - greek word for pilot or helm
  • 14. 14 © 2018 MapR Technologies, Inc. // MapR Confidential Now home at the CNCF! Large-scale cluster management at Google with Borg, 2015. https://ai.google/research/pubs/pub43438 Kubernetes started life as a successor to Google’s Borg project... https://www.cncf.io/ https://kubernetes.io/
  • 15. 15 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes is an API and agents The Kubernetes API provides containers with a scheduling, configuration, network, and storage The Kubernetes runtime manages the containers
  • 16. 16 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes Kubernetes
  • 17. 17 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 Kubernetes Magical View of Kubernetes Kubernetes starts application containers “somewhere”
  • 18. 18 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 3 Kubernetes Later containers may be started elsewhere due to “affinities”
  • 19. 19 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 2 App 3 Kubernetes Kubernetes provides super fast naming via DNS so containers can find each other
  • 20. 20 © 2018 MapR Technologies, Inc. // MapR Confidential Note that you don’t think about which machine at all No more names from The Hobbit Just cattle!
  • 21. 21 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes – Why is it so popular? • There are many management software solutions to create, manage & delete containers with newer vendors emerging everyday • Kubernetes remains the leader with 83% (up from 77%) • The ecosystem and developer community augmented by Google’s support gave Kubernetes the edge over others Source: CNCF Survey, 2018. https://www.cncf.io/blog/2018/08/29/cncf-survey-use-of-cloud-native-technologies-in-production-has-grown-over-200-percent/
  • 22. 22 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes – an open, pluggable framework
  • 23. 23 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes – an open, pluggable framework Source: CNCF landscape, see https://github.com/cncf/landscape and http://l.cncf.io
  • 24. 24 © 2018 MapR Technologies, Inc. // MapR Confidential We still have a problem
  • 25. 25 © 2018 MapR Technologies, Inc. // MapR Confidential State!
  • 26. 26 © 2018 MapR Technologies, Inc. // MapR Confidential Problem with Containers and State • State in containers messes things up • Restarts lose the state • Replicating state makes services complex • Application developers just aren’t systems developers • State life-cycle doesn’t match app life-cycle • …
  • 27. 27 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes
  • 28. 28 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile
  • 29. 29 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 Kubernetes rpc stream LogFile We need multiple forms of persistence!
  • 30. 30 © 2018 MapR Technologies, Inc. // MapR Confidential Data platform App 1 App 2 App 3 Kubernetes rpc
  • 31. 31 © 2018 MapR Technologies, Inc. // MapR Confidential What Does This Data Platform Need to Have? Global namespace across entire Kubernetes cluster • Between clusters as well if possible All three forms of primitive persistence • Files, streams, tables Inherently scalable • Performance, cardinality, locality Uniform access and control • Path names for all objects, identical permission scheme
  • 32. 32 © 2018 MapR Technologies, Inc. // MapR Confidential The Data Platform needs to be like Kubernetes. For Data.
  • 34. 34 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Data Platform MAPR DATA PLATFORM FILES / OBJECTS / TABLES / STREAMS APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA DATA CENTER CLOUD MULTI-CLOUD EDGE KUBERNETES COMMODITY SERVER VIRTUAL MACHINE IoT & Edge AI / ML ADV. ANALYTICS ENTERPRISE APPLICATIONS Pod Pod Pod Pod
  • 35. 35 © 2018 MapR Technologies, Inc. // MapR Confidential Scale. It distributes data across the cluster and offers a global namespace for a unified view of data regardless of its physical location High Availability. Offers configurable levels of replication to ensure data durability. In event of a failure, all nodes participate to self-heal and reconstruct data automatically Data Protection. End-to-end security, per volume Access control expressions, space efficient snapshots, volume mirroring, offers several choices to build a data protection strategy Intelligent Data Placement. Offers three different storage tiers with automated storage policies to place data based on their SLAs Edge, on-premises, Cloud: Can be deployed in on-premises datacenters, edge and on the cloud MapR Data Platform
  • 36. 36 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Volumes Volumes are logical units of management, holding files, directories, tables, messages. WHAT CAN YOU DO WITH VOLUMES? • Schedule snapshots • Schedule mirrors • Control data placement • Access permissions • Enforce volume quotas • Manage performance • Specify replication factor Volumes: Shared MapR Cluster r : user:sally | (group:research & group:managers) MAPR ACCESS CONTROL EXPRESSIONS /mktg /finance /projectx
  • 37. 37 © 2018 MapR Technologies, Inc. // MapR Confidential ü Global data view in a single namespace ü Distributed data processing ü Unified Security ü Global Replication For Data Distribution & DR ü Bandwidth-aware to manage global data flows ü Simplify cross cloud application development & deployment Global Namespace – common path to connect to any data Globally Protected Globally Accessible Globally Managed Globally Replicated Across Locations Across Clouds /mapr /us.mapr.com /eu_cloud.mapr.com /asia.mapr.com /us_cloud.mapr.com
  • 38. 38 © 2018 MapR Technologies, Inc. // MapR Confidential Automatically Synchronized Globally Distributed Data Topic Topic Topic On-Premises S3 EDGE DATA PLATFORM DATA PLATFORM DATA PLATFORM DATA PLATFORM Multi-Cloud Data Movement & Application Portability Enabling Application and Data Portability #Kubernetes4Data
  • 39. 39 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Persistent Application Client Container (PACC) • Pre-built, certified container image for connecting to MapR services • Secure authentication at container level, secure connection • Extensible support for application layers • Available in Docker Hub, Dockerfile for customizability MapR POSIX Client for Containers MapR Converged Client for Containers Space for Customer Application MapR PACC MAPR DATA PLATFORM EVENT DATA STREAMS ANALYTICS & ML ENGINES OPERATIONAL DATABASE CLOUD-SCALE FILE AND OBJECT STORE
  • 40. 40 © 2018 MapR Technologies, Inc. // MapR Confidential Containerized Microservices have real-time access to files/tables/streams Microservices Databases/files Microservices Databases/files Microservices Databases/files Microservices Microservices Microservices Microservices Microservices Microservices MAPR DATA PLATFORM Stream Stream
  • 41. MapR Data Fabric for Kubernetes
  • 42. 42 © 2018 MapR Technologies, Inc. // MapR Confidential MapR Data Fabric for Kubernetes node Pod Pod node Pod Pod node Pod Pod MAPR MAPR MAPR MAPR DATA PLATFORM • Integration with Kubernetes APIs, packaged and run as a POSIX client on each Kubernetes host • MapR Volumes are mounted for containers • Persist data for containerized applications • Scale data and performance as containers grow • Highly available by leveraging replicas, snapshots, mirroring of data • Benefit from MapR tickets, for end-to- end security • Multi-tenant deployment and access
  • 43. 43 © 2018 MapR Technologies, Inc. // MapR Confidential There are two ways to provision a volume: 1. Kubernetes Volume with Static Provisioner This is used to mount an existing MapR Volume to Containers managed by Kubernetes. 2. Kubernetes Persistent Volumes with Dynamic Provisioner This is used to create and mount a new MapR Volume to Containers managed by Kubernetes. Kubernetes Integration via Volume Driver Plugin
  • 44. 44 © 2018 MapR Technologies, Inc. // MapR Confidential pod kubelet docker plugin mapr fuse Example 1: You have a Postgres container that needs persistent storage. Plugin mounts MapR path via fuse Static Provisioning • Kdf volume plugin • Admin provisions • Fast, uses Posix drivers • Secured with MapR tickets • MapR cluster can be external to K8s
  • 45. 45 © 2018 MapR Technologies, Inc. // MapR Confidential KUBERNETES (CLIENT HOST) YOUR CONTAINER MAPR VOLUME PLUGIN - POSIX CLIENT K8S PERSISTANT VOLUME MAPR DATA PLATFORM 1. Request Volume 2. Mount POSIX Volume Mounting An Existing MapR Volume (Static Provisioner)
  • 46. 46 © 2018 MapR Technologies, Inc. // MapR Confidential Static provisioning
  • 47. 47 © 2018 MapR Technologies, Inc. // MapR Confidential pod kubelet docker plugin mapr fuse provisioner rest Example 2: You are testing a new container. You want the storage in MapR automatically allocated for the container. Dynamic provisioning • Kdf provisioner • Uses MapR REST API’s to allocate/delete MapR volumes • Mounting is the same as static provisioning
  • 48. 48 © 2018 MapR Technologies, Inc. // MapR Confidential KUBERNETES (CLIENT HOST) YOUR CONTAINER MAPR VOLUME PLUGIN - POSIX CLIENT K8S PERSISTANT VOLUME MAPR DATA PLATFORM 1. Request Volume 5. Mount POSIX Volume PERSISTENT VOLUME CLAIM STORAGE CLASS DYNAMIC PROVISIONER 2. Request Volume 3. Request Volume Creating A New MapR Volume (Dynamic Provisioner) Volume Claim binds the volume created to the container(s) Storage Classes used by Administrators express the type, size and other characteristics that the volume should contain
  • 49. 49 © 2018 MapR Technologies, Inc. // MapR Confidential Dynamic provisioning
  • 50. 50 © 2018 MapR Technologies, Inc. // MapR Confidential Consequences Installation of plugin is K8S level operation • No per-node attention required Use of plugin is overlay operation • No change needed for an container • Any Helm chart can use the plugin for conventional file access Can share storage/compute or isolate or scale independently State is no longer a dirty word for Kubernetes!J
  • 51. 51 © 2018 MapR Technologies, Inc. // MapR Confidential Application GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud PodPod Pod ML/AI workloadsClassic ETL Scheduling & Scaling MapR Kubernetes Volume Driver Containers and Kubernetes without limits Creating an “Ubernetes” Platform with MapR
  • 52. AI BONUS TRACK Kubernetes plays Cupid for Data Scientists and IT
  • 53. 53 © 2018 MapR Technologies, Inc. // MapR Confidential Data Science Phases Exploration Training Deployment Production In this phase, the executable code that is used to train models is developed and some prototyping is done. • Typically uses data science notebooks • Output is code The executable training code is run on very large datasets. • Phase where compute powers matters • Output is a model Models are deployed into a framework that allows for the scoring of data. • Can be done in batch or real time • Output is a microservices framework Models are monitored and updated in production. • Requires CI/CD pipeline capability • Output is “insights”
  • 54. 54 © 2018 MapR Technologies, Inc. // MapR Confidential Data science workflows benefit from containerization in every phase of the pipeline from exploration, training, and deploying models to production. • For Exploration: containerization enables isolated personalized development environments • For Training: containerization provides compute agility and the ability to iterate with varying parameters • For Deployment: containerization provides the ability to create a robust microservices architecture Containerization is good for Machine Learning
  • 55. 55 © 2018 MapR Technologies, Inc. // MapR Confidential Everything on One Cluster ON-PREMISES, MULTI-CLOUD, IoT EDGE COMMODITY SERVER VIRTUAL MACHINE IoT & Edge MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Accessing Data In-Place /f1 MAPR DATA PLATFORM HDFS API MAPR POSIX CLIENT MAPR CLIENTS FOR CONTAINERS MapR Makes Doing Data Science Easier
  • 56. 56 © 2018 MapR Technologies, Inc. // MapR Confidential An Open Approach to Tooling • Pre-built, certified container images connect to MapR platform services • Customizable using Volume Plugin and Dockerfile to support any POSIX-compliant library or tool • Provides a unified security model, enabling secure connection between container and cluster • High I/O throughput data connection to storage layer with POSIX client • Enables seamless multi-tenancy and job isolation MAPR DATA PLATFORM Model A Model B ….. For Tools MAPR KUBERNETES VOLUME DRIVER MAPR CLIENT FOR CONTAINERS MAPR CLIENT FOR CONTAINERS For Algorithms For Architectures
  • 57. 57 © 2018 MapR Technologies, Inc. // MapR Confidential Kubernetes Namespace • Used to manage and isolate cluster resources • Provides a multi-tenant architecture for jobs, pods and deployments Storage Namespace • Can join data across architectural or geographical divides • Read/Write access to any dataset the user has access to as if it were a local resource • Data security and isolation at the user, team, and tenant level Kubernetes and Storage Namespaces
  • 58. 58 © 2018 MapR Technologies, Inc. // MapR Confidential End to End Machine Learning on ALL of your Data MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Exploration Training Deployment A B Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools
  • 59. 59 © 2018 MapR Technologies, Inc. // MapR Confidential End to End Machine Learning on ALL of your Data Leverage MapR to deploy and run data science workflow end-to-end using your favorite tools MAPR DATA PLATFORM APIs: NFS, POSIX, REST, S3, HDFS, HBASE, JSON, KAFKA Exploration Training Deployment A B ?
  • 61. 61 © 2018 MapR Technologies, Inc. // MapR Confidential COMPUTE AGILITY MAPR DATA PLATFORM Containers & Kubernetes without limits! MAPR KUBERNETES VOLUME PLUGIN TENANT N Application….. TENANT 1 Application APP AGILITY DATA AGILITY DATA CENTER CLOUD MULTI-CLOUD KUBERNETES EDGE ACROSS INFRASTRUCTURES ENTERPRISE APPLICATIONS AI AND ML ADV. ANALYTICS Stateful app container MAPR POSIX CLIENT FOR CONTAINERS Application
  • 63. 63 © 2018 MapR Technologies, Inc. // MapR Confidential MapR The Leading Data Platform for AI and Analytics https://mapr.com/solutions/ai-analytics/ Blog: Containers, Kubernetes, and MapR: The Time is Now https://mapr.com/blog/containers-kubernetes-and-mapr-the-time-is-now/ MapR Data Fabric for Kubernetes https://mapr.com/solutions/data-fabric/kubernetes/ MapR Data Fabric for Kubernetes - Documentation https://mapr.com/docs/60/PersistentStorage/kdf_overview.html MapR Data Platform
  • 64. 64 © 2018 MapR Technologies, Inc. // MapR Confidential O’Reilly (e)books! Download the e-book here: https://mapr.com/ebook/ machine-learning-logistics/ by Ted Dunning and Ellen Friedman Just released at Strata New York, Sept 2018 Download the e-book here: https://mapr.com/ebook/ ai-and-analytics-in- production/