SlideShare une entreprise Scribd logo
1  sur  61
Télécharger pour lire hors ligne
Datacenter Computing 

with Apache Mesos	



シリコンバレー日本人駐在員Meetup

2014-03-14	



Paco Nathan 

http://liber118.com/pxn/

@pacoid
A Big Idea
!
Have you heard about 

“data democratization” ? ? ?	

	

	

 making data available

	

	

 	

 throughout more of the organization
!
Have you heard about 

“data democratization” ? ? ?	

	

	

 making data available

	

	

 	

 throughout more of the organization	

!
Then how would you handle 

“cluster democratization” ? ? ?	

	

	

 making data+resources available

	

	

 	

 throughout	

more of the organization
!
Have you heard about 

“data democratization” ? ? ?	

	

	

 making data available

	

	

 	

 throughout more of the organization	

!
Then how would you handle 

“cluster democratization” ? ? ?	

	

	

 making data+resources available

	

	

 	

 throughout	

more of the organization
In other words, 

how to remove silos…
Lessons

from Google
Datacenter Computing	

Google has been doing datacenter computing for years, 

to address the complexities of large-scale data workflows:	

• leveraging the modern kernel: isolation in lieu of VMs	

• “most (>80%) jobs are batch jobs, but the majority 

of resources (55–80%) are allocated to service jobs”	

• mixed workloads, multi-tenancy	

• relatively high utilization rates	

• JVM? not so much…	

• reality: scheduling batch is simple; 

scheduling services is hard/expensive
The Modern Kernel: Top Linux Contributors…	

arstechnica.com/information-technology/2013/09/...
“Return of the Borg”	

Return of the Borg: HowTwitter Rebuilt Google’s SecretWeapon

Cade Metz

wired.com/wiredenterprise/2013/03/google-
borg-twitter-mesos	

!
The Datacenter as a Computer: An Introduction 

to the Design ofWarehouse-Scale Machines	

Luiz André Barroso, Urs Hölzle	

research.google.com/pubs/pub35290.html	

!
!
2011 GAFS Omega

John Wilkes, et al.

youtu.be/0ZFMlO98Jkc
Google describes the technology…	

Omega: flexible, scalable schedulers for large compute clusters	

Malte Schwarzkopf,Andy Konwinski, Michael Abd-El-Malek, John Wilkes	

eurosys2013.tudos.org/wp-content/uploads/2013/paper/
Schwarzkopf.pdf
Google describes the business case…	

Taming LatencyVariability

Jeff Dean

plus.google.com/u/0/+ResearchatGoogle/posts/C1dPhQhcDRv
Commercial OS Cluster Schedulers	

!
• IBM Platform Symphony

• Microsoft Autopilot	

!


Arguably, some grid controllers 

are quite notable in-category:	

• Univa Grid Engine (formerly SGE)

• Condor	

• etc.
Emerging

at Berkeley
Beyond Hadoop	

Hadoop – an open source solution for fault-tolerant
parallel processing of batch jobs at scale, based on
commodity hardware… however, other priorities have
emerged for the analytics lifecycle:	

• apps require integration beyond Hadoop	

• multiple topologies, mixed workloads, multi-tenancy	

• significant disruptions in h/w cost/performance
curves	

• higher utilization	

• lower latency	

• highly-available, long running services	

• more than “Just JVM” – e.g., Python growth
Beyond Hadoop	

Hadoop – an open source solution for fault-tolerant
parallel processing of batch jobs at scale, based on
commodity hardware… however, other priorities have
emerged for the
• apps require integration beyond Hadoop	

• multiple topologies, mixed workloads, multi-tenancy	

• significant disruptions in h/w cost/performance
curves	

• higher utilization	

• lower latency	

• highly-available, long running services	

• more than “Just JVM” – e.g., Python growth
keep in mind priorities for
interdisciplinary efforts, to
break down silos – extending
beyond a de facto “priesthood”
of data engineering
Mesos – open source datacenter computing	

a common substrate for cluster computing	

mesos.apache.org	

heterogenous assets in your datacenter or cloud 

made available as a homogenous set of resources	

• top-level Apache project	

• scalability to 10,000s of nodes	

• obviates the need for virtual machines	

• isolation (pluggable) for CPU, RAM, I/O, FS, etc.	

• fault-tolerant leader election based on Zookeeper	

• APIs in C++, Java, Python, Go	

• web UI for inspecting cluster state	

• available for Linux, OpenSolaris, Mac OSX
What are the costs of Virtualization?
benchmark	

type
OpenVZ	

improvement
mixed workloads 210%-300%
LAMP (related) 38%-200%
I/O throughput 200%-500%
response time order magnitude
more pronounced 

at higher loads
What are the costs of Single Tenancy?
0%
25%
50%
75%
100%
RAILS CPU
LOAD
MEMCACHED
CPU LOAD
0%
25%
50%
75%
100%
HADOOP CPU
LOAD
0%
25%
50%
75%
100%
t t
0%
25%
50%
75%
100%
Rails
Memcached
Hadoop
COMBINED CPU LOAD (RAILS,
MEMCACHED, HADOOP)
Arguments for Datacenter Computing	

rather than running several specialized clusters, each 

at relatively low utilization rates, instead run many 

mixed workloads 	

obvious benefits are realized in terms of:	

• scalability, elasticity, fault tolerance, performance, utilization	

• reduced equipment capex, Ops overhead, etc.	

• reduced licensing, eliminating need forVMs or potential 

vendor lock-in	

subtle benefits – arguably, more important for Enterprise IT:	

• reduced time for engineers to ramp up new services at scale	

• reduced latency between batch and services, enabling new 

high ROI use cases	

• enables Dev/Test apps to run safely on a Production cluster
Analogies and
Architecture
Prior Practice: Dedicated Servers	

• low utilization rates	

• longer time to ramp up new services
DATACENTER
Prior Practice: Virtualization	

DATACENTER PROVISIONED VMS
• even more machines to manage	

• substantial performance decrease 

due to virtualization	

• VM licensing costs
Prior Practice: Static Partitioning
STATIC PARTITIONING
• even more machines to manage	

• substantial performance decrease 

due to virtualization	

• VM licensing costs	

• static partitioning limits elasticity
DATACENTER
MESOS
Mesos: One Large Pool of Resources	

“We wanted people to be able to program 

for the datacenter just like they program 

for their laptop."	

!
Ben Hindman
DATACENTER
Frameworks Integrated with Mesos	

Continuous Integration:

Jenkins, GitLab
Big Data:

Hadoop, Spark, Storm, Kafka, Cassandra,

Hypertable, MPI
Python workloads:

DPark, Exelixi
Meta-Frameworks / HA Services:

Aurora, Marathon
Distributed Cron:

Chronos
Containers:

Docker
!
Fault-tolerant distributed systems…	

…written in 100-300 lines of 

C++, Java/Scala, Python, Go, etc.	

…building blocks, if you will	

!
Q: required lines of network code?	

A: probably none
Kernel
Apps
servicesbatch
Frameworks
Python
JVM
C
++
Workloads
distributed file system
Chronos
DFS
distributed resources: CPU, RAM, I/O, FS, rack locality, etc. Cluster
Storm
Kafka JBoss Django RailsSharkImpalaScalding
Marathon
SparkHadoopMPI
MySQL
Mesos – architecture
Mesos – architecture	

HDFS, distrib file system
Mesos, distrib kernel
meta-frameworks: Aurora, Marathon
frameworks: Spark, Storm,
MPI, Jenkins, etc.
task schedulers: Chronos, etc.
APIs: C++, JVM, Py, Go
apps: HA services, web apps, batch
jobs, scripts, etc.
Linux: libcgroup, libprocess, libev, etc.
Mesos – dynamics	

Mesos
distrib kernel
Marathon
distrib init.d
Chronos
distrib cron
distrib
frameworks
HA
services
scheduled
apps
Mesos – dynamics	

resource
offers
distributed
framework
Scheduler Executor Executor Executor
Mesos
slave
Mesos
slave
Mesos
slave
distributed
kernel
available resources
Mesos
slave
Mesos
slave
Mesos
slave
Mesos
masterMesos
master
Example: Resource Offer in a Two-Level Scheduler
mesos.apache.org/documentation/latest/mesos-architecture/
M
Master
Docker
Registry
index.docker.io
Local
Docker
Registry
( optional )
M
M
S
S
S
S
S
S
marathon
docker
docker
docker
Mesos
master servers
Mesos
slave servers
Marathon can launch and monitor
service containers from one or
more Docker registries, using
the Docker executor for Mesos
S
S
S S
S
S
…
…
…
……
…
…
Example: Docker on Mesos
mesosphere.io/2013/09/26/docker-on-mesos/
Mesos Master Server
init
|
+ mesos-master
|
+ marathon
|
Mesos Slave Server
init
|
+ docker
| |
| + lxc
| |
| + (user task, under container init system)
| |
|
+ mesos-slave
| |
| + /var/lib/mesos/executors/docker
| | |
| | + docker run …
| | |
The executor, monitored by the
Mesos slave, delegates to the
local Docker daemon for image
discovery and management. The
executor communicates with
Marathon via the Mesos master
and ensures that Docker enforces
the specified resource limitations.
Example: Docker on Mesos
mesosphere.io/2013/09/26/docker-on-mesos/
Mesos Master Server
init
|
+ mesos-master
|
+ marathon
|
Mesos Slave Server
init
|
+ docker
| |
| + lxc
| |
| + (user task, under container init system)
| |
|
+ mesos-slave
| |
| + /var/lib/mesos/executors/docker
| | |
| | + docker run …
| | |
Docker
Registry
When a user requests
a container…
Mesos, LXC, and
Docker are tied
together for launch
2
1
3
4
5
6
7
8
Example: Docker on Mesos
mesosphere.io/2013/09/26/docker-on-mesos/
Because…

Use Cases
Production Deployments (public)
Built-in /

bare metal
Hypervisors
Solaris Zones
Linux CGroups
Opposite Ends of the Spectrum, One Common Substrate
Opposite Ends of the Spectrum, One Common Substrate	

Request /

Response
Batch
Case Study: Twitter (bare metal / on premise)	

“Mesos is the cornerstone of our elastic compute infrastructure – 

it’s how we build all our new services and is critical forTwitter’s

continued success at scale. It's one of the primary keys to our

data center efficiency."	

Chris Fry, SVP Engineering	

blog.twitter.com/2013/mesos-graduates-from-apache-incubation	

wired.com/gadgetlab/2013/11/qa-with-chris-fry/	

!
• key services run in production: analytics, typeahead, ads	

• Twitter engineers rely on Mesos to build all new services	

• instead of thinking about static machines, engineers think 

about resources like CPU, memory and disk	

• allows services to scale and leverage a shared pool of 

servers across datacenters efficiently	

• reduces the time between prototyping and launching
Case Study: Airbnb (fungible cloud infrastructure)	

“We think we might be pushing data science in the field of travel 

more so than anyone has ever done before… a smaller number 

of engineers can have higher impact through automation on 

Mesos."	

Mike Curtis,VP Engineering

gigaom.com/2013/07/29/airbnb-is-engineering-itself-into-a-data...	

• improves resource management and efficiency	

• helps advance engineering strategy of building small teams 

that can move fast	

• key to letting engineers make the most of AWS-based 

infrastructure beyond just Hadoop	

• allowed company to migrate off Elastic MapReduce	

• enables use of Hadoop along with Chronos, Spark, Storm, etc.
DIY
!
!
http://elastic.mesosphere.io
!
http://mesosphere.io/learn	

!
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Master 2
NN
ZK
Master 1
NN
ZK
Master 3
NN
ZK
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Worker
DN
Elastic Mesos
Former Airbnb engineers simplify Mesos to manage data jobs in the cloud

Jordan Novet

VentureBeat (2013-11-12)

venturebeat.com/2013/11/12/former-airbnb-engineers-simplify...	

Mesosphere Adds Docker SupportTo Its Mesos-Based Operating System ForThe Data Center

Frederic Lardinois

TechCrunch (2013-09-26)

techcrunch.com/2013/09/26/mesosphere...	

Play Framework Grid Deployment with Mesos

James Ward, Flo Leibert, et al.

Typesafe blog (2013-09-19)

typesafe.com/blog/play-framework-grid...	

Mesosphere Launches Marathon Framework

Adrian Bridgwater

Dr. Dobbs (2013-09-18)

drdobbs.com/open-source/mesosphere...	

New open source tech Marathon wants to make your data center run like Google’s

Derrick Harris

GigaOM (2013-09-04)

gigaom.com/2013/09/04/new-open-source...	

Running batch and long-running, highly available service jobs on the same cluster

Ben Lorica

O’Reilly (2013-09-01)

strata.oreilly.com/2013/09/running-batch...

Resources	

Apache Mesos Project

mesos.apache.org	

Twitter

@ApacheMesos	

Mesosphere

mesosphere.io	

Tutorials

mesosphere.io/learn	

Documentation

mesos.apache.org/documentation	

2011 USENIX Research Paper

usenix.org/legacy/event/nsdi11/tech/full_papers/
Hindman_new.pdf	

Collected Notes/Archives

goo.gl/jPtTP
ありがとう

ございました
Enterprise DataWorkflows with Cascading	

O’Reilly, 2013	

shop.oreilly.com/product/
0636920028536.do
!
monthly newsletter for updates, 

events, conference summaries, etc.:	

liber118.com/pxn/
New Book:	

“Just Enough Math”, 

with Allen Day @MapR Asia	

!
advanced math for business people,

to leverage open source for Big Data	

!
galleys circulate: July 2014

Contenu connexe

Tendances

Introduction to Apache Mesos
Introduction to Apache MesosIntroduction to Apache Mesos
Introduction to Apache Mesostomasbart
 
Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015Amazon Web Services
 
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)VMware Tanzu
 
Doing Big Data for Real with Docker
Doing Big Data for Real with Docker  Doing Big Data for Real with Docker
Doing Big Data for Real with Docker Mesosphere Inc.
 
How to build an HA container orchestrator infrastructure for production – Giu...
How to build an HA container orchestrator infrastructure for production – Giu...How to build an HA container orchestrator infrastructure for production – Giu...
How to build an HA container orchestrator infrastructure for production – Giu...Codemotion
 
Building hybrid cloud with cloudify (public)
Building hybrid cloud with cloudify (public)Building hybrid cloud with cloudify (public)
Building hybrid cloud with cloudify (public)Nati Shalom
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWSAmazon Web Services
 
Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataStylight
 
Apache Superset at Airbnb
Apache Superset at AirbnbApache Superset at Airbnb
Apache Superset at AirbnbBill Liu
 
High Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudHigh Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudAccubits Technologies
 
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021ShapeBlue
 
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...OpenNebula Project
 
Trash Talk! How to Reduce Downtime by Tuning Garbage Collection
Trash Talk! How to Reduce Downtime by Tuning Garbage CollectionTrash Talk! How to Reduce Downtime by Tuning Garbage Collection
Trash Talk! How to Reduce Downtime by Tuning Garbage CollectionAtlassian
 
Choosing a dev ops paas platform svccd presentation v2 for slideshare
Choosing a dev ops paas platform svccd presentation v2 for slideshareChoosing a dev ops paas platform svccd presentation v2 for slideshare
Choosing a dev ops paas platform svccd presentation v2 for slideshareJohn Mathon
 
Building clouds with apache cloudstack apache roadshow 2018
Building clouds with apache cloudstack   apache roadshow 2018Building clouds with apache cloudstack   apache roadshow 2018
Building clouds with apache cloudstack apache roadshow 2018ShapeBlue
 
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CS
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CSBetter, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CS
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CSJohn Burwell
 
Boris Stoyanov - some new features in Apache cloudStack
Boris Stoyanov - some new features in Apache cloudStackBoris Stoyanov - some new features in Apache cloudStack
Boris Stoyanov - some new features in Apache cloudStackShapeBlue
 
Discover the all new Mesosphere DC/OS 1.10
Discover the all new Mesosphere DC/OS 1.10Discover the all new Mesosphere DC/OS 1.10
Discover the all new Mesosphere DC/OS 1.10Mesosphere Inc.
 

Tendances (20)

Introduction to Apache Mesos
Introduction to Apache MesosIntroduction to Apache Mesos
Introduction to Apache Mesos
 
Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015Getting Started with Big Data and HPC in the Cloud - August 2015
Getting Started with Big Data and HPC in the Cloud - August 2015
 
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)
Cloud Foundry Summit 2015: Building a Robust Cloud Foundry (HA, Security and DR)
 
Doing Big Data for Real with Docker
Doing Big Data for Real with Docker  Doing Big Data for Real with Docker
Doing Big Data for Real with Docker
 
How to build an HA container orchestrator infrastructure for production – Giu...
How to build an HA container orchestrator infrastructure for production – Giu...How to build an HA container orchestrator infrastructure for production – Giu...
How to build an HA container orchestrator infrastructure for production – Giu...
 
Building hybrid cloud with cloudify (public)
Building hybrid cloud with cloudify (public)Building hybrid cloud with cloudify (public)
Building hybrid cloud with cloudify (public)
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWS
 
HPC in the Cloud
HPC in the CloudHPC in the Cloud
HPC in the Cloud
 
Lean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big DataLean Enterprise, Microservices and Big Data
Lean Enterprise, Microservices and Big Data
 
Apache Superset at Airbnb
Apache Superset at AirbnbApache Superset at Airbnb
Apache Superset at Airbnb
 
High Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudHigh Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloud
 
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021
What’s New in CloudStack 4.15 - CloudStack European User Group Virtual, May 2021
 
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...
OpenNebulaConf2015 1.03 Private, Public, Hybrid: The Real Economics of Open S...
 
Trash Talk! How to Reduce Downtime by Tuning Garbage Collection
Trash Talk! How to Reduce Downtime by Tuning Garbage CollectionTrash Talk! How to Reduce Downtime by Tuning Garbage Collection
Trash Talk! How to Reduce Downtime by Tuning Garbage Collection
 
Choosing a dev ops paas platform svccd presentation v2 for slideshare
Choosing a dev ops paas platform svccd presentation v2 for slideshareChoosing a dev ops paas platform svccd presentation v2 for slideshare
Choosing a dev ops paas platform svccd presentation v2 for slideshare
 
HPC on AWS
HPC on AWSHPC on AWS
HPC on AWS
 
Building clouds with apache cloudstack apache roadshow 2018
Building clouds with apache cloudstack   apache roadshow 2018Building clouds with apache cloudstack   apache roadshow 2018
Building clouds with apache cloudstack apache roadshow 2018
 
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CS
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CSBetter, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CS
Better, Faster, Cheaper Infrastructure: Apache CloudStack and Riak CS
 
Boris Stoyanov - some new features in Apache cloudStack
Boris Stoyanov - some new features in Apache cloudStackBoris Stoyanov - some new features in Apache cloudStack
Boris Stoyanov - some new features in Apache cloudStack
 
Discover the all new Mesosphere DC/OS 1.10
Discover the all new Mesosphere DC/OS 1.10Discover the all new Mesosphere DC/OS 1.10
Discover the all new Mesosphere DC/OS 1.10
 

En vedette

Finite State Machines - Why the fear?
Finite State Machines - Why the fear?Finite State Machines - Why the fear?
Finite State Machines - Why the fear?OSCON Byrum
 
clearScienceStrataRx2012
clearScienceStrataRx2012clearScienceStrataRx2012
clearScienceStrataRx2012OReillyStrata
 
Code curiosity rubyconfindia 2016 talk
Code curiosity rubyconfindia 2016 talkCode curiosity rubyconfindia 2016 talk
Code curiosity rubyconfindia 2016 talkSethupathi Asokan
 
US Patriot Act OSCON2012 David Mertz
US Patriot Act OSCON2012 David MertzUS Patriot Act OSCON2012 David Mertz
US Patriot Act OSCON2012 David MertzOSCON Byrum
 
Open Data: From the Information Age to the Action Age (Keynote File)
Open Data: From the Information Age to the Action Age (Keynote File)Open Data: From the Information Age to the Action Age (Keynote File)
Open Data: From the Information Age to the Action Age (Keynote File)Tim O'Reilly
 
SlideShare's Lean Startup Journey: Lessons Learnt
SlideShare's Lean Startup Journey: Lessons LearntSlideShare's Lean Startup Journey: Lessons Learnt
SlideShare's Lean Startup Journey: Lessons LearntKapil Mohan
 
When Ruby Meets Java - The Power of Torquebox
When Ruby Meets Java - The Power of TorqueboxWhen Ruby Meets Java - The Power of Torquebox
When Ruby Meets Java - The Power of Torqueboxrockyjaiswal
 
Some Lessons for Startups (pdf with notes)
Some Lessons for Startups (pdf with notes)Some Lessons for Startups (pdf with notes)
Some Lessons for Startups (pdf with notes)Tim O'Reilly
 
Seattle Data Geeks: Hadoop and Beyond
Seattle Data Geeks: Hadoop and BeyondSeattle Data Geeks: Hadoop and Beyond
Seattle Data Geeks: Hadoop and BeyondPaco Nathan
 
Mobilité partagée, un enjeu d'innovation dans un système global de transport
Mobilité partagée, un enjeu d'innovation dans un système global de transportMobilité partagée, un enjeu d'innovation dans un système global de transport
Mobilité partagée, un enjeu d'innovation dans un système global de transportPierre-Olivier Desmurs
 
Yusuf mapping the creative industries in jordan 15 11 2012
Yusuf mapping the creative industries in jordan 15 11 2012Yusuf mapping the creative industries in jordan 15 11 2012
Yusuf mapping the creative industries in jordan 15 11 2012Yusuf Mansur
 
OPEN Silcon Valley - Clean-tech is Main-tech: How do you fit in the Green Ec...
OPEN Silcon Valley - Clean-tech is Main-tech:  How do you fit in the Green Ec...OPEN Silcon Valley - Clean-tech is Main-tech:  How do you fit in the Green Ec...
OPEN Silcon Valley - Clean-tech is Main-tech: How do you fit in the Green Ec...Shuja Keen
 
Flow Engines - Hack The Way You Work, Not The Time You Have
Flow Engines - Hack The Way You Work, Not The Time You HaveFlow Engines - Hack The Way You Work, Not The Time You Have
Flow Engines - Hack The Way You Work, Not The Time You HaveJohn V Willshire
 
Creating actionable marketo reports july, 2013
Creating actionable marketo reports   july, 2013Creating actionable marketo reports   july, 2013
Creating actionable marketo reports july, 2013Inga Romanoff
 
Pinterest for Business 101
Pinterest for Business 101Pinterest for Business 101
Pinterest for Business 101Nick Armstrong
 
Elastic Apache Mesos on Amazon EC2
Elastic Apache Mesos on Amazon EC2Elastic Apache Mesos on Amazon EC2
Elastic Apache Mesos on Amazon EC2Paco Nathan
 
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...SMART Infrastructure Facility
 

En vedette (20)

Finite State Machines - Why the fear?
Finite State Machines - Why the fear?Finite State Machines - Why the fear?
Finite State Machines - Why the fear?
 
clearScienceStrataRx2012
clearScienceStrataRx2012clearScienceStrataRx2012
clearScienceStrataRx2012
 
Code curiosity rubyconfindia 2016 talk
Code curiosity rubyconfindia 2016 talkCode curiosity rubyconfindia 2016 talk
Code curiosity rubyconfindia 2016 talk
 
US Patriot Act OSCON2012 David Mertz
US Patriot Act OSCON2012 David MertzUS Patriot Act OSCON2012 David Mertz
US Patriot Act OSCON2012 David Mertz
 
Open Data: From the Information Age to the Action Age (Keynote File)
Open Data: From the Information Age to the Action Age (Keynote File)Open Data: From the Information Age to the Action Age (Keynote File)
Open Data: From the Information Age to the Action Age (Keynote File)
 
SlideShare's Lean Startup Journey: Lessons Learnt
SlideShare's Lean Startup Journey: Lessons LearntSlideShare's Lean Startup Journey: Lessons Learnt
SlideShare's Lean Startup Journey: Lessons Learnt
 
When Ruby Meets Java - The Power of Torquebox
When Ruby Meets Java - The Power of TorqueboxWhen Ruby Meets Java - The Power of Torquebox
When Ruby Meets Java - The Power of Torquebox
 
Creative, Digital & Design Business Briefing July 2015
Creative, Digital & Design Business Briefing July 2015Creative, Digital & Design Business Briefing July 2015
Creative, Digital & Design Business Briefing July 2015
 
Some Lessons for Startups (pdf with notes)
Some Lessons for Startups (pdf with notes)Some Lessons for Startups (pdf with notes)
Some Lessons for Startups (pdf with notes)
 
Seattle Data Geeks: Hadoop and Beyond
Seattle Data Geeks: Hadoop and BeyondSeattle Data Geeks: Hadoop and Beyond
Seattle Data Geeks: Hadoop and Beyond
 
Mobilité partagée, un enjeu d'innovation dans un système global de transport
Mobilité partagée, un enjeu d'innovation dans un système global de transportMobilité partagée, un enjeu d'innovation dans un système global de transport
Mobilité partagée, un enjeu d'innovation dans un système global de transport
 
Yusuf mapping the creative industries in jordan 15 11 2012
Yusuf mapping the creative industries in jordan 15 11 2012Yusuf mapping the creative industries in jordan 15 11 2012
Yusuf mapping the creative industries in jordan 15 11 2012
 
OPEN Silcon Valley - Clean-tech is Main-tech: How do you fit in the Green Ec...
OPEN Silcon Valley - Clean-tech is Main-tech:  How do you fit in the Green Ec...OPEN Silcon Valley - Clean-tech is Main-tech:  How do you fit in the Green Ec...
OPEN Silcon Valley - Clean-tech is Main-tech: How do you fit in the Green Ec...
 
Flow Engines - Hack The Way You Work, Not The Time You Have
Flow Engines - Hack The Way You Work, Not The Time You HaveFlow Engines - Hack The Way You Work, Not The Time You Have
Flow Engines - Hack The Way You Work, Not The Time You Have
 
Bilan de mobilité
Bilan de mobilitéBilan de mobilité
Bilan de mobilité
 
Creating actionable marketo reports july, 2013
Creating actionable marketo reports   july, 2013Creating actionable marketo reports   july, 2013
Creating actionable marketo reports july, 2013
 
Stanford Ee380
Stanford Ee380Stanford Ee380
Stanford Ee380
 
Pinterest for Business 101
Pinterest for Business 101Pinterest for Business 101
Pinterest for Business 101
 
Elastic Apache Mesos on Amazon EC2
Elastic Apache Mesos on Amazon EC2Elastic Apache Mesos on Amazon EC2
Elastic Apache Mesos on Amazon EC2
 
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...
A GeoSocial Intelligence Framework for Studying & Promoting Resilience to Sea...
 

Similaire à Datacenter Computing with Apache Mesos - シリコンバレー日本人駐在員Meetup

Datacenter Computing with Apache Mesos - BigData DC
Datacenter Computing with Apache Mesos - BigData DCDatacenter Computing with Apache Mesos - BigData DC
Datacenter Computing with Apache Mesos - BigData DCPaco Nathan
 
Apache Mesos Overview and Integration
Apache Mesos Overview and IntegrationApache Mesos Overview and Integration
Apache Mesos Overview and IntegrationAlex Baretto
 
Choosing PaaS: Cisco and Open Source Options: an overview
Choosing PaaS:  Cisco and Open Source Options: an overviewChoosing PaaS:  Cisco and Open Source Options: an overview
Choosing PaaS: Cisco and Open Source Options: an overviewCisco DevNet
 
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...Ludovic Piot
 
How leading financial services organisations are winning with tech
How leading financial services organisations are winning with techHow leading financial services organisations are winning with tech
How leading financial services organisations are winning with techMongoDB
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Dean Delamont
 
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...Ambassador Labs
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterpriseBert Poller
 
Integration in the age of DevOps
Integration in the age of DevOpsIntegration in the age of DevOps
Integration in the age of DevOpsAlbert Wong
 
Tackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providersTackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providersPatrick Chanezon
 
Above the cloud joarder kamal
Above the cloud   joarder kamalAbove the cloud   joarder kamal
Above the cloud joarder kamalJoarder Kamal
 
Kubernetes solutions
Kubernetes solutionsKubernetes solutions
Kubernetes solutionsEric Cattoir
 
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...OCCIware
 
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...OW2
 
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platform
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platformOCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platform
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platformMarc Dutoo
 
Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Rahul Kumar
 
Getting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick StinematesGetting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick StinematesAtlassian
 
The challenge of application distribution - Introduction to Docker (2014 dec ...
The challenge of application distribution - Introduction to Docker (2014 dec ...The challenge of application distribution - Introduction to Docker (2014 dec ...
The challenge of application distribution - Introduction to Docker (2014 dec ...Sébastien Portebois
 
Java Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the CloudJava Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the CloudMongoDB
 

Similaire à Datacenter Computing with Apache Mesos - シリコンバレー日本人駐在員Meetup (20)

Datacenter Computing with Apache Mesos - BigData DC
Datacenter Computing with Apache Mesos - BigData DCDatacenter Computing with Apache Mesos - BigData DC
Datacenter Computing with Apache Mesos - BigData DC
 
Apache Mesos Overview and Integration
Apache Mesos Overview and IntegrationApache Mesos Overview and Integration
Apache Mesos Overview and Integration
 
Choosing PaaS: Cisco and Open Source Options: an overview
Choosing PaaS:  Cisco and Open Source Options: an overviewChoosing PaaS:  Cisco and Open Source Options: an overview
Choosing PaaS: Cisco and Open Source Options: an overview
 
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
 
How leading financial services organisations are winning with tech
How leading financial services organisations are winning with techHow leading financial services organisations are winning with tech
How leading financial services organisations are winning with tech
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
 
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...
2017 Microservices Practitioner Virtual Summit: Ancestry's Journey towards Mi...
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterprise
 
Integration in the age of DevOps
Integration in the age of DevOpsIntegration in the age of DevOps
Integration in the age of DevOps
 
Tackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providersTackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providers
 
Above the cloud joarder kamal
Above the cloud   joarder kamalAbove the cloud   joarder kamal
Above the cloud joarder kamal
 
Kubernetes solutions
Kubernetes solutionsKubernetes solutions
Kubernetes solutions
 
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...
OCCIware, an extensible, standard-based XaaS consumer platform to manage ever...
 
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...
OCCIware: Extensible and Standard-based XaaS Platform To Manage Everything in...
 
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platform
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platformOCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platform
OCCIware@POSS 2016 - an extensible, standard XaaS cloud consumer platform
 
Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos Fully fault tolerant real time data pipeline with docker and mesos
Fully fault tolerant real time data pipeline with docker and mesos
 
Getting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick StinematesGetting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick Stinemates
 
The challenge of application distribution - Introduction to Docker (2014 dec ...
The challenge of application distribution - Introduction to Docker (2014 dec ...The challenge of application distribution - Introduction to Docker (2014 dec ...
The challenge of application distribution - Introduction to Docker (2014 dec ...
 
Java Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the CloudJava Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the Cloud
 

Plus de Paco Nathan

Human in the loop: a design pattern for managing teams working with ML
Human in the loop: a design pattern for managing  teams working with MLHuman in the loop: a design pattern for managing  teams working with ML
Human in the loop: a design pattern for managing teams working with MLPaco Nathan
 
Human-in-the-loop: a design pattern for managing teams that leverage ML
Human-in-the-loop: a design pattern for managing teams that leverage MLHuman-in-the-loop: a design pattern for managing teams that leverage ML
Human-in-the-loop: a design pattern for managing teams that leverage MLPaco Nathan
 
Human-in-a-loop: a design pattern for managing teams which leverage ML
Human-in-a-loop: a design pattern for managing teams which leverage MLHuman-in-a-loop: a design pattern for managing teams which leverage ML
Human-in-a-loop: a design pattern for managing teams which leverage MLPaco Nathan
 
Humans in a loop: Jupyter notebooks as a front-end for AI
Humans in a loop: Jupyter notebooks as a front-end for AIHumans in a loop: Jupyter notebooks as a front-end for AI
Humans in a loop: Jupyter notebooks as a front-end for AIPaco Nathan
 
Humans in the loop: AI in open source and industry
Humans in the loop: AI in open source and industryHumans in the loop: AI in open source and industry
Humans in the loop: AI in open source and industryPaco Nathan
 
Computable Content
Computable ContentComputable Content
Computable ContentPaco Nathan
 
Computable Content: Lessons Learned
Computable Content: Lessons LearnedComputable Content: Lessons Learned
Computable Content: Lessons LearnedPaco Nathan
 
SF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonSF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonPaco Nathan
 
Use of standards and related issues in predictive analytics
Use of standards and related issues in predictive analyticsUse of standards and related issues in predictive analytics
Use of standards and related issues in predictive analyticsPaco Nathan
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving UpPaco Nathan
 
Data Science Reinvents Learning?
Data Science Reinvents Learning?Data Science Reinvents Learning?
Data Science Reinvents Learning?Paco Nathan
 
Jupyter for Education: Beyond Gutenberg and Erasmus
Jupyter for Education: Beyond Gutenberg and ErasmusJupyter for Education: Beyond Gutenberg and Erasmus
Jupyter for Education: Beyond Gutenberg and ErasmusPaco Nathan
 
GalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataGalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataPaco Nathan
 
Microservices, containers, and machine learning
Microservices, containers, and machine learningMicroservices, containers, and machine learning
Microservices, containers, and machine learningPaco Nathan
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesPaco Nathan
 
Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in SparkPaco Nathan
 
Apache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataApache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataPaco Nathan
 
QCon São Paulo: Real-Time Analytics with Spark Streaming
QCon São Paulo: Real-Time Analytics with Spark StreamingQCon São Paulo: Real-Time Analytics with Spark Streaming
QCon São Paulo: Real-Time Analytics with Spark StreamingPaco Nathan
 
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MoreStrata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MorePaco Nathan
 
A New Year in Data Science: ML Unpaused
A New Year in Data Science: ML UnpausedA New Year in Data Science: ML Unpaused
A New Year in Data Science: ML UnpausedPaco Nathan
 

Plus de Paco Nathan (20)

Human in the loop: a design pattern for managing teams working with ML
Human in the loop: a design pattern for managing  teams working with MLHuman in the loop: a design pattern for managing  teams working with ML
Human in the loop: a design pattern for managing teams working with ML
 
Human-in-the-loop: a design pattern for managing teams that leverage ML
Human-in-the-loop: a design pattern for managing teams that leverage MLHuman-in-the-loop: a design pattern for managing teams that leverage ML
Human-in-the-loop: a design pattern for managing teams that leverage ML
 
Human-in-a-loop: a design pattern for managing teams which leverage ML
Human-in-a-loop: a design pattern for managing teams which leverage MLHuman-in-a-loop: a design pattern for managing teams which leverage ML
Human-in-a-loop: a design pattern for managing teams which leverage ML
 
Humans in a loop: Jupyter notebooks as a front-end for AI
Humans in a loop: Jupyter notebooks as a front-end for AIHumans in a loop: Jupyter notebooks as a front-end for AI
Humans in a loop: Jupyter notebooks as a front-end for AI
 
Humans in the loop: AI in open source and industry
Humans in the loop: AI in open source and industryHumans in the loop: AI in open source and industry
Humans in the loop: AI in open source and industry
 
Computable Content
Computable ContentComputable Content
Computable Content
 
Computable Content: Lessons Learned
Computable Content: Lessons LearnedComputable Content: Lessons Learned
Computable Content: Lessons Learned
 
SF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in PythonSF Python Meetup: TextRank in Python
SF Python Meetup: TextRank in Python
 
Use of standards and related issues in predictive analytics
Use of standards and related issues in predictive analyticsUse of standards and related issues in predictive analytics
Use of standards and related issues in predictive analytics
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving Up
 
Data Science Reinvents Learning?
Data Science Reinvents Learning?Data Science Reinvents Learning?
Data Science Reinvents Learning?
 
Jupyter for Education: Beyond Gutenberg and Erasmus
Jupyter for Education: Beyond Gutenberg and ErasmusJupyter for Education: Beyond Gutenberg and Erasmus
Jupyter for Education: Beyond Gutenberg and Erasmus
 
GalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About DataGalvanizeU Seattle: Eleven Almost-Truisms About Data
GalvanizeU Seattle: Eleven Almost-Truisms About Data
 
Microservices, containers, and machine learning
Microservices, containers, and machine learningMicroservices, containers, and machine learning
Microservices, containers, and machine learning
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communities
 
Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in Spark
 
Apache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataApache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big Data
 
QCon São Paulo: Real-Time Analytics with Spark Streaming
QCon São Paulo: Real-Time Analytics with Spark StreamingQCon São Paulo: Real-Time Analytics with Spark Streaming
QCon São Paulo: Real-Time Analytics with Spark Streaming
 
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MoreStrata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
 
A New Year in Data Science: ML Unpaused
A New Year in Data Science: ML UnpausedA New Year in Data Science: ML Unpaused
A New Year in Data Science: ML Unpaused
 

Dernier

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Dernier (20)

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Datacenter Computing with Apache Mesos - シリコンバレー日本人駐在員Meetup

  • 1. Datacenter Computing 
 with Apache Mesos 
 シリコンバレー日本人駐在員Meetup
 2014-03-14 
 Paco Nathan 
 http://liber118.com/pxn/
 @pacoid
  • 3. ! Have you heard about 
 “data democratization” ? ? ? making data available
 throughout more of the organization
  • 4. ! Have you heard about 
 “data democratization” ? ? ? making data available
 throughout more of the organization ! Then how would you handle 
 “cluster democratization” ? ? ? making data+resources available
 throughout more of the organization
  • 5. ! Have you heard about 
 “data democratization” ? ? ? making data available
 throughout more of the organization ! Then how would you handle 
 “cluster democratization” ? ? ? making data+resources available
 throughout more of the organization In other words, 
 how to remove silos…
  • 7. Datacenter Computing Google has been doing datacenter computing for years, 
 to address the complexities of large-scale data workflows: • leveraging the modern kernel: isolation in lieu of VMs • “most (>80%) jobs are batch jobs, but the majority 
 of resources (55–80%) are allocated to service jobs” • mixed workloads, multi-tenancy • relatively high utilization rates • JVM? not so much… • reality: scheduling batch is simple; 
 scheduling services is hard/expensive
  • 8. The Modern Kernel: Top Linux Contributors… arstechnica.com/information-technology/2013/09/...
  • 9. “Return of the Borg” Return of the Borg: HowTwitter Rebuilt Google’s SecretWeapon
 Cade Metz
 wired.com/wiredenterprise/2013/03/google- borg-twitter-mesos ! The Datacenter as a Computer: An Introduction 
 to the Design ofWarehouse-Scale Machines Luiz André Barroso, Urs Hölzle research.google.com/pubs/pub35290.html ! ! 2011 GAFS Omega
 John Wilkes, et al.
 youtu.be/0ZFMlO98Jkc
  • 10. Google describes the technology… Omega: flexible, scalable schedulers for large compute clusters Malte Schwarzkopf,Andy Konwinski, Michael Abd-El-Malek, John Wilkes eurosys2013.tudos.org/wp-content/uploads/2013/paper/ Schwarzkopf.pdf
  • 11. Google describes the business case… Taming LatencyVariability
 Jeff Dean
 plus.google.com/u/0/+ResearchatGoogle/posts/C1dPhQhcDRv
  • 12. Commercial OS Cluster Schedulers ! • IBM Platform Symphony
 • Microsoft Autopilot ! 
 Arguably, some grid controllers 
 are quite notable in-category: • Univa Grid Engine (formerly SGE)
 • Condor • etc.
  • 14. Beyond Hadoop Hadoop – an open source solution for fault-tolerant parallel processing of batch jobs at scale, based on commodity hardware… however, other priorities have emerged for the analytics lifecycle: • apps require integration beyond Hadoop • multiple topologies, mixed workloads, multi-tenancy • significant disruptions in h/w cost/performance curves • higher utilization • lower latency • highly-available, long running services • more than “Just JVM” – e.g., Python growth
  • 15. Beyond Hadoop Hadoop – an open source solution for fault-tolerant parallel processing of batch jobs at scale, based on commodity hardware… however, other priorities have emerged for the • apps require integration beyond Hadoop • multiple topologies, mixed workloads, multi-tenancy • significant disruptions in h/w cost/performance curves • higher utilization • lower latency • highly-available, long running services • more than “Just JVM” – e.g., Python growth keep in mind priorities for interdisciplinary efforts, to break down silos – extending beyond a de facto “priesthood” of data engineering
  • 16.
  • 17. Mesos – open source datacenter computing a common substrate for cluster computing mesos.apache.org heterogenous assets in your datacenter or cloud 
 made available as a homogenous set of resources • top-level Apache project • scalability to 10,000s of nodes • obviates the need for virtual machines • isolation (pluggable) for CPU, RAM, I/O, FS, etc. • fault-tolerant leader election based on Zookeeper • APIs in C++, Java, Python, Go • web UI for inspecting cluster state • available for Linux, OpenSolaris, Mac OSX
  • 18. What are the costs of Virtualization? benchmark type OpenVZ improvement mixed workloads 210%-300% LAMP (related) 38%-200% I/O throughput 200%-500% response time order magnitude more pronounced 
 at higher loads
  • 19. What are the costs of Single Tenancy? 0% 25% 50% 75% 100% RAILS CPU LOAD MEMCACHED CPU LOAD 0% 25% 50% 75% 100% HADOOP CPU LOAD 0% 25% 50% 75% 100% t t 0% 25% 50% 75% 100% Rails Memcached Hadoop COMBINED CPU LOAD (RAILS, MEMCACHED, HADOOP)
  • 20. Arguments for Datacenter Computing rather than running several specialized clusters, each 
 at relatively low utilization rates, instead run many 
 mixed workloads obvious benefits are realized in terms of: • scalability, elasticity, fault tolerance, performance, utilization • reduced equipment capex, Ops overhead, etc. • reduced licensing, eliminating need forVMs or potential 
 vendor lock-in subtle benefits – arguably, more important for Enterprise IT: • reduced time for engineers to ramp up new services at scale • reduced latency between batch and services, enabling new 
 high ROI use cases • enables Dev/Test apps to run safely on a Production cluster
  • 22. Prior Practice: Dedicated Servers • low utilization rates • longer time to ramp up new services DATACENTER
  • 23. Prior Practice: Virtualization DATACENTER PROVISIONED VMS • even more machines to manage • substantial performance decrease 
 due to virtualization • VM licensing costs
  • 24. Prior Practice: Static Partitioning STATIC PARTITIONING • even more machines to manage • substantial performance decrease 
 due to virtualization • VM licensing costs • static partitioning limits elasticity DATACENTER
  • 25. MESOS Mesos: One Large Pool of Resources “We wanted people to be able to program 
 for the datacenter just like they program 
 for their laptop." ! Ben Hindman DATACENTER
  • 26. Frameworks Integrated with Mesos Continuous Integration:
 Jenkins, GitLab Big Data:
 Hadoop, Spark, Storm, Kafka, Cassandra,
 Hypertable, MPI Python workloads:
 DPark, Exelixi Meta-Frameworks / HA Services:
 Aurora, Marathon Distributed Cron:
 Chronos Containers:
 Docker
  • 27. ! Fault-tolerant distributed systems… …written in 100-300 lines of 
 C++, Java/Scala, Python, Go, etc. …building blocks, if you will ! Q: required lines of network code? A: probably none
  • 28. Kernel Apps servicesbatch Frameworks Python JVM C ++ Workloads distributed file system Chronos DFS distributed resources: CPU, RAM, I/O, FS, rack locality, etc. Cluster Storm Kafka JBoss Django RailsSharkImpalaScalding Marathon SparkHadoopMPI MySQL Mesos – architecture
  • 29. Mesos – architecture HDFS, distrib file system Mesos, distrib kernel meta-frameworks: Aurora, Marathon frameworks: Spark, Storm, MPI, Jenkins, etc. task schedulers: Chronos, etc. APIs: C++, JVM, Py, Go apps: HA services, web apps, batch jobs, scripts, etc. Linux: libcgroup, libprocess, libev, etc.
  • 30. Mesos – dynamics Mesos distrib kernel Marathon distrib init.d Chronos distrib cron distrib frameworks HA services scheduled apps
  • 31. Mesos – dynamics resource offers distributed framework Scheduler Executor Executor Executor Mesos slave Mesos slave Mesos slave distributed kernel available resources Mesos slave Mesos slave Mesos slave Mesos masterMesos master
  • 32. Example: Resource Offer in a Two-Level Scheduler mesos.apache.org/documentation/latest/mesos-architecture/
  • 33. M Master Docker Registry index.docker.io Local Docker Registry ( optional ) M M S S S S S S marathon docker docker docker Mesos master servers Mesos slave servers Marathon can launch and monitor service containers from one or more Docker registries, using the Docker executor for Mesos S S S S S S … … … …… … … Example: Docker on Mesos mesosphere.io/2013/09/26/docker-on-mesos/
  • 34. Mesos Master Server init | + mesos-master | + marathon | Mesos Slave Server init | + docker | | | + lxc | | | + (user task, under container init system) | | | + mesos-slave | | | + /var/lib/mesos/executors/docker | | | | | + docker run … | | | The executor, monitored by the Mesos slave, delegates to the local Docker daemon for image discovery and management. The executor communicates with Marathon via the Mesos master and ensures that Docker enforces the specified resource limitations. Example: Docker on Mesos mesosphere.io/2013/09/26/docker-on-mesos/
  • 35. Mesos Master Server init | + mesos-master | + marathon | Mesos Slave Server init | + docker | | | + lxc | | | + (user task, under container init system) | | | + mesos-slave | | | + /var/lib/mesos/executors/docker | | | | | + docker run … | | | Docker Registry When a user requests a container… Mesos, LXC, and Docker are tied together for launch 2 1 3 4 5 6 7 8 Example: Docker on Mesos mesosphere.io/2013/09/26/docker-on-mesos/
  • 38. Built-in /
 bare metal Hypervisors Solaris Zones Linux CGroups Opposite Ends of the Spectrum, One Common Substrate
  • 39. Opposite Ends of the Spectrum, One Common Substrate Request /
 Response Batch
  • 40. Case Study: Twitter (bare metal / on premise) “Mesos is the cornerstone of our elastic compute infrastructure – 
 it’s how we build all our new services and is critical forTwitter’s
 continued success at scale. It's one of the primary keys to our
 data center efficiency." Chris Fry, SVP Engineering blog.twitter.com/2013/mesos-graduates-from-apache-incubation wired.com/gadgetlab/2013/11/qa-with-chris-fry/ ! • key services run in production: analytics, typeahead, ads • Twitter engineers rely on Mesos to build all new services • instead of thinking about static machines, engineers think 
 about resources like CPU, memory and disk • allows services to scale and leverage a shared pool of 
 servers across datacenters efficiently • reduces the time between prototyping and launching
  • 41. Case Study: Airbnb (fungible cloud infrastructure) “We think we might be pushing data science in the field of travel 
 more so than anyone has ever done before… a smaller number 
 of engineers can have higher impact through automation on 
 Mesos." Mike Curtis,VP Engineering
 gigaom.com/2013/07/29/airbnb-is-engineering-itself-into-a-data... • improves resource management and efficiency • helps advance engineering strategy of building small teams 
 that can move fast • key to letting engineers make the most of AWS-based 
 infrastructure beyond just Hadoop • allowed company to migrate off Elastic MapReduce • enables use of Hadoop along with Chronos, Spark, Storm, etc.
  • 42. DIY
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50. Worker DN Worker DN Worker DN Worker DN Master 2 NN ZK Master 1 NN ZK Master 3 NN ZK Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Worker DN Elastic Mesos
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57. Former Airbnb engineers simplify Mesos to manage data jobs in the cloud
 Jordan Novet
 VentureBeat (2013-11-12)
 venturebeat.com/2013/11/12/former-airbnb-engineers-simplify... Mesosphere Adds Docker SupportTo Its Mesos-Based Operating System ForThe Data Center
 Frederic Lardinois
 TechCrunch (2013-09-26)
 techcrunch.com/2013/09/26/mesosphere... Play Framework Grid Deployment with Mesos
 James Ward, Flo Leibert, et al.
 Typesafe blog (2013-09-19)
 typesafe.com/blog/play-framework-grid... Mesosphere Launches Marathon Framework
 Adrian Bridgwater
 Dr. Dobbs (2013-09-18)
 drdobbs.com/open-source/mesosphere... New open source tech Marathon wants to make your data center run like Google’s
 Derrick Harris
 GigaOM (2013-09-04)
 gigaom.com/2013/09/04/new-open-source... Running batch and long-running, highly available service jobs on the same cluster
 Ben Lorica
 O’Reilly (2013-09-01)
 strata.oreilly.com/2013/09/running-batch...

  • 58. Resources Apache Mesos Project
 mesos.apache.org Twitter
 @ApacheMesos Mesosphere
 mesosphere.io Tutorials
 mesosphere.io/learn Documentation
 mesos.apache.org/documentation 2011 USENIX Research Paper
 usenix.org/legacy/event/nsdi11/tech/full_papers/ Hindman_new.pdf Collected Notes/Archives
 goo.gl/jPtTP
  • 60. Enterprise DataWorkflows with Cascading O’Reilly, 2013 shop.oreilly.com/product/ 0636920028536.do ! monthly newsletter for updates, 
 events, conference summaries, etc.: liber118.com/pxn/
  • 61. New Book: “Just Enough Math”, 
 with Allen Day @MapR Asia ! advanced math for business people,
 to leverage open source for Big Data ! galleys circulate: July 2014