SlideShare une entreprise Scribd logo
1  sur  39
Télécharger pour lire hors ligne
1 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop YARN:
State of the Union
Sanjay Radia
Founder, Chief Architect Hortonworks
Apache Hadoop PMC
2 © Hortonworks Inc. 2011–2018. All rights reserved
About the Speaker
• Sanjay Radia
• Chief Architect, Founder, Hortonworks
• Apache Hadoop PMC and Committer
• Part of the original Hadoop team at Yahoo! since 2007
• Chief Architect of Hadoop Core at Yahoo!
• Prior
• Data center automation, virtualization, Java, HA, OSs, File Systems
• Startup, Sun Microsystems, INRIA…
• Ph.D., University of Waterloo
Page 2Architecting the Future of Big Data
3 © Hortonworks Inc. 2011–2018. All rights reserved
• Introduction
• Past
• State of Union for YARN
Agenda
4 © Hortonworks Inc. 2011–2018. All rights reserved
A Brief Timeline from Past Year: GA Releases
2.8.0 2.9.0 3.0.0 3.1.0
• GPU/FPGA
• YARN Native
Service
• Placement
Constraints
• YARN Federation
• Opportunistic
Container
(Backported from 3.0)
• New YARN UI
• Timeline V2
• Global Scheduling
• Multiple Resource types
• New YARN UI
• Timeline service V2
Ever involving requirements (computation intensive, larger, services)
• Application Priority
• Reservations
• Node labels
improvements
22 March ’17 17 Nov ’17 13 Dec ‘17 02 Aug ‘182.8.4 2.9.1 3.0.3 3.1.1
8 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop 3.0/3.1
9 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
10 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
11 © Hortonworks Inc. 2011–2018. All rights reserved
• Many customers with clusters made up of large number of nodes
• Oath (Yahoo!), Twitter, LinkedIn, Microsoft, Alibaba etc.
• Previous largest clusters: 6K-8K
• Now: 50K nodes under YARN in Microsoft [1]
• Roadmap: To 100K and beyond
[1] https://azure.microsoft.com/en-us/blog/how-microsoft-drives-exabyte-analytics-on-the-world-s-largest-yarn-cluster/
Looking at the Scale!
12 © Hortonworks Inc. 2011–2018. All rights reserved
• Enables applications to scale to 100k of thousands of nodes
• Federation divides a large (10-100k nodes) cluster into smaller units called sub-clusters
• Federation negotiates with sub-clusters RM’s and provide resources to the application
• Applications can schedule tasks on any node
YARN Federation
13 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
14 © Hortonworks Inc. 2011–2018. All rights reserved
• YARN-5139
• Problems
• Current design of one-node-at-a-time allocation cycle can lead to suboptimal decisions.
• Performance and also hard to do global placement policies
• Several coarse grained locks
• With this, we improved to
• Look at several nodes at a time
• Fine grained locks
• Multiple allocator threads
• YARN scheduler can allocate 3k+ containers per second ≈ 10 mil allocations / hour!
• 10X throughput gains
• Much better placement decisions
Moving Towards Global & Fast Scheduling
16 © Hortonworks Inc. 2011–2018. All rights reserved
Better Placement Strategies (YARN-6592)
• Past
• Supported constraints in form of Node Locality
• Now YARN can support a lot more use cases
• Co-locate the allocations of a job on the same rack (affinity)
• Spread allocations across machines (anti-affinity) to minimize resource interference
• Allow up to a specific number of allocations in a node group (cardinality)
17 © Hortonworks Inc. 2011–2018. All rights reserved
Better Placement Strategies (YARN-6592)
Affinity Anti-affinity
Region
Server
Region
Server
18 © Hortonworks Inc. 2011–2018. All rights reserved
Absolute Resources Configuration in CS – YARN-5881
• The cloud model! “Give me X resources, not X%”
• Gives ability to configure Queue resources as below
<memory=24GB, vcores=20, yarn.io/gpu=2>
• Enables admins to assign different quotas of different resource-types
• No more “Single percentage value limitation for all resource-types”
root
(memory=60G, vcores=40,
gpu=10)
sales
(memory=40G, vcores=20,
gpu=6)
marketing
(memory=10G, vcores=10,
gpu=4)
engineering
(memory=10G, vcores=10,
gpu=0)
20 © Hortonworks Inc. 2011–2018. All rights reserved
Useful Scheduling Features from Recent 2.X Releases
• Application Priorities
• Queue Priorities
• Previously : Resources to least satisfied queue
• Queues can have priorities e.g. (production over dev)
21 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
22 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: UI (1)
23 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: UI (2)
24 © Hortonworks Inc. 2011–2018. All rights reserved
Timeline Service 2.0
• Application History
• Where did my container run
• Why is my app slow
• Why is failing
• What happened with my app
• Cluster History
• Which user had most utilization
• Largest App
• What happened in my cluster
• …
• Scalable & Robust Service
• Using HBase as backend for better scale
• More robust storage fault tolerance
• Migration and compatibility with v.1.5
25 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: Queue & Logs
API based queue management
Decentralized
(YARN-5734)
Improved logs
management
(YARN-4904)
Live application logs
26 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
27 © Hortonworks Inc. 2011–2018. All rights reserved
• Better Packaging model
• Lightweight mechanism for packaging and resource isolation
• Popularized and made accessible by Docker
• Native integration ++ in YARN
• Support for “Container Runtimes” in LCE: YARN-3611
• Native process runtime
• Docker runtime
• Many security/usability improves added to 3.0.x / 3.1.x
Containers
28 © Hortonworks Inc. 2011–2018. All rights reserved
Containers
• Run both with and without Docker on the same cluster
• Choose at run-time!
29 © Hortonworks Inc. 2011–2018. All rights reserved
• Apache Spark applications have a complex set of required software dependencies
• Docker on YARN helps to solve Spark package isolation issues with
• PySpark - Python versions, packages
• R - packages
Spark on Docker in YARN
31 © Hortonworks Inc. 2011–2018. All rights reserved
Hadoop Apps
YARN on YARN… Towards a Private Cloud …YCloud
YARN
MR Tez Spark
Tensor
Flow
YARN
MR Tez Spark
Spark
32 © Hortonworks Inc. 2011–2018. All rights reserved
YCloud: YARN Based Container Cloud We Use for Testing
ü Testing Hadoop on Hadoop!
33 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
34 © Hortonworks Inc. 2011–2018. All rights reserved
• YARN supported only Memory
and CPU
• Now
• A generalized vector for all resources
• Admin could add arbitrary resource types!
Resource Profiles and Custom Resource Types
• Ease of resource requesting model
using profiles for apps
Profile Memory CPU GPU
Small 2 GB 4 Cores 0 Cores
Medium 4 GB 8 Cores 0 Cores
Large 16 GB 16 Cores 4 CoresMemory
CPU
GPU
FPGA
Node Manager
35 © Hortonworks Inc. 2011–2018. All rights reserved
• Why?
• No need to setup separate clusters
• Leverage shared compute!
• Why need isolation?
• Multiple processes use the single GPU will be:
• Serialized.
• Cause OOM easily.
• GPU isolation on YARN:
• Granularity is for per-GPU device.
• Use Cgroups / Docker to enforce isolation.
GPU Support on YARN
Tensorflow 1.2
Nginx AppUbuntu 14:04
Nginx AppHost OS
GPU Base Lib v1
Volume Mount
CUDA Library 5.0
36 © Hortonworks Inc. 2011–2018. All rights reserved
• FPGA isolation on YARN: .
• Granularity is for per-FPGA device.
• Use Cgroups to enforce the isolation.
• Currently, only Intel OpenCL SDK for FPGA is supported.
• But implementation is extensible to other FPGA SDK.
FPGA on YARN
37 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
38 © Hortonworks Inc. 2011–2018. All rights reserved
Services Support in YARN
• In the past we have focused on Jobs and have an advanced fine grained scheduler
• A native YARN services framework (YARN-4692)
• Manage services and their lifecycle
• Scale up and down, restart etc
• Support for Upgrades (YARN-4726
• “Do an upgrade of my HBase app with minimal impact to end-users
• Apache Slider retired from Incubator – lessons and key code carried over to YARN
• Simplified discovery of services via DNS mechanisms: YARN-4757
• regionserver-0.hbase-app-3.hadoop.yarn.site
39 © Hortonworks Inc. 2011–2018. All rights reserved
• Applications need simple APIs
• Need to be deployable “easily”
• Simple REST API layer fronting YARN
• YARN-4793 Simplified API layer
• Spawn services & Manage them
Simplified APIs for Service Definitions
41 © Hortonworks Inc. 2011–2018. All rights reserved
LLAP on YARN
• Apache Hive LLAP is a key long running application
• Used for query processing
• Designed to run on a shared multi-tenant YARN cluster
42 © Hortonworks Inc. 2011–2018. All rights reserved
Application Timeout – YARN-3813
• Controlling running time of workloads in YARN
• Define lifetime for an application anytime for YARN to manage.
• “Give me resources for this app/service but kill it after 15 days”
Reservation-based Scheduling: If You’re Late Don’t Blame Us! - Carlo, et al. 2015
43 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop 3.2 and Beyond
44 © Hortonworks Inc. 2011–2018. All rights reserved
• Every user says “Give me 16GB for my task”, even though it’s only needed at peak
• Each node has some allocated but unutilized capacity. Use such capacity to run opportunistic tasks
• Preempt such tasks when needed
Container Overcommit (YARN-1011)
Un-allocated
Allocated
Utilized
Allocated-
but-unutilized
Node1
Guaranteed
Containers
Node1
Opportunistic
Containers
Preemption
When necessary
45 © Hortonworks Inc. 2011–2018. All rights reserved
• “Take me to a node with JDK 10”
• Node Partition vs. Node Attribute
• Partition:
• One partition for one node
• ACL
• Shares between queues
• Preemption enforced.
• Attribute:
• For container placement
• No ACL/Shares on attributes
• First-come-first-serve
Node Attributes (YARN-3409) Partition - 1 (15 nodes)
Queue-A (40%)
Queue-B (60%)
Partition - 2 (15 nodes)
A (25%)
B (25%)
C (25%)
D (25%)
Node 1
os.type=ubuntu
os.version=14.10
glibc.version=2.20
JDK.version=8u20
Node 2
os.type=RHEL
os.version=5.1
GPU.type=x86_64
JDK.version=7u20
Node 16
os.type=windows
os.version=7
JDK.version=8u20
Node 17
os.type=SUSE
os.version=12
GPU.type=i686
JDK.version=7u20
48 © Hortonworks Inc. 2011–2018. All rights reserved
Questions?

Contenu connexe

Tendances

What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4DataWorks Summit
 
Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0DataWorks Summit
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionDataWorks Summit
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?DataWorks Summit
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataWorks Summit
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at ScaleDataWorks Summit
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseDataWorks Summit
 
What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4DataWorks Summit
 
Navigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT DevelopmentNavigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT DevelopmentDataWorks Summit
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiDataWorks Summit
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?DataWorks Summit
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0DataWorks Summit
 
Curing the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging ManagerCuring the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging ManagerDataWorks Summit
 
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentHow to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentDataWorks Summit
 
Hadoop: The Unintended Benefits
Hadoop: The Unintended BenefitsHadoop: The Unintended Benefits
Hadoop: The Unintended BenefitsDataWorks Summit
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiDataWorks Summit
 

Tendances (20)

What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4
 
Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4
 
Navigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT DevelopmentNavigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT Development
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
 
Curing the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging ManagerCuring the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging Manager
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 
Deep learning 101
Deep learning 101Deep learning 101
Deep learning 101
 
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentHow to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Hadoop: The Unintended Benefits
Hadoop: The Unintended BenefitsHadoop: The Unintended Benefits
Hadoop: The Unintended Benefits
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 

Similaire à Apache Hadoop YARN: State of the Union

Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionDataWorks Summit
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storySunil Govindan
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionWangda Tan
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionDataWorks Summit
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureDataWorks Summit
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & FutureDataWorks Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureVinod Kumar Vavilapalli
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNBillie Rinaldi
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNDataWorks Summit
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive DataWorks Summit
 
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele Hakka Labs
 
Running Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache HadoopRunning Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache Hadoophitesh1892
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesDataWorks Summit
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFTDataWorks Summit
 

Similaire à Apache Hadoop YARN: State of the Union (20)

Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
 
Running Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache HadoopRunning Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache Hadoop
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practices
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFT
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 

Plus de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Plus de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Dernier

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Dernier (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Apache Hadoop YARN: State of the Union

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop YARN: State of the Union Sanjay Radia Founder, Chief Architect Hortonworks Apache Hadoop PMC
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved About the Speaker • Sanjay Radia • Chief Architect, Founder, Hortonworks • Apache Hadoop PMC and Committer • Part of the original Hadoop team at Yahoo! since 2007 • Chief Architect of Hadoop Core at Yahoo! • Prior • Data center automation, virtualization, Java, HA, OSs, File Systems • Startup, Sun Microsystems, INRIA… • Ph.D., University of Waterloo Page 2Architecting the Future of Big Data
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved • Introduction • Past • State of Union for YARN Agenda
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved A Brief Timeline from Past Year: GA Releases 2.8.0 2.9.0 3.0.0 3.1.0 • GPU/FPGA • YARN Native Service • Placement Constraints • YARN Federation • Opportunistic Container (Backported from 3.0) • New YARN UI • Timeline V2 • Global Scheduling • Multiple Resource types • New YARN UI • Timeline service V2 Ever involving requirements (computation intensive, larger, services) • Application Priority • Reservations • Node labels improvements 22 March ’17 17 Nov ’17 13 Dec ‘17 02 Aug ‘182.8.4 2.9.1 3.0.3 3.1.1
  • 5. 8 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop 3.0/3.1
  • 6. 9 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 7. 10 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 8. 11 © Hortonworks Inc. 2011–2018. All rights reserved • Many customers with clusters made up of large number of nodes • Oath (Yahoo!), Twitter, LinkedIn, Microsoft, Alibaba etc. • Previous largest clusters: 6K-8K • Now: 50K nodes under YARN in Microsoft [1] • Roadmap: To 100K and beyond [1] https://azure.microsoft.com/en-us/blog/how-microsoft-drives-exabyte-analytics-on-the-world-s-largest-yarn-cluster/ Looking at the Scale!
  • 9. 12 © Hortonworks Inc. 2011–2018. All rights reserved • Enables applications to scale to 100k of thousands of nodes • Federation divides a large (10-100k nodes) cluster into smaller units called sub-clusters • Federation negotiates with sub-clusters RM’s and provide resources to the application • Applications can schedule tasks on any node YARN Federation
  • 10. 13 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 11. 14 © Hortonworks Inc. 2011–2018. All rights reserved • YARN-5139 • Problems • Current design of one-node-at-a-time allocation cycle can lead to suboptimal decisions. • Performance and also hard to do global placement policies • Several coarse grained locks • With this, we improved to • Look at several nodes at a time • Fine grained locks • Multiple allocator threads • YARN scheduler can allocate 3k+ containers per second ≈ 10 mil allocations / hour! • 10X throughput gains • Much better placement decisions Moving Towards Global & Fast Scheduling
  • 12. 16 © Hortonworks Inc. 2011–2018. All rights reserved Better Placement Strategies (YARN-6592) • Past • Supported constraints in form of Node Locality • Now YARN can support a lot more use cases • Co-locate the allocations of a job on the same rack (affinity) • Spread allocations across machines (anti-affinity) to minimize resource interference • Allow up to a specific number of allocations in a node group (cardinality)
  • 13. 17 © Hortonworks Inc. 2011–2018. All rights reserved Better Placement Strategies (YARN-6592) Affinity Anti-affinity Region Server Region Server
  • 14. 18 © Hortonworks Inc. 2011–2018. All rights reserved Absolute Resources Configuration in CS – YARN-5881 • The cloud model! “Give me X resources, not X%” • Gives ability to configure Queue resources as below <memory=24GB, vcores=20, yarn.io/gpu=2> • Enables admins to assign different quotas of different resource-types • No more “Single percentage value limitation for all resource-types” root (memory=60G, vcores=40, gpu=10) sales (memory=40G, vcores=20, gpu=6) marketing (memory=10G, vcores=10, gpu=4) engineering (memory=10G, vcores=10, gpu=0)
  • 15. 20 © Hortonworks Inc. 2011–2018. All rights reserved Useful Scheduling Features from Recent 2.X Releases • Application Priorities • Queue Priorities • Previously : Resources to least satisfied queue • Queues can have priorities e.g. (production over dev)
  • 16. 21 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 17. 22 © Hortonworks Inc. 2011–2018. All rights reserved Usability: UI (1)
  • 18. 23 © Hortonworks Inc. 2011–2018. All rights reserved Usability: UI (2)
  • 19. 24 © Hortonworks Inc. 2011–2018. All rights reserved Timeline Service 2.0 • Application History • Where did my container run • Why is my app slow • Why is failing • What happened with my app • Cluster History • Which user had most utilization • Largest App • What happened in my cluster • … • Scalable & Robust Service • Using HBase as backend for better scale • More robust storage fault tolerance • Migration and compatibility with v.1.5
  • 20. 25 © Hortonworks Inc. 2011–2018. All rights reserved Usability: Queue & Logs API based queue management Decentralized (YARN-5734) Improved logs management (YARN-4904) Live application logs
  • 21. 26 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 22. 27 © Hortonworks Inc. 2011–2018. All rights reserved • Better Packaging model • Lightweight mechanism for packaging and resource isolation • Popularized and made accessible by Docker • Native integration ++ in YARN • Support for “Container Runtimes” in LCE: YARN-3611 • Native process runtime • Docker runtime • Many security/usability improves added to 3.0.x / 3.1.x Containers
  • 23. 28 © Hortonworks Inc. 2011–2018. All rights reserved Containers • Run both with and without Docker on the same cluster • Choose at run-time!
  • 24. 29 © Hortonworks Inc. 2011–2018. All rights reserved • Apache Spark applications have a complex set of required software dependencies • Docker on YARN helps to solve Spark package isolation issues with • PySpark - Python versions, packages • R - packages Spark on Docker in YARN
  • 25. 31 © Hortonworks Inc. 2011–2018. All rights reserved Hadoop Apps YARN on YARN… Towards a Private Cloud …YCloud YARN MR Tez Spark Tensor Flow YARN MR Tez Spark Spark
  • 26. 32 © Hortonworks Inc. 2011–2018. All rights reserved YCloud: YARN Based Container Cloud We Use for Testing ü Testing Hadoop on Hadoop!
  • 27. 33 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 28. 34 © Hortonworks Inc. 2011–2018. All rights reserved • YARN supported only Memory and CPU • Now • A generalized vector for all resources • Admin could add arbitrary resource types! Resource Profiles and Custom Resource Types • Ease of resource requesting model using profiles for apps Profile Memory CPU GPU Small 2 GB 4 Cores 0 Cores Medium 4 GB 8 Cores 0 Cores Large 16 GB 16 Cores 4 CoresMemory CPU GPU FPGA Node Manager
  • 29. 35 © Hortonworks Inc. 2011–2018. All rights reserved • Why? • No need to setup separate clusters • Leverage shared compute! • Why need isolation? • Multiple processes use the single GPU will be: • Serialized. • Cause OOM easily. • GPU isolation on YARN: • Granularity is for per-GPU device. • Use Cgroups / Docker to enforce isolation. GPU Support on YARN Tensorflow 1.2 Nginx AppUbuntu 14:04 Nginx AppHost OS GPU Base Lib v1 Volume Mount CUDA Library 5.0
  • 30. 36 © Hortonworks Inc. 2011–2018. All rights reserved • FPGA isolation on YARN: . • Granularity is for per-FPGA device. • Use Cgroups to enforce the isolation. • Currently, only Intel OpenCL SDK for FPGA is supported. • But implementation is extensible to other FPGA SDK. FPGA on YARN
  • 31. 37 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 32. 38 © Hortonworks Inc. 2011–2018. All rights reserved Services Support in YARN • In the past we have focused on Jobs and have an advanced fine grained scheduler • A native YARN services framework (YARN-4692) • Manage services and their lifecycle • Scale up and down, restart etc • Support for Upgrades (YARN-4726 • “Do an upgrade of my HBase app with minimal impact to end-users • Apache Slider retired from Incubator – lessons and key code carried over to YARN • Simplified discovery of services via DNS mechanisms: YARN-4757 • regionserver-0.hbase-app-3.hadoop.yarn.site
  • 33. 39 © Hortonworks Inc. 2011–2018. All rights reserved • Applications need simple APIs • Need to be deployable “easily” • Simple REST API layer fronting YARN • YARN-4793 Simplified API layer • Spawn services & Manage them Simplified APIs for Service Definitions
  • 34. 41 © Hortonworks Inc. 2011–2018. All rights reserved LLAP on YARN • Apache Hive LLAP is a key long running application • Used for query processing • Designed to run on a shared multi-tenant YARN cluster
  • 35. 42 © Hortonworks Inc. 2011–2018. All rights reserved Application Timeout – YARN-3813 • Controlling running time of workloads in YARN • Define lifetime for an application anytime for YARN to manage. • “Give me resources for this app/service but kill it after 15 days” Reservation-based Scheduling: If You’re Late Don’t Blame Us! - Carlo, et al. 2015
  • 36. 43 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop 3.2 and Beyond
  • 37. 44 © Hortonworks Inc. 2011–2018. All rights reserved • Every user says “Give me 16GB for my task”, even though it’s only needed at peak • Each node has some allocated but unutilized capacity. Use such capacity to run opportunistic tasks • Preempt such tasks when needed Container Overcommit (YARN-1011) Un-allocated Allocated Utilized Allocated- but-unutilized Node1 Guaranteed Containers Node1 Opportunistic Containers Preemption When necessary
  • 38. 45 © Hortonworks Inc. 2011–2018. All rights reserved • “Take me to a node with JDK 10” • Node Partition vs. Node Attribute • Partition: • One partition for one node • ACL • Shares between queues • Preemption enforced. • Attribute: • For container placement • No ACL/Shares on attributes • First-come-first-serve Node Attributes (YARN-3409) Partition - 1 (15 nodes) Queue-A (40%) Queue-B (60%) Partition - 2 (15 nodes) A (25%) B (25%) C (25%) D (25%) Node 1 os.type=ubuntu os.version=14.10 glibc.version=2.20 JDK.version=8u20 Node 2 os.type=RHEL os.version=5.1 GPU.type=x86_64 JDK.version=7u20 Node 16 os.type=windows os.version=7 JDK.version=8u20 Node 17 os.type=SUSE os.version=12 GPU.type=i686 JDK.version=7u20
  • 39. 48 © Hortonworks Inc. 2011–2018. All rights reserved Questions?