SlideShare une entreprise Scribd logo
1  sur  43
Télécharger pour lire hors ligne
hadoop123.com

Hadoop YARN
基本架构和发展趋势
董西成
email :dongxicheng@yahoo.com
微信号:hadoop-123 [口号:做微信上最好的hadoop技术传播者]

dongxicheng.org
hadoop123.com

主要内容

1

Hadoop YARN产生背景

2

Hadoop YARN基本架构

3

运行在YARN上的计算框架

4

YARN发展趋势

dongxicheng.org
hadoop123.com

主要内容

1

Hadoop YARN产生背景

2

Hadoop YARN基本架构

3

运行在YARN上的计算框架

4

YARN发展趋势

dongxicheng.org
hadoop123.com

Hadoop YARN产生背景
 直接源于MRv1在几个方面的缺陷
 扩展性受限
 单点故障
 难以支持MR之外的计算

 多计算框架各自为战,数据共享困难
 MR:离线计算框架
 Storm:实时计算框架
 Spark:内存计算框架
dongxicheng.org
hadoop123.com

Hadoop 1.0和2.0

 Hadoop 2.0由HDFS、MapReduce和YARN三个分支构成;
 HDFS:NN Federation、HA;
 MapReduce:运行在YARN上的MR;
 YARN:资源管理系统
dongxicheng.org
hadoop123.com

主要内容

1

Hadoop YARN产生背景

2

Hadoop YARN基本架构

3

运行在YARN上的计算框架

4

YARN发展趋势

dongxicheng.org
hadoop123.com

Hadoop YARN基本架构

dongxicheng.org
hadoop123.com

Hadoop YARN各模块组成
 ResourceManager





处理客户端请求
启动/监控ApplicationMaster
监控NodeManager
资源分配与调度

 NodeManager
 单个节点上的资源管理
 处理来自ResourceManager的命令
 处理来自ApplicationMaster的命令

 ApplicationMaster
 数据切分
 为应用程序申请资源,并分配给内部任务
 任务监控与容错
dongxicheng.org
hadoop123.com

Hadoop YARN运行流程分析
Task

⑥

Node
Manager
Client

Client

①

Task

②

Application
Master

③
Node
Manager

Container

⑤

④
Resource
Manager

Container

⑤
⑤

Client
Node
Manager

Task

⑥
dongxicheng.org

Container
hadoop123.com

Hadoop YARN容错
 ResourceManager
 存在单点故障;
 正在基于ZooKeeper实现HA。

 NodeManager
 失败后,RM将失败任务告诉对应的AM;
 AM决定如何处理失败的任务。

 ApplicationMaster
 失败后,由RM负责重启;
 AM需处理内部任务的容错问题;
 RMAppMaster会保存已经运行完成的Task,重启后无需重新运
行。

dongxicheng.org
hadoop123.com

Hadoop YARN调度框架
 双层调度框架
 RM将资源分配给AM
 AM将资源进一步分配给各个Task

 基于资源预留的调度策略
 资源不够时,会为Task预留,直到资源充足
 与“all or nothing”策略不同(Apache Mesos)
Job1
(MR Appcation Master)

containers
to launch
(tasks)

free
containers

JobTracker
(job1,job2,job3)

free
containers
resources

tasks

Job2
(MR Appcation Master)

YARN Cluster
YARN Cluster
(resource pool)
(resource pool)
containers
to launch
(tasks)

TaskTracker

TaskTracker

TaskTracker

TaskTracker

MRv1

containers
to launch

free
Containers
(tasks)

Job3
(MR Appcation Master)

dongxicheng.org

MRv2
hadoop123.com

Hadoop YARN资源调度器
 多类型资源调度
 采用DRF算法(论文:“Dominant Resource Fairness: Fair
Allocation of Multiple Resource Types”)
 目前支持CPU和内存两种资源

 提供多种资源调度器
 FIFO
 Fair Scheduler
 Capacity Scheduler

 多租户资源调度器
 支持资源按比例分配
 支持层级队列划分方式
 支持资源抢占

dongxicheng.org
hadoop123.com

Hadoop YARN资源隔离方案
 支持内存和CPU两种资源隔离
 内存是一种“决定生死”的资源
 CPU是一种“影响快慢”的资源

 内存隔离
 基于线程监控的方案
 基于Cgroups的方案

 CPU隔离
 默认不对CPU资源进行隔离
 基于Cgroups的方案

dongxicheng.org
hadoop123.com

Hadoop YARN资源调度语义
 支持的语义
 请求某个特定节点/机架上的特定资源量
 将某些节点加入(或移除)黑名单,不再为自己分配这些节点上
的资源
 请求归还某些资源

 不支持的语义





请求任意节点/机架上的特定资源量
请求一组或几组符合某种特质的资源
超细粒度资源
动态调整Container资源

dongxicheng.org
hadoop123.com

主要内容

1

Hadoop YARN产生背景

2

Hadoop YARN基本架构

3

运行在YARN上的计算框架

4

YARN发展趋势

dongxicheng.org
hadoop123.com

应用程序的运行模型
input

Map

input

output

Stage
1

input

Stage
1

Reduce

Stage
2
Stage
3

input

Map

Stage
4

output

Stage
N

output

Stage
2
Stage
3

dongxicheng.org

output
hadoop123.com

YARN应用程序类型
 长应用程序和短应用程序
 长应用程序
Service、HTTP Server等
 短应用程序
MR job、Spark Job等

dongxicheng.org
hadoop123.com

以YARN为核心的生态系统

dongxicheng.org
hadoop123.com

运行在YARN上的计算框架

离线计算框架:MapReduce
DAG计算框架:Tez

流式计算框架:Storm
内存计算框架:Spark
图计算框架:Giraph、GraphLib
dongxicheng.org
hadoop123.com

离线计算框架MapReduce
 将计算过程分为两个阶段,Map和Reduce
 Map 阶段并行处理输入数据
 Reduce阶段对Map结果进行汇总
 Shuffle连接Map和Reduce两个阶段
 Map Task将数据写到本地磁盘
 Reduce Task从每个Map Task上读取一份数据
 仅适合离线批处理
 具有很好的容错性和扩展性
 适合简单的批处理任务
 缺点明显
 启动开销大、过多使用磁盘导致效率低下等
dongxicheng.org
hadoop123.com

MapReduce On YARN
ResourceManager

Client
Client

Applications
Manager

1

Resource
Scheduler

2
3,8

4
Node
Manager

Node
Manager
6
Map Task
Map Task
Container

2
7

5

5

MR App
MR App
Mstr
Mstr
Container

Container 6
Reduce Task
Reduce Task

7
7

dongxicheng.org

6 Container
Map Task
Map Task
hadoop123.com

DAG计算框架Tez
 多个作业之间存在数据依赖关系,并形成一个依赖关系有
向图( Directed Acyclic Graph ),该图的计算称为
“DAG计算”
 Apache Tez:基于YARN的DAG计算框架
 运行在YARN之上,充分利用YARN的资源管理和容错等功能;
 提供了丰富的数据流(dataflow)API;
 扩展性良好的“Input-Processor-Output”运行时模型;
 动态生成物理数据流关系。
Phase 2
Map

Phase 4

Reduce
Phase 1
Phase 5

Map

Reduce

Reduce

Phase 3

dongxicheng.org
hadoop123.com

DAG计算框架Tez

DAG Job

Job1
HDFS

HDFS

Job2
HDFS

Map1

Map1

Map2

Job2

Job1

Using
Tez
Reduce12
Using
Tez

Reduce1

Reduce2
Reduce2

HDFS

Job3

HDFS
HDFS

WordCount

Job4

Top K
WordCount+Top K

dongxicheng.org

Single Job
hadoop123.com

Tez On YARN
ResourceManager

Client
Client

Applications
Manager

1

Resource
Scheduler

2
3,8

4
Node
Manager

Node
Manager
2
Vertex-A
Vertex-A
Task
Task
Container

7

5

5

DAG App
DAG App
Mstr
Mstr
Container

Container 6
Vertex-A
Vertex-A
Task
Task

7
7

dongxicheng.org

6 Container
Vertex-A
Vertex-A
Task
Task
hadoop123.com

Tez优化技术
 ApplicationMaster缓冲池
 作业提交到AMPoolServer服务上
 预启动若干个ApplicationMaster,形成一个
ApplicationMaster缓冲池
 预先启动Container
 ApplicationMaster启动时可以预先启动若干个Container
 Container重用
 任务运行完成后,ApplicationMaster不会马上注销它使
用的Container,而是将它重新分配给其他未运行的任务

dongxicheng.org
hadoop123.com

Tez应用场景
 直接编写应用程序
 Tez提供了一套通用编程接口
 适合编写有依赖关系的作业
 优化Pig、Hive等引擎
 下一代Hive:Stinger
 好处1:避免查询语句转换成过多的MapReduce作业后
产生大量不必要的网络和磁盘IO
 好处2:更加智能的任务处理引擎

dongxicheng.org
hadoop123.com

流式计算Storm
 流式(Streaming)计算,是指被处理的数据
像流水一样不断流入系统,而系统需要针对每条
数据进行实时处理和计算,并永不停止(直到用
户显式杀死进程);
 传统做法:由消息队列和消息处理者组成的实时
处理网络进行实时计算; 引自:2013中国大数据技术大会
肖康:“Storm在实时网络攻击检测和分析的应用与改进”,
PPT:http://share.csdn.net/slides/1230
缺乏自动化
缺乏健壮性
伸缩性差
 Storm出现。
dongxicheng.org
hadoop123.com

Worker

Executor

Blot-A Tasks, topology1

Executor

Spout Tasks, topology 2

Executor

Blot-1 Tasks, topology 2

Spout Tasks, topology 1

Executor

Blot-B Tasks, topology1

Executor

Blot-C Tasks, topology1

Executor

Blot-B Tasks, topology1

Worker

Executor

Executor

Blot-2 Tasks, topology 2

Executor

Blot-1 Tasks, topology 2

Worker

Worker

Spout Tasks, topology 1

Worker

Supervisor
Supervisor

Zookeeper

Supervisor

Nimbus

Executor

Worker

流式计算框架Storm

Executor

Blot-2 Tasks, topology 2

Executor

Blot-2 Tasks, topology 2

dongxicheng.org
hadoop123.com

流式计算框架Storm
Topology
Blot-A

Stream
Grouping
Blot-C

Stream
Grouping
Task
Task
Spout

Task

Task

Task

Task

Task
Blot-B

Task

Task

Hadoop MapReduce(MRv1)
JobTracker

应用程序名称
编程模型

Nimbus

TaskTracker

Supervisor

Child

系统服务

Storm

Worker

Job

Topology

Map/Reduce

Spout/Blot

Shuffle

Stream Grouping

dongxicheng.org
hadoop123.com

Storm On YARN
YARN-MapReduce
YARN-MapReduce
Client
Client

YARN-Storm
YARN-Storm
Client
Client

YARN-MPI
YARN-MPI
Client
Client

Storm ①
Submission

Resource
Manager

Node
Manager
Storm
Storm
Client
Client
⑤

Storm Application
Master ③
Nimbus Web UI

so
Re

② st
e
equ
eR ④
rc
u

④

④

Node
Manager

Node
Manager
Container
Storm
Storm
Supervisior
Supervisior
Container

⑤
Storm
Storm
Client
Client

Zookeeper

dongxicheng.org

MR App
MR App
Mstr
Mstr

Map Task
Map Task

Node
Manager

……
Storm
Storm
Supervisior
Supervisior
hadoop123.com

内存计算框架Spark
 克服MapReduce在迭代式计算和交互式计算方
面的不足;
 引入RDD(Resilient Distributed
Datasets)数据表示模型;
 RDD是一个有容错机制,可以被并行操作的数据
集合,能够被缓存到内存或磁盘上。
引自: “基于Spark on Yarn的淘宝数据挖掘平台”,
PPT:http://vdisk.weibo.com/s/dn9q7A_XuVrf

dongxicheng.org
hadoop123.com

内存计算框架Spark

dongxicheng.org
hadoop123.com

Spark On YARN
YARN-Spark
YARN-Spark
Client
Client

YARN-MapReduce
YARN-MapReduce
Client
Client

YARN-MPI
YARN-MPI
Client
Client

Spark
Submission

Resource
Manager

Node
Manager

so
Re

st
que
Re
e
ur c
Node
Manager

Node
Manager
Container

Spark Application Master
Cluster
Web UI
Scheduler

StandaloneExecutorBackend
StandaloneExecutorBackend

MR App
MR App
Mstr
Mstr

Node
Manager

……
StandaloneExecutorBackend
StandaloneExecutorBackend

Container

Map Task
Map Task

dongxicheng.org
hadoop123.com

Spark生态系统

dongxicheng.org
hadoop123.com

其他Framework On YARN
 Hoya:HBase on YARN ;
https://github.com/hortonworks/hoya/

 LLAMA:Impala On YARN
http://cloudera.github.io/llama/

 Kafka On YARN
https://github.com/kkasravi/kafka-yarn

dongxicheng.org
hadoop123.com

主要内容

1

Hadoop YARN产生背景

2

Hadoop YARN基本架构

3

运行在YARN上的计算框架

4

YARN发展趋势

dongxicheng.org
hadoop123.com

资源管理系统带来的好处
 提高集群资源利用率

 服务自动化部署
bin/hadoop install hbase -version 0.95.0 -slaves 10

InstallerApp
Master
zookeeper

YARN(资源管理系统)

zookeeper

zookeeper

HDFS2(分布式存储系统)

software
codebase

HBase
|__0.92.0
|__0.94.0
|__0.95.0
……

Storm
|__0.7.0
|__0.8.0
MYSQL
|__5.5.25
|__5.6.12
……

dongxicheng.org
hadoop123.com

资源管理系统发展趋势
双层调度

集中式调度

Scheduling
logic

Scheduling
logic

共享状态调度
Scheduling
logic

Scheduling
logic

Scheduling
logic

subset
full state

Cluster
Machines
No
concurrency

pessimistic
concurrency
(offers)

optimistic
concurrency
(transactions)

引自:Google论文“Omega: flexible, scalable schedulers for large
compute clusters”

dongxicheng.org
hadoop123.com

YARN自身的完善
 调度框架的完善:YARN-896
支持更多的资源类型(网络、磁盘等)
支持更多的调度语义
 长作业的在线升级
Storm在线升级等
Container资源动态调整:YARN-1197
 容错机制
ResourceManager自身容错
NodeManager宕掉,任务不受影响
ApplicationMaster个性化容错
dongxicheng.org
hadoop123.com

以YARN为核心的生态系统
 YARN使得Hadoop生态系统更加强大
吸纳其他开源计算框架,比Storm、Spark等
变为集群管理者
MapReduce

Tez

Storm

Spark

Giraph

YARN(资源管理系统)

HDFS2(分布式存储系统)

dongxicheng.org

HBase

OpenMPI

……
hadoop123.com

Hadoop技术内幕

dongxicheng.org
hadoop123.com

引用链接
 Hadoop;
http://hadoop.apache.org/
 Tez
http://tez.incubator.apache.org/
 Storm
http://storm-project.net/
 Storm On YARN
https://github.com/yahoo/storm-yarn
 Spark
http://spark.incubator.apache.org/
 Spark On YARN
https://github.com/apache/incubatorspark/tree/branch-0.8/yarn

dongxicheng.org
hadoop123.com

【联系我】
邮 箱:dongxicheng@yahoo.com
微信号:hadoop-123(每天一篇hadoop文章)
微博ID:西成懂
博 客:dongxicheng.org
书 籍:hadoop123.com

Contenu connexe

Tendances

2016-07-12 Introduction to Big Data Platform Security
2016-07-12 Introduction to Big Data Platform Security2016-07-12 Introduction to Big Data Platform Security
2016-07-12 Introduction to Big Data Platform SecurityJazz Yao-Tsung Wang
 
Hadoop 2.0 之古往今來
Hadoop 2.0 之古往今來Hadoop 2.0 之古往今來
Hadoop 2.0 之古往今來Wei-Yu Chen
 
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況2014-10-17 探析台灣巨量資料產業供應鏈串聯現況
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況Jazz Yao-Tsung Wang
 
2006-11-16 RFID and OSS for Agriculture
2006-11-16 RFID and OSS for Agriculture2006-11-16 RFID and OSS for Agriculture
2006-11-16 RFID and OSS for AgricultureJazz Yao-Tsung Wang
 
MapReduce 簡單介紹與練習
MapReduce 簡單介紹與練習MapReduce 簡單介紹與練習
MapReduce 簡單介紹與練習孜羲 顏
 
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...Etu Solution
 
Big Data Projet Management the Body of Knowledge (BDPMBOK)
Big Data Projet Management the Body of Knowledge (BDPMBOK)Big Data Projet Management the Body of Knowledge (BDPMBOK)
Big Data Projet Management the Body of Knowledge (BDPMBOK)Jazz Yao-Tsung Wang
 
Hadoop Map Reduce 程式設計
Hadoop Map Reduce 程式設計Hadoop Map Reduce 程式設計
Hadoop Map Reduce 程式設計Wei-Yu Chen
 
Hadoop 生態系十年回顧與未來展望
Hadoop 生態系十年回顧與未來展望Hadoop 生態系十年回顧與未來展望
Hadoop 生態系十年回顧與未來展望Jazz Yao-Tsung Wang
 
淘宝Hadoop数据分析实践
淘宝Hadoop数据分析实践淘宝Hadoop数据分析实践
淘宝Hadoop数据分析实践Min Zhou
 
Hadoop con 2015 hadoop enables enterprise data lake
Hadoop con 2015   hadoop enables enterprise data lakeHadoop con 2015   hadoop enables enterprise data lake
Hadoop con 2015 hadoop enables enterprise data lakeJames Chen
 
分布式流数据实时计算平台 Iprocess
分布式流数据实时计算平台 Iprocess分布式流数据实时计算平台 Iprocess
分布式流数据实时计算平台 Iprocessbabel_qi
 
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法Jazz Yao-Tsung Wang
 
HDFS與MapReduce架構研討
HDFS與MapReduce架構研討HDFS與MapReduce架構研討
HDFS與MapReduce架構研討Billy Yang
 
Bigdata 大資料分析實務 (進階上機課程)
Bigdata 大資料分析實務 (進階上機課程)Bigdata 大資料分析實務 (進階上機課程)
Bigdata 大資料分析實務 (進階上機課程)家雋 莊
 
How to plan a hadoop cluster for testing and production environment
How to plan a hadoop cluster for testing and production environmentHow to plan a hadoop cluster for testing and production environment
How to plan a hadoop cluster for testing and production environmentAnna Yen
 

Tendances (20)

2016-07-12 Introduction to Big Data Platform Security
2016-07-12 Introduction to Big Data Platform Security2016-07-12 Introduction to Big Data Platform Security
2016-07-12 Introduction to Big Data Platform Security
 
Hadoop 2.0 之古往今來
Hadoop 2.0 之古往今來Hadoop 2.0 之古往今來
Hadoop 2.0 之古往今來
 
Hadoop 介紹 20141024
Hadoop 介紹 20141024Hadoop 介紹 20141024
Hadoop 介紹 20141024
 
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況2014-10-17 探析台灣巨量資料產業供應鏈串聯現況
2014-10-17 探析台灣巨量資料產業供應鏈串聯現況
 
2006-11-16 RFID and OSS for Agriculture
2006-11-16 RFID and OSS for Agriculture2006-11-16 RFID and OSS for Agriculture
2006-11-16 RFID and OSS for Agriculture
 
MapReduce 簡單介紹與練習
MapReduce 簡單介紹與練習MapReduce 簡單介紹與練習
MapReduce 簡單介紹與練習
 
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...
Big Data Taiwan 2014 Track1-3: Big Data, Big Challenge — Splunk 幫你解決 Big Data...
 
Big Data Projet Management the Body of Knowledge (BDPMBOK)
Big Data Projet Management the Body of Knowledge (BDPMBOK)Big Data Projet Management the Body of Knowledge (BDPMBOK)
Big Data Projet Management the Body of Knowledge (BDPMBOK)
 
Hadoop Map Reduce 程式設計
Hadoop Map Reduce 程式設計Hadoop Map Reduce 程式設計
Hadoop Map Reduce 程式設計
 
Hadoop 生態系十年回顧與未來展望
Hadoop 生態系十年回顧與未來展望Hadoop 生態系十年回顧與未來展望
Hadoop 生態系十年回顧與未來展望
 
淘宝Hadoop数据分析实践
淘宝Hadoop数据分析实践淘宝Hadoop数据分析实践
淘宝Hadoop数据分析实践
 
Hadoop hive
Hadoop hiveHadoop hive
Hadoop hive
 
Life of Big Data Technologies
Life of Big Data TechnologiesLife of Big Data Technologies
Life of Big Data Technologies
 
Hadoop con 2015 hadoop enables enterprise data lake
Hadoop con 2015   hadoop enables enterprise data lakeHadoop con 2015   hadoop enables enterprise data lake
Hadoop con 2015 hadoop enables enterprise data lake
 
分布式流数据实时计算平台 Iprocess
分布式流数据实时计算平台 Iprocess分布式流数据实时计算平台 Iprocess
分布式流数据实时计算平台 Iprocess
 
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法
2015-05-20 製造業生產歷程全方位整合查詢與探勘的規劃心法
 
Hadoop.TW : Now and Future
Hadoop.TW : Now and FutureHadoop.TW : Now and Future
Hadoop.TW : Now and Future
 
HDFS與MapReduce架構研討
HDFS與MapReduce架構研討HDFS與MapReduce架構研討
HDFS與MapReduce架構研討
 
Bigdata 大資料分析實務 (進階上機課程)
Bigdata 大資料分析實務 (進階上機課程)Bigdata 大資料分析實務 (進階上機課程)
Bigdata 大資料分析實務 (進階上機課程)
 
How to plan a hadoop cluster for testing and production environment
How to plan a hadoop cluster for testing and production environmentHow to plan a hadoop cluster for testing and production environment
How to plan a hadoop cluster for testing and production environment
 

En vedette

Apache Spark Workshop, Apr. 2016, Euangelos Linardos
Apache Spark Workshop, Apr. 2016, Euangelos LinardosApache Spark Workshop, Apr. 2016, Euangelos Linardos
Apache Spark Workshop, Apr. 2016, Euangelos LinardosEuangelos Linardos
 
Big Data Programming Using Hadoop Workshop
Big Data Programming Using Hadoop WorkshopBig Data Programming Using Hadoop Workshop
Big Data Programming Using Hadoop WorkshopIMC Institute
 
How to Upgrade Hundreds or Thousands of Databases
How to Upgrade Hundreds or Thousands of DatabasesHow to Upgrade Hundreds or Thousands of Databases
How to Upgrade Hundreds or Thousands of DatabasesGuatemala User Group
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
#MesosCon 2014: Spark on Mesos
#MesosCon 2014: Spark on Mesos#MesosCon 2014: Spark on Mesos
#MesosCon 2014: Spark on MesosPaco Nathan
 
Continous delivery with sbt
Continous delivery with sbtContinous delivery with sbt
Continous delivery with sbtWojciech Pituła
 
Hive User Meeting March 2010 - Hive Team
Hive User Meeting March 2010 - Hive TeamHive User Meeting March 2010 - Hive Team
Hive User Meeting March 2010 - Hive TeamZheng Shao
 
Introduction to Oracle Clusterware 12c
Introduction to Oracle Clusterware 12cIntroduction to Oracle Clusterware 12c
Introduction to Oracle Clusterware 12cGuatemala User Group
 
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...Zhijie Shen
 
Apache spark workshop
Apache spark workshopApache spark workshop
Apache spark workshopPawel Szulc
 
Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Edureka!
 
11. From Hadoop to Spark 1:2
11. From Hadoop to Spark 1:211. From Hadoop to Spark 1:2
11. From Hadoop to Spark 1:2Fabio Fumarola
 
Blockchain คืออะไร
Blockchain คืออะไรBlockchain คืออะไร
Blockchain คืออะไรIMC Institute
 
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopSomeshwar Kale
 
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIs
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIsLa transformacion digital en nuestra vida cotidiana. Un vistazo a las APIs
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIsGuatemala User Group
 
11. From Hadoop to Spark 2/2
11. From Hadoop to Spark 2/211. From Hadoop to Spark 2/2
11. From Hadoop to Spark 2/2Fabio Fumarola
 

En vedette (20)

Hbase an introduction
Hbase an introductionHbase an introduction
Hbase an introduction
 
Apache Spark Workshop, Apr. 2016, Euangelos Linardos
Apache Spark Workshop, Apr. 2016, Euangelos LinardosApache Spark Workshop, Apr. 2016, Euangelos Linardos
Apache Spark Workshop, Apr. 2016, Euangelos Linardos
 
Big Data Programming Using Hadoop Workshop
Big Data Programming Using Hadoop WorkshopBig Data Programming Using Hadoop Workshop
Big Data Programming Using Hadoop Workshop
 
Oracle 12 Upgrade
Oracle 12 UpgradeOracle 12 Upgrade
Oracle 12 Upgrade
 
How to Upgrade Hundreds or Thousands of Databases
How to Upgrade Hundreds or Thousands of DatabasesHow to Upgrade Hundreds or Thousands of Databases
How to Upgrade Hundreds or Thousands of Databases
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
#MesosCon 2014: Spark on Mesos
#MesosCon 2014: Spark on Mesos#MesosCon 2014: Spark on Mesos
#MesosCon 2014: Spark on Mesos
 
Continous delivery with sbt
Continous delivery with sbtContinous delivery with sbt
Continous delivery with sbt
 
Hive User Meeting March 2010 - Hive Team
Hive User Meeting March 2010 - Hive TeamHive User Meeting March 2010 - Hive Team
Hive User Meeting March 2010 - Hive Team
 
Introduction to Oracle Clusterware 12c
Introduction to Oracle Clusterware 12cIntroduction to Oracle Clusterware 12c
Introduction to Oracle Clusterware 12c
 
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...
ApacheCon North America 2014 - Apache Hadoop YARN: The Next-generation Distri...
 
Apache spark workshop
Apache spark workshopApache spark workshop
Apache spark workshop
 
Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala
 
Spark workshop
Spark workshopSpark workshop
Spark workshop
 
11. From Hadoop to Spark 1:2
11. From Hadoop to Spark 1:211. From Hadoop to Spark 1:2
11. From Hadoop to Spark 1:2
 
Blockchain คืออะไร
Blockchain คืออะไรBlockchain คืออะไร
Blockchain คืออะไร
 
Scala and spark
Scala and sparkScala and spark
Scala and spark
 
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for HadoopLearning Apache HIVE - Data Warehouse and Query Language for Hadoop
Learning Apache HIVE - Data Warehouse and Query Language for Hadoop
 
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIs
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIsLa transformacion digital en nuestra vida cotidiana. Un vistazo a las APIs
La transformacion digital en nuestra vida cotidiana. Un vistazo a las APIs
 
11. From Hadoop to Spark 2/2
11. From Hadoop to Spark 2/211. From Hadoop to Spark 2/2
11. From Hadoop to Spark 2/2
 

Similaire à Hadoop yarn 基本架构和发展趋势

大规模数据处理
大规模数据处理大规模数据处理
大规模数据处理Kay Yan
 
大规模数据处理
大规模数据处理大规模数据处理
大规模数据处理airsex
 
淘宝分布式数据处理实践
淘宝分布式数据处理实践淘宝分布式数据处理实践
淘宝分布式数据处理实践isnull
 
Hadoop与数据分析
Hadoop与数据分析Hadoop与数据分析
Hadoop与数据分析George Ang
 
Hdfs introduction
Hdfs introductionHdfs introduction
Hdfs introductionbaggioss
 
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...The Construction and Practice of Apache Pegasus in Offline and Online Scenari...
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...acelyc1112009
 
選擇正確的Solution 來建置現代化的雲端資料倉儲
選擇正確的Solution 來建置現代化的雲端資料倉儲選擇正確的Solution 來建置現代化的雲端資料倉儲
選擇正確的Solution 來建置現代化的雲端資料倉儲Herman Wu
 
Qcon2013 罗李 - hadoop在阿里
Qcon2013 罗李 - hadoop在阿里Qcon2013 罗李 - hadoop在阿里
Qcon2013 罗李 - hadoop在阿里li luo
 
Distributed Data Analytics at Taobao
Distributed Data Analytics at TaobaoDistributed Data Analytics at Taobao
Distributed Data Analytics at TaobaoMin Zhou
 
Voldemort Intro Tangfl
Voldemort Intro TangflVoldemort Intro Tangfl
Voldemort Intro Tangflfulin tang
 
Introduction of Spark by Wang Haihua
Introduction of Spark by Wang HaihuaIntroduction of Spark by Wang Haihua
Introduction of Spark by Wang HaihuaWang Haihua
 
How does Apache Pegasus used in SensorsData
How does Apache Pegasusused in SensorsDataHow does Apache Pegasusused in SensorsData
How does Apache Pegasus used in SensorsDataacelyc1112009
 
Big Java, Big Data
Big Java, Big DataBig Java, Big Data
Big Java, Big DataKuo-Chun Su
 
Hadoop大数据实践经验
Hadoop大数据实践经验Hadoop大数据实践经验
Hadoop大数据实践经验Hanborq Inc.
 
Hadoop大数据实践经验
Hadoop大数据实践经验Hadoop大数据实践经验
Hadoop大数据实践经验Schubert Zhang
 
Hadoop系统架构
Hadoop系统架构Hadoop系统架构
Hadoop系统架构ssuser220dc6
 
百度系统部分布式系统介绍 马如悦 Sacc2010
百度系统部分布式系统介绍 马如悦 Sacc2010百度系统部分布式系统介绍 马如悦 Sacc2010
百度系统部分布式系统介绍 马如悦 Sacc2010Chuanying Du
 
Hadoop系统及其关键技术
Hadoop系统及其关键技术Hadoop系统及其关键技术
Hadoop系统及其关键技术冬 陈
 
《数据库发展研究报告-解读(2023年)》.pdf
《数据库发展研究报告-解读(2023年)》.pdf《数据库发展研究报告-解读(2023年)》.pdf
《数据库发展研究报告-解读(2023年)》.pdfmarkmind
 

Similaire à Hadoop yarn 基本架构和发展趋势 (20)

大规模数据处理
大规模数据处理大规模数据处理
大规模数据处理
 
大规模数据处理
大规模数据处理大规模数据处理
大规模数据处理
 
淘宝分布式数据处理实践
淘宝分布式数据处理实践淘宝分布式数据处理实践
淘宝分布式数据处理实践
 
Hadoop与数据分析
Hadoop与数据分析Hadoop与数据分析
Hadoop与数据分析
 
Hdfs introduction
Hdfs introductionHdfs introduction
Hdfs introduction
 
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...The Construction and Practice of Apache Pegasus in Offline and Online Scenari...
The Construction and Practice of Apache Pegasus in Offline and Online Scenari...
 
選擇正確的Solution 來建置現代化的雲端資料倉儲
選擇正確的Solution 來建置現代化的雲端資料倉儲選擇正確的Solution 來建置現代化的雲端資料倉儲
選擇正確的Solution 來建置現代化的雲端資料倉儲
 
Qcon2013 罗李 - hadoop在阿里
Qcon2013 罗李 - hadoop在阿里Qcon2013 罗李 - hadoop在阿里
Qcon2013 罗李 - hadoop在阿里
 
Distributed Data Analytics at Taobao
Distributed Data Analytics at TaobaoDistributed Data Analytics at Taobao
Distributed Data Analytics at Taobao
 
Voldemort Intro Tangfl
Voldemort Intro TangflVoldemort Intro Tangfl
Voldemort Intro Tangfl
 
Introduction of Spark by Wang Haihua
Introduction of Spark by Wang HaihuaIntroduction of Spark by Wang Haihua
Introduction of Spark by Wang Haihua
 
How does Apache Pegasus used in SensorsData
How does Apache Pegasusused in SensorsDataHow does Apache Pegasusused in SensorsData
How does Apache Pegasus used in SensorsData
 
Big Java, Big Data
Big Java, Big DataBig Java, Big Data
Big Java, Big Data
 
Hadoop大数据实践经验
Hadoop大数据实践经验Hadoop大数据实践经验
Hadoop大数据实践经验
 
Hadoop大数据实践经验
Hadoop大数据实践经验Hadoop大数据实践经验
Hadoop大数据实践经验
 
Hadoop系统架构
Hadoop系统架构Hadoop系统架构
Hadoop系统架构
 
百度系统部分布式系统介绍 马如悦 Sacc2010
百度系统部分布式系统介绍 马如悦 Sacc2010百度系统部分布式系统介绍 马如悦 Sacc2010
百度系统部分布式系统介绍 马如悦 Sacc2010
 
Hadoop系统及其关键技术
Hadoop系统及其关键技术Hadoop系统及其关键技术
Hadoop系统及其关键技术
 
Erlang and HTML5
Erlang and HTML5Erlang and HTML5
Erlang and HTML5
 
《数据库发展研究报告-解读(2023年)》.pdf
《数据库发展研究报告-解读(2023年)》.pdf《数据库发展研究报告-解读(2023年)》.pdf
《数据库发展研究报告-解读(2023年)》.pdf
 

Hadoop yarn 基本架构和发展趋势