SlideShare une entreprise Scribd logo
1  sur  41
Hadoop Operations – 
Best Practices from the Field 
October 17, 2014 
Chris Nauroth 
email: cnauroth@hortonworks.com 
twitter: @cnauroth 
Suresh Srinivas 
email: suresh@hortonworks.com 
twitter: @suresh_m_s
About Us 
Chris Nauroth 
• Member of Technical Staff, Hortonworks 
– Apache Hadoop committer and PMC member 
– Major contributor to HDFS ACLs, Windows compatibility, and operability improvements 
• Hadoop user since 2010 
– Prior employment experience deploying, maintaining and using Hadoop clusters 
Suresh Srinivas 
• Architect & Founder at Hortonworks 
– Long time Apache Hadoop committer and PMC member 
– Designed and developed many key Hadoop features 
• Experience from supporting many clusters 
– Including some of the world’s largest Hadoop clusters 
© Hortonworks Inc. 2011 
Page 2 
Architecting the Future of Big Data
Agenda 
• Analysis of Hadoop Support Cases 
– Support case trends 
– Configuration 
– Documentation 
– Software Improvements 
• Key Learnings and Best Practices 
– HDFS ACLs 
– HDFS Snapshots 
– YARN Application Timeline Server 
© Hortonworks Inc. 2011 
Page 3 
Architecting the Future of Big Data
Support Cases: Setting the Context 
• Hortonworks Support 
– Multiple tiers of support contacts 
– Support engineers trained and knowledgeable across the entire Hadoop ecosystem 
– Cases may escalate to subject matter experts for depth in one particular area 
– Challenging cases may escalate to Apache committers at Hortonworks if additional expertise is required 
• Apache Community Support 
– user@hadoop.apache.org for user questions and support 
– https://issues.apache.org/jira for reporting confirmed bugs 
– Apache Hadoop users, contributors, committers and PMC members all participate actively in these forums to help 
resolve issues 
© Hortonworks Inc. 2011 
Page 4 
Architecting the Future of Big Data
Support Case Analysis Methodology 
• Inspected over 2 years of support case history across hundreds of customers 
• Broad inclusion of 29 Hadoop ecosystem and related projects 
• Multiple versions of Hadoop in deployments 
– 2 major versions: Hadoop 1.x and 2.x 
– ~3 minor versions within each major version 
– ~3 patch releases per minor version 
– ~15 total releases and updates 
• Distinct deployment environments 
– Cluster sizes ranging from 10s to 1000s of nodes 
– Different management environments and operational practices 
– Various deployment techniques: Ambari, Chef, RPMs, etc. 
© Hortonworks Inc. 2011 
Page 5 
Architecting the Future of Big Data
Support Case Trends – Cases per Month 
© Hortonworks Inc. 2011 
Page 6 
Architecting the Future of Big Data 
140 
120 
100 
80 
60 
40 
20 
0 
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 
HDFS 
Map Reduce 
YARN
Support Case Trends – Cases per Month 
• What is the spike in May 2014? 
– More users 
– More total users means more total support cases 
– More features 
– Many upgrades of existing clusters from Hadoop 1 to Hadoop 2 
– Many conversions to HA deployments 
– Many conversions to secured deployments 
– More integration 
– Many sites running separate Hadoop 1 and Hadoop 2 clusters simultaneously 
– Questions around migrating data between clusters at 2 different versions (DistCp) 
© Hortonworks Inc. 2011 
Page 7 
Architecting the Future of Big Data
Support Case Trends – Proportional Cases per Month 
© Hortonworks Inc. 2011 
Page 8 
Architecting the Future of Big Data 
1 
0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 
0 
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 
HDFS 
Map Reduce 
YARN 
Other (26 components)
Support Case Trends – Root Cause 
© Hortonworks Inc. 2011 
Page 9 
Architecting the Future of Big Data 
450 
400 
350 
300 
250 
200 
150 
100 
50 
0 
Customer Environment 
(Non HDP) 
Documentation Defect Documentation Gap Documentation Not 
Utilized 
Education - 
Configuration 
Needs Training Product Defect 
YARN 
Map Reduce 
HDFS
Support Case Trends 
• Highlights 
– Core Hadoop components (HDFS, YARN and MapReduce) are used across all deployments, and therefore 
receive proportionally more support cases than other ecosystem components. 
– Misconfiguration is the dominant root cause. 
– Documentation is a close second. 
– We are constantly improving the code to eliminate operational issues, help with diagnosis and provide increased 
visibility. 
© Hortonworks Inc. 2011 
Page 10 
Architecting the Future of Big Data
Configuration
Hardware and Cluster Sizing 
• Considerations 
–Larger clusters heal faster on nodes or disk failure 
–Machines with huge storage take longer to recover 
–More racks give more failure domains 
• Recommendations 
– Get good-quality commodity hardware 
– Buy the sweet-spot in pricing: 3TB disk, 96GB, 8-12 cores 
– More memory is better – real time is memory hungry! 
– Before considering fatter machines (1U 6 disks vs. 2U 12 disks) 
– Get to 30-40 machines or 3-4 racks 
–Use pilot cluster to learn about load patterns 
– Balanced hardware for I/O, compute or memory bound 
– More details - http://tinyurl.com/hwx-hadoop-hw 
© Hortonworks Inc. 2011 
Page 12
Configuration 
• Avoid JVM issues 
– Use 64 bit JVM for all daemons 
– Compressed OOPS enabled by default (6 u23 and later) 
– Java heap size 
– Set same max and starting heapsize, Xmx == Xms 
– Avoid java defaults – configure NewSize and MaxNewSize 
– Use 1/8 to 1/6 of max size for JVMs larger than 4G 
– Configure –XX:PermSize=128 MB, -XX:MaxPermSize=256 MB 
– Use low-latency GC collector 
– -XX:+UseConcMarkSweepGC, -XX:ParallelGCThreads=<N> 
– High <N> on Namenode and JobTracker or ResourceManager 
– Important JVM configs to help debugging 
– -verbose:gc -Xloggc:<file> -XX:+PrintGCDetails 
– -XX:ErrorFile=<file> 
– -XX:+HeapDumpOnOutOfMemoryError 
© Hortonworks Inc. 2011 
Page 13
Configuration 
• Multiple redundant dirs for namenode metadata 
– One of dfs.namenode.name.dir should be on NFS 
– NFS softmount - tcp,soft,intr,timeo=20,retrans=5 
• Configure open fd ulimit 
– Default 1024 is too low 
– 16K for datanodes, 64K for Master nodes 
• Use version control for configuration! 
© Hortonworks Inc. 2011 
Page 14
Configuration 
• Use disk fail in place for datanodes: dfs.datanode.failed.volumes.tolerated 
– Disk failure is no longer datanode failure 
– Especially important for large density nodes 
• Set dfs.namenode.name.dir.restore to true 
– Restores NN storage directory during checkpointing 
• Take periodic backups of namenode metadata 
– Make copies of the entire storage directory 
• Set aside a lot of disk space for NN logs 
– It is verbose – set aside multiple GBs 
– Many installs configure this too small 
– NN logs roll with in minutes – hard to debug issues 
© Hortonworks Inc. 2011 
Page 15
Monitor Usage 
• Cluster storage, nodes, files, blocks grows 
– Update NN heap, handler count, number of DN xceivers 
– Tweak other related config periodically 
• Monitor the hardware usage for your work load 
– Disk I/O, network I/O, CPU and memory usage 
– Use this information when expanding cluster capacity 
• Monitor the usage with HADOOP metrics 
– JVM metrics – GC times, Memory used, Thread Status 
– RPC metrics – especially latency to track slowdowns 
–HDFS metrics 
– Used storage, # of files and blocks, total load on the cluster 
– File System operations 
– MapReduce Metrics 
– Slot utilization and Job status 
• Tweak configurations during upgrades/maintenance on an ongoing basis 
© Hortonworks Inc. 2011 
Page 16
Documentation
Documentation 
• Continual Investment in Documentation 
– Hortonworks Data Platform Documentation 
– http://docs.hortonworks.com/ 
– Apache Hadoop Documentation 
– http://hadoop.apache.org/docs/current/ 
• Apache Hadoop Documentation 
– We welcome your requests in Apache jira for documentation improvements. 
– Create issues with the “documentation” label. 
– Getting the end user perspective is extremely valuable. 
– We would be grateful to receive documentation patches. 
– It’s a great way to get started in the Apache Hadoop open source process. 
– Search for unresolved issues with the “documentation” label. 
– https://issues.apache.org/jira/issues/?jql=project%20in%20(HDFS%2C%20HADOOP%2C%20YARN%2C%20MAPREDUC 
E)%20AND%20resolution%20%3D%20Unresolved%20AND%20labels%20%3D%20documentation 
© Hortonworks Inc. 2011 
Page 18 
Architecting the Future of Big Data
Software Improvements 
Real Incidents and Software Improvements to Address Them
Don’t edit the metadata files! 
• Editing can corrupt the cluster state 
– Might result in loss of data 
• Real incident 
– NN misconfigured to point to another NN’s metadata 
– DNs can’t register due to namespace ID mismatch 
– System detected the problem correctly 
– Safety net ignored by the admin! 
– Admin edits the namenode VERSION file to match ids 
© Hortonworks Inc. 2011 
What Happens Next? 
Page 20
Improvement 
• Pause deletion of blocks when the namenode starts up 
– https://issues.apache.org/jira/browse/HDFS-6186 
– Supports configurable delay of block deletions after NameNode startup 
– Gives an admin extra time to diagnose before deletions begin 
• Show when block deletion will start after NameNode startup in WebUI 
– https://issues.apache.org/jira/browse/HDFS-6385 
– The web UI already displays the number of pending block deletions 
– This will enhance the display to indicate when actual deletion will begin 
© Hortonworks Inc. 2011 
Page 21 
Architecting the Future of Big Data
Guard Against Accidental Deletion 
• rm –r deletes the data at the speed of Hadoop! 
– ctrl-c of the command does not stop deletion! 
– Undeleting files on datanodes is hard & time consuming 
– Immediately shutdown NN, unmount disks on datanodes 
– Recover deleted files 
– Start namenode without the delete operation in edits 
• Enable Trash 
• Real Incident 
– Customer is running a distro of Hadoop with trash not enabled 
– Deletes a large dir (100 TB) and shuts down NN immediately 
– Support person asks NN to be restarted to see if trash is enabled! 
© Hortonworks Inc. 2011 
What happens next? 
• Now HDFS has Snapshots! 
Page 22
Improvement 
• HDFS Snapshots 
– https://issues.apache.org/jira/browse/HDFS-2802 
– A snapshot is a read-only point-in-time image of part of the file system 
– A snapshot created before a deletion can be used to restore deleted data 
– More coverage of snapshots later in the presentation 
• HDFS ACLs 
– https://issues.apache.org/jira/browse/HDFS-4685 
– Finer-grained control of file permissions can help prevent an accidental deletion 
– More coverage of ACLs later in the presentation 
© Hortonworks Inc. 2011 
Page 23 
Architecting the Future of Big Data
Unexpected error during HA HDFS upgrade 
• Background: HDFS HA Architecture 
– http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html 
• Real Incident 
– During upgrade, NameNode calls every JournalNode to request backup of metadata directory, which renames 
“current” directory to “previous.tmp”. 
– Permissions incorrect on metadata directory for 1 out of 3 JournalNodes. 
– The hdfs user is not authorized to rename. Backup fails for that JournalNode, so upgrade process aborts with 
error. 
What happens next? 
© Hortonworks Inc. 2011 
Page 24 
Architecting the Future of Big Data
Improvement 
• Improve diagnostics on storage directory rename operations by using native code. 
– https://issues.apache.org/jira/browse/HDFS-7118 
– Logs additional root cause information for rename failure. For example, EACCES 
• Split error checks in into separate conditions to improve diagnostics. 
– https://issues.apache.org/jira/browse/HDFS-7119 
– Splits a log message about failure to delete or rename into separate log messages to clarify which specific action 
failed 
• When aborting NameNode or JournalNode, write the contents of the metadata directories and 
permissions to logs. 
– https://issues.apache.org/jira/browse/HDFS-7120 
– Usually the first information asked of the user, so we can automate this 
• For JournalNode operations that must succeed on all nodes, execute a pre-check to verify that 
the operation can succeed. 
– https://issues.apache.org/jira/browse/HDFS-7121 
– Prevents need for manual cleanup on 2 out of 3 JournalNodes where backup succeeded 
© Hortonworks Inc. 2011 
Page 25 
Architecting the Future of Big Data
Support Case Trends 
• Highlights Revisited 
– Core Hadoop components (HDFS, YARN and MapReduce) are used across almost all deployments, and 
therefore receive proportionally more support cases than other ecosystem components. 
– Action: Focus efforts on core Hadoop first to improve operability of the platform. 
– Misconfiguration is the dominant root cause. 
– Action: Publish configuration best practices and advise on the need for ongoing review of configuration as cluster usage 
patterns change over time. 
– Documentation is a close second. 
– Action: Contribute frequently to product documentation, both in open source Apache Hadoop and in the distro. End user 
documentation is a gating factor for launching new features. We welcome your requests in Apache jira for documentation 
improvements, and we welcome your patches! 
– Code changes often can be implemented to eliminate an operational issue, help with diagnosis or provide 
increased visibility. 
– Action: After resolution of each support case, consider potential product improvements. For example, can logging be 
improved? Small code changes can have a big impact. 
© Hortonworks Inc. 2011 
Page 26 
Architecting the Future of Big Data
Key Learnings and Best Practices 
Features that Help Improve Production Operations
HDFS ACLs 
• Existing HDFS POSIX permissions good, but not flexible enough 
– Permission requirements may differ from the natural organizational hierarchy of users and groups. 
• HDFS ACLs augment the existing HDFS POSIX permissions model by implementing the POSIX 
ACL model. 
– An ACL (Access Control List) provides a way to set different permissions for specific named users or named 
groups, not only the file’s owner and file’s group. 
© Hortonworks Inc. 2011 
Page 28 
Architecting the Future of Big Data
HDFS File Permissions Example 
• Authorization requirements: 
– In a sales department, they would like a single user Maya (Department Manager) to 
control all modifications to sales data 
– Other members of sales department need to view the data, but can’t modify it. 
–Everyone else in the company must not be allowed to view the data. 
• Can be implemented via the following: 
Read/Write perm for user 
maya 
© Hortonworks Inc. 2011 
User 
Group 
Read perm for group sales 
File with sales data
HDFS ACLs 
• Problem 
–No longer feasible for Maya to control all modifications to the file 
– New Requirement: Maya, Diane and Clark are allowed to make modifications 
– New Requirement: New group called executives should be able to read the sales data 
–Current permissions model only allows permissions at 1 group and 1 user 
• Solution: HDFS ACLs 
–Now assign different permissions to different users and groups 
© Hortonworks Inc. 2011 
Owner 
Group 
Others 
HDFS 
Directory 
… rwx 
… rwx 
… rwx 
Group D … rwx 
Group F … rwx 
User Y … rwx
HDFS ACLs 
New Tools for ACL Management (setfacl, getfacl) 
– hdfs dfs -setfacl -m group:execs:r-- /sales-data 
– hdfs dfs -getfacl /sales-data # file: /sales-data # owner: maya # group: 
sales user::rw- group::r-- group:execs:r-- mask::r-- other::-- 
– How do you know if a directory has ACLs set? 
– hdfs dfs -ls /sales-data Found 1 items -rw-r-----+ 3 maya sales 0 
2014-03-04 16:31 /sales-data 
© Hortonworks Inc. 2011
HDFS ACLs 
Default ACLs 
–hdfs dfs -setfacl -m default:group:execs:r-x /monthly-sales-data 
–hdfs dfs -mkdir /monthly-sales-data/JAN 
–hdfs dfs –getfacl /monthly-sales-data/JAN 
– # file: /monthly-sales-data/JAN # owner: maya # group: sales user::rwx group::r-x 
group:execs:r-x mask::r-x other::--- default:user::rwx default:group::r-x 
default:group:execs:r-x default:mask::r-x default:other::--- 
© Hortonworks Inc. 2011
HDFS ACLs Best Practices 
• Start with traditional HDFS permissions to implement most permission requirements. 
• Define a smaller number of ACLs to handle exceptional cases. 
• A file with an ACL incurs an additional cost in memory in the NameNode compared to a file that 
has only traditional permissions. 
© Hortonworks Inc. 2011 
Page 33 
Architecting the Future of Big Data
HDFS Snapshots 
• HDFS Snapshots 
– A snapshot is a read-only point-in-time image of part of the file system 
– Performance: snapshot creation is instantaneous, regardless of data size or subtree depth 
– Reliability: snapshot creation is atomic 
– Scalability: snapshots do not create extra copies of data blocks 
– Useful for protecting against accidental deletion of data 
• Example: Daily Feeds 
hdfs dfs -ls /daily-feeds 
Found 5 items 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-13 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-14 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-15 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-16 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-17 
© Hortonworks Inc. 2011 
Page 34 
Architecting the Future of Big Data
HDFS Snapshots 
• Create a snapshot after each daily load 
hdfs dfsadmin -allowSnapshot /daily-feeds 
Allowing snaphot on /daily-feeds succeeded 
hdfs dfs -createSnapshot /daily-feeds snapshot-to-2014-10-17 
Created snapshot /daily-feeds/.snapshot/snapshot-to-2014-10-17 
• User accidentally deletes data for 2014-10-16 
hdfs dfs -ls /daily-feeds 
Found 4 items 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-13 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-14 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-15 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-17 
© Hortonworks Inc. 2011 
Page 35 
Architecting the Future of Big Data
HDFS Snapshots 
• Snapshots to the rescue: the data is still in the snapshot 
hdfs dfs -ls /daily-feeds/.snapshot/snapshot-to-2014-10-17 
Found 5 items 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/. 
snapshot/snapshot-to-2014-10-17/2014-10-13 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/. 
snapshot/snapshot-to-2014-10-17/2014-10-14 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. 
snapshot/snapshot-to-2014-10-17/2014-10-15 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. 
snapshot/snapshot-to-2014-10-17/2014-10-16 
drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. 
snapshot/snapshot-to-2014-10-17/2014-10-17 
• Restore data from 2014-10-16 
hdfs dfs -cp /daily-feeds/.snapshot/snapshot-to-2014-10-17/2014-10-16 /daily-feeds 
© Hortonworks Inc. 2011 
Page 36 
Architecting the Future of Big Data
YARN Application Timeline Server 
• Stores data about YARN application execution 
– Generic data 
– YARN container utilization 
– Metrics related to containers 
– Application-specific data 
– MapReduce jobs and their tasks 
– Tez DAG execution 
• Provides CLI for accessing data 
– Useful for ad-hoc queries or scripted analysis 
• Provides REST API for accessing data 
– Consumed by UI front-ends such as Apache Ambari 
© Hortonworks Inc. 2011 
Page 37 
Architecting the Future of Big Data
Querying a Map Reduce Job Entity 
curl http://127.0.0.1:8188/ws/v1/timeline/MAPREDUCE_JOB/job_1413405332088_0001 
{ 
"entity": "job_1413405332088_0001", 
"entitytype": "MAPREDUCE_JOB", 
"events": [ 
© Hortonworks Inc. 2011 
{ 
"eventinfo": { 
"FINISHED_MAPS": 2, 
"FINISHED_REDUCES": 1, 
"FINISH_TIME": 1413405349192, 
"JOB_STATUS": "SUCCEEDED" 
}, 
"eventtype": "JOB_FINISHED", 
"timestamp": 1413405349194 
} 
], 
"relatedentities": { 
"MAPREDUCE_TASK": [ 
"task_1413405332088_0001_m_000000" 
] 
}, 
"starttime": 1413405339442 
} 
Page 38 
Architecting the Future of Big Data
Querying a Map Task Entity 
curl http://127.0.0.1:8188/ws/v1/timeline/MAPREDUCE_TASK/task_1413405332088_0001_m_000000 
{ 
"entity": "task_1413405332088_0001_m_000000", 
"entitytype": "MAPREDUCE_TASK", 
"events": [ 
© Hortonworks Inc. 2011 
{ 
"eventtype": "TASK_FINISHED", 
"timestamp": 1413405345253 
}, 
{ 
"eventinfo": { 
"SPLIT_LOCATIONS": "localhost", 
"START_TIME": 1413405340255, 
"TASK_TYPE": "MAP" 
}, 
"eventtype": "TASK_STARTED", 
"timestamp": 1413405340258 
} 
], 
} 
Page 39 
Architecting the Future of Big Data
Summary 
• Configuration 
– Prevent garbage collection issues 
– Configure for redundancy 
– Retune configuration in response to metrics 
• Documentation 
– End user perspective is crucial 
– Please consider contributing to Apache Hadoop documentation 
• HDFS ACLs 
– Implement fine-grained authorization rules on files 
– Can protect against accidental file manipulations 
• HDFS Snapshots 
– Point-in-time image of part of the filesystem 
– Useful for restoring to a prior state after accidental file manipulation 
• YARN Application Timeline Server 
– Provides generic and application-specific data about YARN application execution 
– Useful for analyzing cluster usage patterns 
© Hortonworks Inc. 2011 
Page 40 
Architecting the Future of Big Data
Thank you, Q&A 
© Hortonworks Inc. 2011 
Page 41 
Resource Location 
Hardware 
Recommendations for 
Apache Hadoop 
http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.2/bk_cluster-planning-guide/ 
content/ch_hardware-recommendations.html 
Hadoop Documentation 
Issues 
https://issues.apache.org/jira/issues/?jql=project%20in%20(HDFS%2C%20HA 
DOOP%2C%20YARN%2C%20MAPREDUCE)%20AND%20resolution%20%3 
D%20Unresolved%20AND%20labels%20%3D%20documentation 
HDFS operational and 
debuggability 
improvements 
https://issues.apache.org/jira/browse/HDFS-6185 
HDFS ACLs Blog Post http://hortonworks.com/blog/hdfs-acls-fine-grained-permissions-hdfs-files-hadoop/ 
HDFS Snapshots Blog Post http://hortonworks.com/blog/protecting-your-enterprise-data-with-hdfs-snapshots/ 
YARN Timeline Server 
Documentation 
http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/ 
TimelineServer.html 
Learn more

Contenu connexe

Tendances

Hadoop Backup and Disaster Recovery
Hadoop Backup and Disaster RecoveryHadoop Backup and Disaster Recovery
Hadoop Backup and Disaster RecoveryCloudera, Inc.
 
Learn Hadoop Administration
Learn Hadoop AdministrationLearn Hadoop Administration
Learn Hadoop AdministrationEdureka!
 
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability |
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability | Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability |
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability | Edureka!
 
Introduction to Cloudera's Administrator Training for Apache Hadoop
Introduction to Cloudera's Administrator Training for Apache HadoopIntroduction to Cloudera's Administrator Training for Apache Hadoop
Introduction to Cloudera's Administrator Training for Apache HadoopCloudera, Inc.
 
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in ProductionUpgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in ProductionCloudera, Inc.
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsEsther Kundin
 
Hadoop configuration & performance tuning
Hadoop configuration & performance tuningHadoop configuration & performance tuning
Hadoop configuration & performance tuningVitthal Gogate
 
Difference between hadoop 2 vs hadoop 3
Difference between hadoop 2 vs hadoop 3Difference between hadoop 2 vs hadoop 3
Difference between hadoop 2 vs hadoop 3Manish Chopra
 
Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Wei-Chiu Chuang
 
Keep your hadoop cluster at its best! v4
Keep your hadoop cluster at its best! v4Keep your hadoop cluster at its best! v4
Keep your hadoop cluster at its best! v4Chris Nauroth
 
Storage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceStorage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceChris Nauroth
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storySunil Govindan
 
Deployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersDeployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersAmal G Jose
 
How the Internet of Things are Turning the Internet Upside Down
How the Internet of Things are Turning the Internet Upside DownHow the Internet of Things are Turning the Internet Upside Down
How the Internet of Things are Turning the Internet Upside DownDataWorks Summit
 
Hadoop 3.0 - Revolution or evolution?
Hadoop 3.0 - Revolution or evolution?Hadoop 3.0 - Revolution or evolution?
Hadoop 3.0 - Revolution or evolution?Uwe Printz
 
Big data processing meets non-volatile memory: opportunities and challenges
Big data processing meets non-volatile memory: opportunities and challenges Big data processing meets non-volatile memory: opportunities and challenges
Big data processing meets non-volatile memory: opportunities and challenges DataWorks Summit
 
Top 5 Hadoop Admin Tasks
Top 5 Hadoop Admin TasksTop 5 Hadoop Admin Tasks
Top 5 Hadoop Admin TasksEdureka!
 

Tendances (20)

Hadoop Backup and Disaster Recovery
Hadoop Backup and Disaster RecoveryHadoop Backup and Disaster Recovery
Hadoop Backup and Disaster Recovery
 
Learn Hadoop Administration
Learn Hadoop AdministrationLearn Hadoop Administration
Learn Hadoop Administration
 
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability |
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability | Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability |
Hadoop 2.0 Architecture | HDFS Federation | NameNode High Availability |
 
Introduction to Cloudera's Administrator Training for Apache Hadoop
Introduction to Cloudera's Administrator Training for Apache HadoopIntroduction to Cloudera's Administrator Training for Apache Hadoop
Introduction to Cloudera's Administrator Training for Apache Hadoop
 
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in ProductionUpgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
Upgrade Without the Headache: Best Practices for Upgrading Hadoop in Production
 
Introduction to Hadoop Administration
Introduction to Hadoop AdministrationIntroduction to Hadoop Administration
Introduction to Hadoop Administration
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
 
Cross-DC Fault-Tolerant ViewFileSystem @ Twitter
Cross-DC Fault-Tolerant ViewFileSystem @ TwitterCross-DC Fault-Tolerant ViewFileSystem @ Twitter
Cross-DC Fault-Tolerant ViewFileSystem @ Twitter
 
Hadoop configuration & performance tuning
Hadoop configuration & performance tuningHadoop configuration & performance tuning
Hadoop configuration & performance tuning
 
Difference between hadoop 2 vs hadoop 3
Difference between hadoop 2 vs hadoop 3Difference between hadoop 2 vs hadoop 3
Difference between hadoop 2 vs hadoop 3
 
Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)
 
Keep your hadoop cluster at its best! v4
Keep your hadoop cluster at its best! v4Keep your hadoop cluster at its best! v4
Keep your hadoop cluster at its best! v4
 
Big data- HDFS(2nd presentation)
Big data- HDFS(2nd presentation)Big data- HDFS(2nd presentation)
Big data- HDFS(2nd presentation)
 
Storage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceStorage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduce
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
 
Deployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersDeployment and Management of Hadoop Clusters
Deployment and Management of Hadoop Clusters
 
How the Internet of Things are Turning the Internet Upside Down
How the Internet of Things are Turning the Internet Upside DownHow the Internet of Things are Turning the Internet Upside Down
How the Internet of Things are Turning the Internet Upside Down
 
Hadoop 3.0 - Revolution or evolution?
Hadoop 3.0 - Revolution or evolution?Hadoop 3.0 - Revolution or evolution?
Hadoop 3.0 - Revolution or evolution?
 
Big data processing meets non-volatile memory: opportunities and challenges
Big data processing meets non-volatile memory: opportunities and challenges Big data processing meets non-volatile memory: opportunities and challenges
Big data processing meets non-volatile memory: opportunities and challenges
 
Top 5 Hadoop Admin Tasks
Top 5 Hadoop Admin TasksTop 5 Hadoop Admin Tasks
Top 5 Hadoop Admin Tasks
 

En vedette

Hbase schema design and sizing apache-con europe - nov 2012
Hbase schema design and sizing   apache-con europe - nov 2012Hbase schema design and sizing   apache-con europe - nov 2012
Hbase schema design and sizing apache-con europe - nov 2012Chris Huang
 
Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.Michael Zhang
 
Data Engineering Quick Guide
Data Engineering Quick GuideData Engineering Quick Guide
Data Engineering Quick GuideAsim Jalis
 
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopSuccesses, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopDataWorks Summit/Hadoop Summit
 
HBase Sizing Guide
HBase Sizing GuideHBase Sizing Guide
HBase Sizing Guidelarsgeorge
 
Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Ranjith Sekar
 
Spark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit EU talk by Shaun Klopfenstein and Neelesh ShastrySpark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit EU talk by Shaun Klopfenstein and Neelesh ShastrySpark Summit
 
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Apache Phoenix and Apache HBase: An Enterprise Grade Data WarehouseApache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Apache Phoenix and Apache HBase: An Enterprise Grade Data WarehouseJosh Elser
 
HBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!DataWorks Summit
 
Big Data mit Apache Hadoop
Big Data mit Apache HadoopBig Data mit Apache Hadoop
Big Data mit Apache HadoopAlexander Alten
 
Hadoop Einführung @codecentric
Hadoop Einführung @codecentricHadoop Einführung @codecentric
Hadoop Einführung @codecentricimalik8088
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseenissoz
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation HadoopVarun Narang
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?sudhakara st
 

En vedette (17)

Hbase schema design and sizing apache-con europe - nov 2012
Hbase schema design and sizing   apache-con europe - nov 2012Hbase schema design and sizing   apache-con europe - nov 2012
Hbase schema design and sizing apache-con europe - nov 2012
 
Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.
 
HBase Sizing Notes
HBase Sizing NotesHBase Sizing Notes
HBase Sizing Notes
 
Data Engineering Quick Guide
Data Engineering Quick GuideData Engineering Quick Guide
Data Engineering Quick Guide
 
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopSuccesses, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
 
HBase Sizing Guide
HBase Sizing GuideHBase Sizing Guide
HBase Sizing Guide
 
Hadoop and BigData - July 2016
Hadoop and BigData - July 2016Hadoop and BigData - July 2016
Hadoop and BigData - July 2016
 
Spark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit EU talk by Shaun Klopfenstein and Neelesh ShastrySpark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
 
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Apache Phoenix and Apache HBase: An Enterprise Grade Data WarehouseApache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
 
HBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a FlurryHBaseCon 2015: HBase Operations in a Flurry
HBaseCon 2015: HBase Operations in a Flurry
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!
 
Big Data mit Apache Hadoop
Big Data mit Apache HadoopBig Data mit Apache Hadoop
Big Data mit Apache Hadoop
 
Hadoop Einführung @codecentric
Hadoop Einführung @codecentricHadoop Einführung @codecentric
Hadoop Einführung @codecentric
 
Yahoo compares Storm and Spark
Yahoo compares Storm and SparkYahoo compares Storm and Spark
Yahoo compares Storm and Spark
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?
 

Similaire à Hadoop operations-2014-strata-new-york-v5

Hadoop operations-2015-hadoop-summit-san-jose-v5
Hadoop operations-2015-hadoop-summit-san-jose-v5Hadoop operations-2015-hadoop-summit-san-jose-v5
Hadoop operations-2015-hadoop-summit-san-jose-v5Chris Nauroth
 
HDFS- What is New and Future
HDFS- What is New and FutureHDFS- What is New and Future
HDFS- What is New and FutureDataWorks Summit
 
Democratizing Memory Storage
Democratizing Memory StorageDemocratizing Memory Storage
Democratizing Memory StorageDataWorks Summit
 
Hdfs 2016-hadoop-summit-san-jose-v4
Hdfs 2016-hadoop-summit-san-jose-v4Hdfs 2016-hadoop-summit-san-jose-v4
Hdfs 2016-hadoop-summit-san-jose-v4Chris Nauroth
 
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...Sumeet Singh
 
Interactive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryInteractive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryChris Nauroth
 
Hdfs 2016-hadoop-summit-dublin-v1
Hdfs 2016-hadoop-summit-dublin-v1Hdfs 2016-hadoop-summit-dublin-v1
Hdfs 2016-hadoop-summit-dublin-v1Chris Nauroth
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3tcloudcomputing-tw
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecasesudhakara st
 
Optimizing Dell PowerEdge Configurations for Hadoop
Optimizing Dell PowerEdge Configurations for HadoopOptimizing Dell PowerEdge Configurations for Hadoop
Optimizing Dell PowerEdge Configurations for HadoopMike Pittaro
 
Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014Kevin Crocker
 
Application Architectures with Hadoop - Big Data TechCon SF 2014
Application Architectures with Hadoop - Big Data TechCon SF 2014Application Architectures with Hadoop - Big Data TechCon SF 2014
Application Architectures with Hadoop - Big Data TechCon SF 2014hadooparchbook
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHortonworks
 

Similaire à Hadoop operations-2014-strata-new-york-v5 (20)

Hadoop operations-2015-hadoop-summit-san-jose-v5
Hadoop operations-2015-hadoop-summit-san-jose-v5Hadoop operations-2015-hadoop-summit-san-jose-v5
Hadoop operations-2015-hadoop-summit-san-jose-v5
 
HDFS- What is New and Future
HDFS- What is New and FutureHDFS- What is New and Future
HDFS- What is New and Future
 
Democratizing Memory Storage
Democratizing Memory StorageDemocratizing Memory Storage
Democratizing Memory Storage
 
Hdfs 2016-hadoop-summit-san-jose-v4
Hdfs 2016-hadoop-summit-san-jose-v4Hdfs 2016-hadoop-summit-san-jose-v4
Hdfs 2016-hadoop-summit-san-jose-v4
 
Evolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage SubsystemEvolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage Subsystem
 
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
 
Interactive Hadoop via Flash and Memory
Interactive Hadoop via Flash and MemoryInteractive Hadoop via Flash and Memory
Interactive Hadoop via Flash and Memory
 
HDFS: Optimization, Stabilization and Supportability
HDFS: Optimization, Stabilization and SupportabilityHDFS: Optimization, Stabilization and Supportability
HDFS: Optimization, Stabilization and Supportability
 
Hdfs 2016-hadoop-summit-dublin-v1
Hdfs 2016-hadoop-summit-dublin-v1Hdfs 2016-hadoop-summit-dublin-v1
Hdfs 2016-hadoop-summit-dublin-v1
 
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
Tcloud Computing Hadoop Family and Ecosystem Service 2013.Q3
 
Hadoop 3 in a Nutshell
Hadoop 3 in a NutshellHadoop 3 in a Nutshell
Hadoop 3 in a Nutshell
 
List of Engineering Colleges in Uttarakhand
List of Engineering Colleges in UttarakhandList of Engineering Colleges in Uttarakhand
List of Engineering Colleges in Uttarakhand
 
Hadoop.pptx
Hadoop.pptxHadoop.pptx
Hadoop.pptx
 
Hadoop.pptx
Hadoop.pptxHadoop.pptx
Hadoop.pptx
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecase
 
Optimizing Dell PowerEdge Configurations for Hadoop
Optimizing Dell PowerEdge Configurations for HadoopOptimizing Dell PowerEdge Configurations for Hadoop
Optimizing Dell PowerEdge Configurations for Hadoop
 
Hadoop ppt1
Hadoop ppt1Hadoop ppt1
Hadoop ppt1
 
Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014
 
Application Architectures with Hadoop - Big Data TechCon SF 2014
Application Architectures with Hadoop - Big Data TechCon SF 2014Application Architectures with Hadoop - Big Data TechCon SF 2014
Application Architectures with Hadoop - Big Data TechCon SF 2014
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 

Dernier

Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfryanfarris8
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfkalichargn70th171
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfonteinmasabamasaba
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxalwaysnagaraju26
 

Dernier (20)

Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide Deck
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptxBUS PASS MANGEMENT SYSTEM USING PHP.pptx
BUS PASS MANGEMENT SYSTEM USING PHP.pptx
 

Hadoop operations-2014-strata-new-york-v5

  • 1. Hadoop Operations – Best Practices from the Field October 17, 2014 Chris Nauroth email: cnauroth@hortonworks.com twitter: @cnauroth Suresh Srinivas email: suresh@hortonworks.com twitter: @suresh_m_s
  • 2. About Us Chris Nauroth • Member of Technical Staff, Hortonworks – Apache Hadoop committer and PMC member – Major contributor to HDFS ACLs, Windows compatibility, and operability improvements • Hadoop user since 2010 – Prior employment experience deploying, maintaining and using Hadoop clusters Suresh Srinivas • Architect & Founder at Hortonworks – Long time Apache Hadoop committer and PMC member – Designed and developed many key Hadoop features • Experience from supporting many clusters – Including some of the world’s largest Hadoop clusters © Hortonworks Inc. 2011 Page 2 Architecting the Future of Big Data
  • 3. Agenda • Analysis of Hadoop Support Cases – Support case trends – Configuration – Documentation – Software Improvements • Key Learnings and Best Practices – HDFS ACLs – HDFS Snapshots – YARN Application Timeline Server © Hortonworks Inc. 2011 Page 3 Architecting the Future of Big Data
  • 4. Support Cases: Setting the Context • Hortonworks Support – Multiple tiers of support contacts – Support engineers trained and knowledgeable across the entire Hadoop ecosystem – Cases may escalate to subject matter experts for depth in one particular area – Challenging cases may escalate to Apache committers at Hortonworks if additional expertise is required • Apache Community Support – user@hadoop.apache.org for user questions and support – https://issues.apache.org/jira for reporting confirmed bugs – Apache Hadoop users, contributors, committers and PMC members all participate actively in these forums to help resolve issues © Hortonworks Inc. 2011 Page 4 Architecting the Future of Big Data
  • 5. Support Case Analysis Methodology • Inspected over 2 years of support case history across hundreds of customers • Broad inclusion of 29 Hadoop ecosystem and related projects • Multiple versions of Hadoop in deployments – 2 major versions: Hadoop 1.x and 2.x – ~3 minor versions within each major version – ~3 patch releases per minor version – ~15 total releases and updates • Distinct deployment environments – Cluster sizes ranging from 10s to 1000s of nodes – Different management environments and operational practices – Various deployment techniques: Ambari, Chef, RPMs, etc. © Hortonworks Inc. 2011 Page 5 Architecting the Future of Big Data
  • 6. Support Case Trends – Cases per Month © Hortonworks Inc. 2011 Page 6 Architecting the Future of Big Data 140 120 100 80 60 40 20 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 HDFS Map Reduce YARN
  • 7. Support Case Trends – Cases per Month • What is the spike in May 2014? – More users – More total users means more total support cases – More features – Many upgrades of existing clusters from Hadoop 1 to Hadoop 2 – Many conversions to HA deployments – Many conversions to secured deployments – More integration – Many sites running separate Hadoop 1 and Hadoop 2 clusters simultaneously – Questions around migrating data between clusters at 2 different versions (DistCp) © Hortonworks Inc. 2011 Page 7 Architecting the Future of Big Data
  • 8. Support Case Trends – Proportional Cases per Month © Hortonworks Inc. 2011 Page 8 Architecting the Future of Big Data 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 HDFS Map Reduce YARN Other (26 components)
  • 9. Support Case Trends – Root Cause © Hortonworks Inc. 2011 Page 9 Architecting the Future of Big Data 450 400 350 300 250 200 150 100 50 0 Customer Environment (Non HDP) Documentation Defect Documentation Gap Documentation Not Utilized Education - Configuration Needs Training Product Defect YARN Map Reduce HDFS
  • 10. Support Case Trends • Highlights – Core Hadoop components (HDFS, YARN and MapReduce) are used across all deployments, and therefore receive proportionally more support cases than other ecosystem components. – Misconfiguration is the dominant root cause. – Documentation is a close second. – We are constantly improving the code to eliminate operational issues, help with diagnosis and provide increased visibility. © Hortonworks Inc. 2011 Page 10 Architecting the Future of Big Data
  • 12. Hardware and Cluster Sizing • Considerations –Larger clusters heal faster on nodes or disk failure –Machines with huge storage take longer to recover –More racks give more failure domains • Recommendations – Get good-quality commodity hardware – Buy the sweet-spot in pricing: 3TB disk, 96GB, 8-12 cores – More memory is better – real time is memory hungry! – Before considering fatter machines (1U 6 disks vs. 2U 12 disks) – Get to 30-40 machines or 3-4 racks –Use pilot cluster to learn about load patterns – Balanced hardware for I/O, compute or memory bound – More details - http://tinyurl.com/hwx-hadoop-hw © Hortonworks Inc. 2011 Page 12
  • 13. Configuration • Avoid JVM issues – Use 64 bit JVM for all daemons – Compressed OOPS enabled by default (6 u23 and later) – Java heap size – Set same max and starting heapsize, Xmx == Xms – Avoid java defaults – configure NewSize and MaxNewSize – Use 1/8 to 1/6 of max size for JVMs larger than 4G – Configure –XX:PermSize=128 MB, -XX:MaxPermSize=256 MB – Use low-latency GC collector – -XX:+UseConcMarkSweepGC, -XX:ParallelGCThreads=<N> – High <N> on Namenode and JobTracker or ResourceManager – Important JVM configs to help debugging – -verbose:gc -Xloggc:<file> -XX:+PrintGCDetails – -XX:ErrorFile=<file> – -XX:+HeapDumpOnOutOfMemoryError © Hortonworks Inc. 2011 Page 13
  • 14. Configuration • Multiple redundant dirs for namenode metadata – One of dfs.namenode.name.dir should be on NFS – NFS softmount - tcp,soft,intr,timeo=20,retrans=5 • Configure open fd ulimit – Default 1024 is too low – 16K for datanodes, 64K for Master nodes • Use version control for configuration! © Hortonworks Inc. 2011 Page 14
  • 15. Configuration • Use disk fail in place for datanodes: dfs.datanode.failed.volumes.tolerated – Disk failure is no longer datanode failure – Especially important for large density nodes • Set dfs.namenode.name.dir.restore to true – Restores NN storage directory during checkpointing • Take periodic backups of namenode metadata – Make copies of the entire storage directory • Set aside a lot of disk space for NN logs – It is verbose – set aside multiple GBs – Many installs configure this too small – NN logs roll with in minutes – hard to debug issues © Hortonworks Inc. 2011 Page 15
  • 16. Monitor Usage • Cluster storage, nodes, files, blocks grows – Update NN heap, handler count, number of DN xceivers – Tweak other related config periodically • Monitor the hardware usage for your work load – Disk I/O, network I/O, CPU and memory usage – Use this information when expanding cluster capacity • Monitor the usage with HADOOP metrics – JVM metrics – GC times, Memory used, Thread Status – RPC metrics – especially latency to track slowdowns –HDFS metrics – Used storage, # of files and blocks, total load on the cluster – File System operations – MapReduce Metrics – Slot utilization and Job status • Tweak configurations during upgrades/maintenance on an ongoing basis © Hortonworks Inc. 2011 Page 16
  • 18. Documentation • Continual Investment in Documentation – Hortonworks Data Platform Documentation – http://docs.hortonworks.com/ – Apache Hadoop Documentation – http://hadoop.apache.org/docs/current/ • Apache Hadoop Documentation – We welcome your requests in Apache jira for documentation improvements. – Create issues with the “documentation” label. – Getting the end user perspective is extremely valuable. – We would be grateful to receive documentation patches. – It’s a great way to get started in the Apache Hadoop open source process. – Search for unresolved issues with the “documentation” label. – https://issues.apache.org/jira/issues/?jql=project%20in%20(HDFS%2C%20HADOOP%2C%20YARN%2C%20MAPREDUC E)%20AND%20resolution%20%3D%20Unresolved%20AND%20labels%20%3D%20documentation © Hortonworks Inc. 2011 Page 18 Architecting the Future of Big Data
  • 19. Software Improvements Real Incidents and Software Improvements to Address Them
  • 20. Don’t edit the metadata files! • Editing can corrupt the cluster state – Might result in loss of data • Real incident – NN misconfigured to point to another NN’s metadata – DNs can’t register due to namespace ID mismatch – System detected the problem correctly – Safety net ignored by the admin! – Admin edits the namenode VERSION file to match ids © Hortonworks Inc. 2011 What Happens Next? Page 20
  • 21. Improvement • Pause deletion of blocks when the namenode starts up – https://issues.apache.org/jira/browse/HDFS-6186 – Supports configurable delay of block deletions after NameNode startup – Gives an admin extra time to diagnose before deletions begin • Show when block deletion will start after NameNode startup in WebUI – https://issues.apache.org/jira/browse/HDFS-6385 – The web UI already displays the number of pending block deletions – This will enhance the display to indicate when actual deletion will begin © Hortonworks Inc. 2011 Page 21 Architecting the Future of Big Data
  • 22. Guard Against Accidental Deletion • rm –r deletes the data at the speed of Hadoop! – ctrl-c of the command does not stop deletion! – Undeleting files on datanodes is hard & time consuming – Immediately shutdown NN, unmount disks on datanodes – Recover deleted files – Start namenode without the delete operation in edits • Enable Trash • Real Incident – Customer is running a distro of Hadoop with trash not enabled – Deletes a large dir (100 TB) and shuts down NN immediately – Support person asks NN to be restarted to see if trash is enabled! © Hortonworks Inc. 2011 What happens next? • Now HDFS has Snapshots! Page 22
  • 23. Improvement • HDFS Snapshots – https://issues.apache.org/jira/browse/HDFS-2802 – A snapshot is a read-only point-in-time image of part of the file system – A snapshot created before a deletion can be used to restore deleted data – More coverage of snapshots later in the presentation • HDFS ACLs – https://issues.apache.org/jira/browse/HDFS-4685 – Finer-grained control of file permissions can help prevent an accidental deletion – More coverage of ACLs later in the presentation © Hortonworks Inc. 2011 Page 23 Architecting the Future of Big Data
  • 24. Unexpected error during HA HDFS upgrade • Background: HDFS HA Architecture – http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html • Real Incident – During upgrade, NameNode calls every JournalNode to request backup of metadata directory, which renames “current” directory to “previous.tmp”. – Permissions incorrect on metadata directory for 1 out of 3 JournalNodes. – The hdfs user is not authorized to rename. Backup fails for that JournalNode, so upgrade process aborts with error. What happens next? © Hortonworks Inc. 2011 Page 24 Architecting the Future of Big Data
  • 25. Improvement • Improve diagnostics on storage directory rename operations by using native code. – https://issues.apache.org/jira/browse/HDFS-7118 – Logs additional root cause information for rename failure. For example, EACCES • Split error checks in into separate conditions to improve diagnostics. – https://issues.apache.org/jira/browse/HDFS-7119 – Splits a log message about failure to delete or rename into separate log messages to clarify which specific action failed • When aborting NameNode or JournalNode, write the contents of the metadata directories and permissions to logs. – https://issues.apache.org/jira/browse/HDFS-7120 – Usually the first information asked of the user, so we can automate this • For JournalNode operations that must succeed on all nodes, execute a pre-check to verify that the operation can succeed. – https://issues.apache.org/jira/browse/HDFS-7121 – Prevents need for manual cleanup on 2 out of 3 JournalNodes where backup succeeded © Hortonworks Inc. 2011 Page 25 Architecting the Future of Big Data
  • 26. Support Case Trends • Highlights Revisited – Core Hadoop components (HDFS, YARN and MapReduce) are used across almost all deployments, and therefore receive proportionally more support cases than other ecosystem components. – Action: Focus efforts on core Hadoop first to improve operability of the platform. – Misconfiguration is the dominant root cause. – Action: Publish configuration best practices and advise on the need for ongoing review of configuration as cluster usage patterns change over time. – Documentation is a close second. – Action: Contribute frequently to product documentation, both in open source Apache Hadoop and in the distro. End user documentation is a gating factor for launching new features. We welcome your requests in Apache jira for documentation improvements, and we welcome your patches! – Code changes often can be implemented to eliminate an operational issue, help with diagnosis or provide increased visibility. – Action: After resolution of each support case, consider potential product improvements. For example, can logging be improved? Small code changes can have a big impact. © Hortonworks Inc. 2011 Page 26 Architecting the Future of Big Data
  • 27. Key Learnings and Best Practices Features that Help Improve Production Operations
  • 28. HDFS ACLs • Existing HDFS POSIX permissions good, but not flexible enough – Permission requirements may differ from the natural organizational hierarchy of users and groups. • HDFS ACLs augment the existing HDFS POSIX permissions model by implementing the POSIX ACL model. – An ACL (Access Control List) provides a way to set different permissions for specific named users or named groups, not only the file’s owner and file’s group. © Hortonworks Inc. 2011 Page 28 Architecting the Future of Big Data
  • 29. HDFS File Permissions Example • Authorization requirements: – In a sales department, they would like a single user Maya (Department Manager) to control all modifications to sales data – Other members of sales department need to view the data, but can’t modify it. –Everyone else in the company must not be allowed to view the data. • Can be implemented via the following: Read/Write perm for user maya © Hortonworks Inc. 2011 User Group Read perm for group sales File with sales data
  • 30. HDFS ACLs • Problem –No longer feasible for Maya to control all modifications to the file – New Requirement: Maya, Diane and Clark are allowed to make modifications – New Requirement: New group called executives should be able to read the sales data –Current permissions model only allows permissions at 1 group and 1 user • Solution: HDFS ACLs –Now assign different permissions to different users and groups © Hortonworks Inc. 2011 Owner Group Others HDFS Directory … rwx … rwx … rwx Group D … rwx Group F … rwx User Y … rwx
  • 31. HDFS ACLs New Tools for ACL Management (setfacl, getfacl) – hdfs dfs -setfacl -m group:execs:r-- /sales-data – hdfs dfs -getfacl /sales-data # file: /sales-data # owner: maya # group: sales user::rw- group::r-- group:execs:r-- mask::r-- other::-- – How do you know if a directory has ACLs set? – hdfs dfs -ls /sales-data Found 1 items -rw-r-----+ 3 maya sales 0 2014-03-04 16:31 /sales-data © Hortonworks Inc. 2011
  • 32. HDFS ACLs Default ACLs –hdfs dfs -setfacl -m default:group:execs:r-x /monthly-sales-data –hdfs dfs -mkdir /monthly-sales-data/JAN –hdfs dfs –getfacl /monthly-sales-data/JAN – # file: /monthly-sales-data/JAN # owner: maya # group: sales user::rwx group::r-x group:execs:r-x mask::r-x other::--- default:user::rwx default:group::r-x default:group:execs:r-x default:mask::r-x default:other::--- © Hortonworks Inc. 2011
  • 33. HDFS ACLs Best Practices • Start with traditional HDFS permissions to implement most permission requirements. • Define a smaller number of ACLs to handle exceptional cases. • A file with an ACL incurs an additional cost in memory in the NameNode compared to a file that has only traditional permissions. © Hortonworks Inc. 2011 Page 33 Architecting the Future of Big Data
  • 34. HDFS Snapshots • HDFS Snapshots – A snapshot is a read-only point-in-time image of part of the file system – Performance: snapshot creation is instantaneous, regardless of data size or subtree depth – Reliability: snapshot creation is atomic – Scalability: snapshots do not create extra copies of data blocks – Useful for protecting against accidental deletion of data • Example: Daily Feeds hdfs dfs -ls /daily-feeds Found 5 items drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-13 drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-14 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-15 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-16 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-17 © Hortonworks Inc. 2011 Page 34 Architecting the Future of Big Data
  • 35. HDFS Snapshots • Create a snapshot after each daily load hdfs dfsadmin -allowSnapshot /daily-feeds Allowing snaphot on /daily-feeds succeeded hdfs dfs -createSnapshot /daily-feeds snapshot-to-2014-10-17 Created snapshot /daily-feeds/.snapshot/snapshot-to-2014-10-17 • User accidentally deletes data for 2014-10-16 hdfs dfs -ls /daily-feeds Found 4 items drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-13 drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/2014-10-14 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-15 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/2014-10-17 © Hortonworks Inc. 2011 Page 35 Architecting the Future of Big Data
  • 36. HDFS Snapshots • Snapshots to the rescue: the data is still in the snapshot hdfs dfs -ls /daily-feeds/.snapshot/snapshot-to-2014-10-17 Found 5 items drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/. snapshot/snapshot-to-2014-10-17/2014-10-13 drwxr-xr-x - chris supergroup 0 2014-10-13 14:36 /daily-feeds/. snapshot/snapshot-to-2014-10-17/2014-10-14 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. snapshot/snapshot-to-2014-10-17/2014-10-15 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. snapshot/snapshot-to-2014-10-17/2014-10-16 drwxr-xr-x - chris supergroup 0 2014-10-13 14:37 /daily-feeds/. snapshot/snapshot-to-2014-10-17/2014-10-17 • Restore data from 2014-10-16 hdfs dfs -cp /daily-feeds/.snapshot/snapshot-to-2014-10-17/2014-10-16 /daily-feeds © Hortonworks Inc. 2011 Page 36 Architecting the Future of Big Data
  • 37. YARN Application Timeline Server • Stores data about YARN application execution – Generic data – YARN container utilization – Metrics related to containers – Application-specific data – MapReduce jobs and their tasks – Tez DAG execution • Provides CLI for accessing data – Useful for ad-hoc queries or scripted analysis • Provides REST API for accessing data – Consumed by UI front-ends such as Apache Ambari © Hortonworks Inc. 2011 Page 37 Architecting the Future of Big Data
  • 38. Querying a Map Reduce Job Entity curl http://127.0.0.1:8188/ws/v1/timeline/MAPREDUCE_JOB/job_1413405332088_0001 { "entity": "job_1413405332088_0001", "entitytype": "MAPREDUCE_JOB", "events": [ © Hortonworks Inc. 2011 { "eventinfo": { "FINISHED_MAPS": 2, "FINISHED_REDUCES": 1, "FINISH_TIME": 1413405349192, "JOB_STATUS": "SUCCEEDED" }, "eventtype": "JOB_FINISHED", "timestamp": 1413405349194 } ], "relatedentities": { "MAPREDUCE_TASK": [ "task_1413405332088_0001_m_000000" ] }, "starttime": 1413405339442 } Page 38 Architecting the Future of Big Data
  • 39. Querying a Map Task Entity curl http://127.0.0.1:8188/ws/v1/timeline/MAPREDUCE_TASK/task_1413405332088_0001_m_000000 { "entity": "task_1413405332088_0001_m_000000", "entitytype": "MAPREDUCE_TASK", "events": [ © Hortonworks Inc. 2011 { "eventtype": "TASK_FINISHED", "timestamp": 1413405345253 }, { "eventinfo": { "SPLIT_LOCATIONS": "localhost", "START_TIME": 1413405340255, "TASK_TYPE": "MAP" }, "eventtype": "TASK_STARTED", "timestamp": 1413405340258 } ], } Page 39 Architecting the Future of Big Data
  • 40. Summary • Configuration – Prevent garbage collection issues – Configure for redundancy – Retune configuration in response to metrics • Documentation – End user perspective is crucial – Please consider contributing to Apache Hadoop documentation • HDFS ACLs – Implement fine-grained authorization rules on files – Can protect against accidental file manipulations • HDFS Snapshots – Point-in-time image of part of the filesystem – Useful for restoring to a prior state after accidental file manipulation • YARN Application Timeline Server – Provides generic and application-specific data about YARN application execution – Useful for analyzing cluster usage patterns © Hortonworks Inc. 2011 Page 40 Architecting the Future of Big Data
  • 41. Thank you, Q&A © Hortonworks Inc. 2011 Page 41 Resource Location Hardware Recommendations for Apache Hadoop http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.2/bk_cluster-planning-guide/ content/ch_hardware-recommendations.html Hadoop Documentation Issues https://issues.apache.org/jira/issues/?jql=project%20in%20(HDFS%2C%20HA DOOP%2C%20YARN%2C%20MAPREDUCE)%20AND%20resolution%20%3 D%20Unresolved%20AND%20labels%20%3D%20documentation HDFS operational and debuggability improvements https://issues.apache.org/jira/browse/HDFS-6185 HDFS ACLs Blog Post http://hortonworks.com/blog/hdfs-acls-fine-grained-permissions-hdfs-files-hadoop/ HDFS Snapshots Blog Post http://hortonworks.com/blog/protecting-your-enterprise-data-with-hdfs-snapshots/ YARN Timeline Server Documentation http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/ TimelineServer.html Learn more

Notes de l'éditeur

  1. There is often a lot of overlap between the two. We moderate each forum to learn what needs to be improved.
  2. Preliminary analysis suggested that we focus deeper analysis on core Hadoop, defined as HDFS, YARN and Map Reduce. This chart shows the count of support cases per month. One interesting observation that came out of this is a spike in support case activity centered around May 2014.
  3. Instead of a raw count, this chart shows the proportion of support cases attributed to core Hadoop (HDFS, YARN or Map Reduce). The gray line at the top covers all other components, a total of 26 different components. Here we see a trend stabilizing around 30% of support cases driven from core. This was another validation that focusing on core for this study would likely help the most users.
  4. This chart shows root cause analysis of the core issues during the time period. We use ~40 different root cause categories, but I’ve limited this view to the most prominent root causes. Explain each category.
  5. Investment in operations at the core helps the most users. We need to keep revisiting the code to make constant improvements.
  6. Fewer nodes is less resilient than many nodes. Failure of a DataNode that’s heavier on storage causes more re-replication activity. Map Reduce jobs may need to rerun more tasks. Commodity != poor quality.
  7. Compressed ordinary object pointers are a technique used in the JVM to represent managed pointers as 32-bit values, which saves on the space taken by 64-bit native pointers. Xmx different from Xms can cause big expensive malloc. Surprising results when you run out of memory late in the process lifetime.
  8. NFS soft mount option important for returning control to caller after timeouts.
  9. Configuration often needs to change over time.
  10. Mention Akira AJISAKA – ~64 documentation patches!
  11. If you’ve used POSIX ACLs on a Linux file system, then you already know how it works in HDFS too.
  12. By convention, snapshots can be referenced as a file system path under sub-directory “.snapshot”.
  13. Also easily consumed by other clients if you want to roll your own UI. Initial integration was done with Tez. Tez is a framework for modeling distributed computations as a directed acyclic graph of tasks. Tez code is instrumented to publish information about DAG execution to the Timeline Server.
  14. This demo shows integration with Map Reduce, which is still a work in progress. The patch is available in Apache. This view is simplified.