3. Cloudera’s Distribution including Apache Hadoop
• File System Mount: FUSE-DFS
• UI Framework: HUE
• SDK: HUE SDK
• Workflow: APACHE OOZIE*
• Scheduling: APACHE OOZIE*
• Metadata: APACHE HIVE
• Languages / Compilers: APACHE PIG, APACHE HIVE
• Fast Read/Write Access: APACHE HBASE
• Data Integration: APACHE FLUME*, APACHE SQOOP*
• Coordination: APACHE ZOOKEEPER
*currently under incubation in the Apache Software Foundation
Copyright 2011 Cloudera Inc. All rights reserved
22. Pig
23. Pig
• Scripting language
• Generates MapReduce jobs
• Perl for Hadoop
• Great for ETL
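-- Load three integer columns from 'data' (tab-delimited by default)
A = LOAD 'data' USING PigStorage() AS (f1:int, f2:int, f3:int);
-- Group rows by f1; each tuple of B is (group, bag of matching A rows)
B = GROUP A BY f1;
-- COUNT takes a bag, so count the rows in bag A for each group
C = FOREACH B GENERATE group, COUNT(A);
DUMP C;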
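To make "generates MapReduce jobs" concrete, here is a rough Python sketch of the map, shuffle, and reduce steps that a group-and-count script like the one above boils down to. It assumes the intent is a row count per distinct f1 value. The sample rows and all function names are hypothetical, and this is a conceptual illustration, not the plan Pig actually compiles.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for the 'data' file: (f1, f2, f3).
ROWS = [(1, 2, 3), (4, 2, 1), (8, 3, 4), (4, 3, 3), (7, 2, 5), (8, 4, 3)]


def map_phase(rows):
    """Emit (key, row) pairs keyed on f1, mirroring GROUP A BY f1."""
    for row in rows:
        yield row[0], row


def shuffle(pairs):
    """Collect all rows that share a key into one bag."""
    bags = defaultdict(list)
    for key, row in pairs:
        bags[key].append(row)
    return bags


def reduce_phase(bags):
    """Count the rows in each bag, mirroring COUNT over a grouped bag."""
    return sorted((key, len(bag)) for key, bag in bags.items())


def count_by_f1(rows):
    """Run the three phases in order and return (f1, count) pairs."""
    return reduce_phase(shuffle(map_phase(rows)))


if __name__ == "__main__":
    for key, n in count_by_f1(ROWS):
        print(key, n)
```

The Pig version expresses the same pipeline in a few lines, which is much of its appeal for ETL work.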
48. HBase
49. HBase
• Key/value store
• Data stored in HDFS
• Access model is get/put/delete
– Plus range scans and multiple versions per value
• Random reads and writes for Hadoop
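The access model in the bullets above can be sketched with a small in-memory toy. This is not the HBase client API: the class name `ToyTable`, its methods, and the `max_versions` parameter are all hypothetical, chosen only to show get/put/delete, sorted-key range scans, and timestamped versions.

```python
import bisect


class ToyTable:
    """In-memory toy model of a sorted, versioned key/value store."""

    def __init__(self, max_versions=3):
        self.max_versions = max_versions
        self._keys = []   # row keys, kept sorted so range scans are cheap
        self._cells = {}  # row key -> list of (timestamp, value), newest first

    def put(self, key, value, ts):
        """Store a new version of the value under key."""
        if key not in self._cells:
            bisect.insort(self._keys, key)
            self._cells[key] = []
        versions = self._cells[key]
        versions.append((ts, value))
        versions.sort(key=lambda tv: tv[0], reverse=True)
        del versions[self.max_versions:]  # drop versions beyond the limit

    def get(self, key, versions=1):
        """Return up to `versions` newest (timestamp, value) pairs."""
        return self._cells.get(key, [])[:versions]

    def delete(self, key):
        """Remove the key and all of its versions, if present."""
        if key in self._cells:
            del self._cells[key]
            del self._keys[bisect.bisect_left(self._keys, key)]

    def scan(self, start, stop):
        """Yield (key, newest value) for start <= key < stop, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        for key in self._keys[lo:hi]:
            yield key, self._cells[key][0][1]


if __name__ == "__main__":
    table = ToyTable()
    table.put("row-a", "v1", ts=1)
    table.put("row-a", "v2", ts=2)
    table.put("row-b", "x", ts=1)
    print(table.get("row-a", versions=2))
    print(list(table.scan("row-a", "row-c")))
```

Keeping the keys sorted is what makes the range scan a simple slice. The real system persists its data in HDFS rather than in process memory.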
71. CDH
• File System Mount: FUSE-DFS
• UI Framework: HUE
• SDK: HUE SDK
• Workflow: APACHE OOZIE*
• Scheduling: APACHE OOZIE*
• Metadata: APACHE HIVE
• Languages / Compilers: APACHE PIG, APACHE HIVE
• Fast Read/Write Access: APACHE HBASE
• Data Integration: APACHE FLUME*, APACHE SQOOP*
• Coordination: APACHE ZOOKEEPER
*currently under incubation in the Apache Software Foundation
72. What’s next?
• Cloudera Training Videos
• CDH Virtual Machines
• Hadoop: The Definitive Guide, 2nd Edition
• Cloudera University
– Developer Training in Columbia, MD
• Dec 13-16, Feb 13-16
– Administrator Training in Herndon, VA
• Jan 4-6
– Private Training
73. We’re Hiring!
• http://www.cloudera.com/company/careers/
• Customer Operations
– Customer Operations Engineer
– Customer Operations Tools Developer
• Customer Solutions
– Solutions Architect
• Engineering
– Senior Data Integration Developer
– Senior Distributed Systems Engineer
– Senior UI Engineer
– Software Quality Engineer
– Technical Writer
• IT/Operations
– Systems Administrator