Soumettre la recherche
Mettre en ligne
Hadoopを業務で使ってみました
•
Télécharger en tant que KEY, PDF
•
4 j'aime
•
2,196 vues
Tatsuya Sasaki
Suivre
テックライフLT #tllt で使ったスライドです
Lire moins
Lire la suite
Technologie
Formation
Signaler
Partager
Signaler
Partager
1 sur 23
Télécharger maintenant
Recommandé
800万人の"食べたい"をHadoopで分散処理
800万人の"食べたい"をHadoopで分散処理
Tatsuya Sasaki
マーケティングのためのHadoop利用
マーケティングのためのHadoop利用
Tatsuya Sasaki
961万人の食卓を支えるデータ解析
961万人の食卓を支えるデータ解析
Tatsuya Sasaki
Big Data in the Microsoft Platform
Big Data in the Microsoft Platform
Jesus Rodriguez
Cloud Friendly Hadoop and Hive
Cloud Friendly Hadoop and Hive
DataWorks Summit
Intro to cassandra + hadoop
Intro to cassandra + hadoop
Jeremy Hanna
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010
Cloudera, Inc.
Hadoop_content_by_sasidhar2
Hadoop_content_by_sasidhar2
Akshara Technologies Training by Industry Experts
Recommandé
800万人の"食べたい"をHadoopで分散処理
800万人の"食べたい"をHadoopで分散処理
Tatsuya Sasaki
マーケティングのためのHadoop利用
マーケティングのためのHadoop利用
Tatsuya Sasaki
961万人の食卓を支えるデータ解析
961万人の食卓を支えるデータ解析
Tatsuya Sasaki
Big Data in the Microsoft Platform
Big Data in the Microsoft Platform
Jesus Rodriguez
Cloud Friendly Hadoop and Hive
Cloud Friendly Hadoop and Hive
DataWorks Summit
Intro to cassandra + hadoop
Intro to cassandra + hadoop
Jeremy Hanna
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010
Cloudera, Inc.
Hadoop_content_by_sasidhar2
Hadoop_content_by_sasidhar2
Akshara Technologies Training by Industry Experts
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Jeremy Hanna
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Frens Jan Rumph
Hadoop big data online training
Hadoop big data online training
Magnific Trainings
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Firman Gautama
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Yahoo Developer Network
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Big Data Joe™ Rossi
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Makoto Yui
Productive data engineer
Productive data engineer
Rafał Wojdyła
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
Hadoop introduction
Hadoop introduction
shubham kuwar
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Ted Dunning
Hadoop basics
Hadoop basics
Antonio Silveira
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Robert Stupp
Hadoop training
Hadoop training
TIB Academy
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
responseteam
からあげエンジニアについて
からあげエンジニアについて
Tatsuya Sasaki
クックパッドでのemr利用事例
クックパッドでのemr利用事例
Tatsuya Sasaki
からあげとビーチと私
からあげとビーチと私
Tatsuya Sasaki
Contenu connexe
Tendances
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Jeremy Hanna
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Frens Jan Rumph
Hadoop big data online training
Hadoop big data online training
Magnific Trainings
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Firman Gautama
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Yahoo Developer Network
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Big Data Joe™ Rossi
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Makoto Yui
Productive data engineer
Productive data engineer
Rafał Wojdyła
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
Hadoop introduction
Hadoop introduction
shubham kuwar
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Ted Dunning
Hadoop basics
Hadoop basics
Antonio Silveira
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Robert Stupp
Hadoop training
Hadoop training
TIB Academy
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
responseteam
Tendances
(19)
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Cloud Optimized Big Data
Cloud Optimized Big Data
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Hadoop big data online training
Hadoop big data online training
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Productive data engineer
Productive data engineer
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Hadoop introduction
Hadoop introduction
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Hadoop basics
Hadoop basics
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Hadoop training
Hadoop training
Introduction to pig & pig latin
Introduction to pig & pig latin
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
Plus de Tatsuya Sasaki
からあげエンジニアについて
からあげエンジニアについて
Tatsuya Sasaki
クックパッドでのemr利用事例
クックパッドでのemr利用事例
Tatsuya Sasaki
からあげとビーチと私
からあげとビーチと私
Tatsuya Sasaki
メタプログラミングでDSLを書こう
メタプログラミングでDSLを書こう
Tatsuya Sasaki
NoSQLデータベースが登場した背景と特徴
NoSQLデータベースが登場した背景と特徴
Tatsuya Sasaki
Hadoopをemr経由で利用する方法
Hadoopをemr経由で利用する方法
Tatsuya Sasaki
COOKPADでのHadoop利用
COOKPADでのHadoop利用
Tatsuya Sasaki
Hadoop導入事例 in クックパッド
Hadoop導入事例 in クックパッド
Tatsuya Sasaki
Hadoopを業務で使ってみた
Hadoopを業務で使ってみた
Tatsuya Sasaki
YUI
YUI
Tatsuya Sasaki
Plus de Tatsuya Sasaki
(10)
からあげエンジニアについて
からあげエンジニアについて
クックパッドでのemr利用事例
クックパッドでのemr利用事例
からあげとビーチと私
からあげとビーチと私
メタプログラミングでDSLを書こう
メタプログラミングでDSLを書こう
NoSQLデータベースが登場した背景と特徴
NoSQLデータベースが登場した背景と特徴
Hadoopをemr経由で利用する方法
Hadoopをemr経由で利用する方法
COOKPADでのHadoop利用
COOKPADでのHadoop利用
Hadoop導入事例 in クックパッド
Hadoop導入事例 in クックパッド
Hadoopを業務で使ってみた
Hadoopを業務で使ってみた
YUI
YUI
Dernier
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
debabhi2
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Zilliz
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Khem
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Miguel Araújo
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
DianaGray10
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
rafiqahmad00786416
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
Dropbox
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Zilliz
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Khushali Kathiriya
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Drew Madelung
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
sammart93
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
sudhanshuwaghmare1
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
apidays
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
The Digital Insurer
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Edi Saputra
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
Rustici Software
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
apidays
Dernier
(20)
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Hadoopを業務で使ってみました
1.
Hadoop
2.
• id:sasata299 • • Ruby
Perl •
3.
Hadoop
4.
816 30
3 1
5.
(
)
6.
7.
• • GROUP BY
( ( Д`) • 7000 ( )
8.
9.
Hadoop
10.
Hadoop • Hadoop Streaming •
Ruby • Amazon EC2 Hadoop • 50
11.
Hadoop Streaming
12.
•
( ) • Mapper Reducer
13.
14.
HDFS Mapper, Reducer
15.
Java ( or
JRuby ) Java API
16.
Hadoop Streaming
…orz
17.
18.
Hadoop
cat `hadoop dfs -cat s3://xxxx/user/root/in/hoge` HDFS
19.
7000
( )→
20.
7000
( )→ 30
21.
Hadoop
!!
22.
• Hadoop Streaming
HDFS (Hadoop cat ) • 7000 30 Hadoop
Télécharger maintenant