Soumettre la recherche
Mettre en ligne
WebHDFS at King - May 2014 Hadoop MeetUp
•
1 j'aime
•
1,185 vues
huguk
Suivre
The latest developments at King on their work with WebHDFS .
Lire moins
Lire la suite
Technologie
Sports
Signaler
Partager
Signaler
Partager
1 sur 20
Télécharger maintenant
Télécharger pour lire hors ligne
Recommandé
Integration with hdfs using WebDFS and NFS
Integration with hdfs using WebDFS and NFS
Christophe Marchal
Fluentd and WebHDFS
Fluentd and WebHDFS
SATOSHI TAGOMORI
Drupal feature proposal: two new stream-wrappers
Drupal feature proposal: two new stream-wrappers
Marcus Deglos
HAProxy scale out using open source
HAProxy scale out using open source
Ingo Walz
GFProxy: Scaling the GlusterFS FUSE Client
GFProxy: Scaling the GlusterFS FUSE Client
Gluster.org
Using memcache to improve php performance
Using memcache to improve php performance
Sudar Muthu
Caching basics in PHP
Caching basics in PHP
Anis Ahmad
ReplacingSquidWithATS
ReplacingSquidWithATS
Chiranjeevi Jaladi
Recommandé
Integration with hdfs using WebDFS and NFS
Integration with hdfs using WebDFS and NFS
Christophe Marchal
Fluentd and WebHDFS
Fluentd and WebHDFS
SATOSHI TAGOMORI
Drupal feature proposal: two new stream-wrappers
Drupal feature proposal: two new stream-wrappers
Marcus Deglos
HAProxy scale out using open source
HAProxy scale out using open source
Ingo Walz
GFProxy: Scaling the GlusterFS FUSE Client
GFProxy: Scaling the GlusterFS FUSE Client
Gluster.org
Using memcache to improve php performance
Using memcache to improve php performance
Sudar Muthu
Caching basics in PHP
Caching basics in PHP
Anis Ahmad
ReplacingSquidWithATS
ReplacingSquidWithATS
Chiranjeevi Jaladi
GlusterFS As an Object Storage
GlusterFS As an Object Storage
Keisuke Takahashi
Windows Server 2016 Webinar
Windows Server 2016 Webinar
Men and Mice
dNFS for DBA's
dNFS for DBA's
Marcin Przepiórowski
Apache Traffic Server
Apache Traffic Server
supertom
5-WebServers.ppt
5-WebServers.ppt
webhostingguy
are available here
are available here
webhostingguy
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Ortus Solutions, Corp
Rubyspec y el largo camino hacia Ruby 1.9
Rubyspec y el largo camino hacia Ruby 1.9
David Calavera
[MathWorks] Versioning Infrastructure
[MathWorks] Versioning Infrastructure
Perforce
Redis
Redis
Marc Beaupré-Pham
Apache Traffic Server & Lua
Apache Traffic Server & Lua
Kit Chan
WE18_Performance_Up.ppt
WE18_Performance_Up.ppt
webhostingguy
HAProxy tech talk
HAProxy tech talk
icebourg
Curl Tutorial
Curl Tutorial
Ankireddy Polu
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
Gluster.org
What is new in BIND 9.11?
What is new in BIND 9.11?
Men and Mice
Web Server Load Balancer
Web Server Load Balancer
MobME Technical
Mini-Training: To cache or not to cache
Mini-Training: To cache or not to cache
Betclic Everest Group Tech Team
Clug 2012 March web server optimisation
Clug 2012 March web server optimisation
grooverdan
NGINX: High Performance Load Balancing
NGINX: High Performance Load Balancing
NGINX, Inc.
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
huguk
ether.camp - Hackathon & ether.camp intro
ether.camp - Hackathon & ether.camp intro
huguk
Contenu connexe
Tendances
GlusterFS As an Object Storage
GlusterFS As an Object Storage
Keisuke Takahashi
Windows Server 2016 Webinar
Windows Server 2016 Webinar
Men and Mice
dNFS for DBA's
dNFS for DBA's
Marcin Przepiórowski
Apache Traffic Server
Apache Traffic Server
supertom
5-WebServers.ppt
5-WebServers.ppt
webhostingguy
are available here
are available here
webhostingguy
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Ortus Solutions, Corp
Rubyspec y el largo camino hacia Ruby 1.9
Rubyspec y el largo camino hacia Ruby 1.9
David Calavera
[MathWorks] Versioning Infrastructure
[MathWorks] Versioning Infrastructure
Perforce
Redis
Redis
Marc Beaupré-Pham
Apache Traffic Server & Lua
Apache Traffic Server & Lua
Kit Chan
WE18_Performance_Up.ppt
WE18_Performance_Up.ppt
webhostingguy
HAProxy tech talk
HAProxy tech talk
icebourg
Curl Tutorial
Curl Tutorial
Ankireddy Polu
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
Gluster.org
What is new in BIND 9.11?
What is new in BIND 9.11?
Men and Mice
Web Server Load Balancer
Web Server Load Balancer
MobME Technical
Mini-Training: To cache or not to cache
Mini-Training: To cache or not to cache
Betclic Everest Group Tech Team
Clug 2012 March web server optimisation
Clug 2012 March web server optimisation
grooverdan
NGINX: High Performance Load Balancing
NGINX: High Performance Load Balancing
NGINX, Inc.
Tendances
(20)
GlusterFS As an Object Storage
GlusterFS As an Object Storage
Windows Server 2016 Webinar
Windows Server 2016 Webinar
dNFS for DBA's
dNFS for DBA's
Apache Traffic Server
Apache Traffic Server
5-WebServers.ppt
5-WebServers.ppt
are available here
are available here
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Cf camp 2019 cfconfig - a new way to manage your cold-fusion engine config
Rubyspec y el largo camino hacia Ruby 1.9
Rubyspec y el largo camino hacia Ruby 1.9
[MathWorks] Versioning Infrastructure
[MathWorks] Versioning Infrastructure
Redis
Redis
Apache Traffic Server & Lua
Apache Traffic Server & Lua
WE18_Performance_Up.ppt
WE18_Performance_Up.ppt
HAProxy tech talk
HAProxy tech talk
Curl Tutorial
Curl Tutorial
Hands On Gluster with Jeff Darcy
Hands On Gluster with Jeff Darcy
What is new in BIND 9.11?
What is new in BIND 9.11?
Web Server Load Balancer
Web Server Load Balancer
Mini-Training: To cache or not to cache
Mini-Training: To cache or not to cache
Clug 2012 March web server optimisation
Clug 2012 March web server optimisation
NGINX: High Performance Load Balancing
NGINX: High Performance Load Balancing
Plus de huguk
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
huguk
ether.camp - Hackathon & ether.camp intro
ether.camp - Hackathon & ether.camp intro
huguk
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
huguk
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
huguk
Extracting maximum value from data while protecting consumer privacy. Jason ...
Extracting maximum value from data while protecting consumer privacy. Jason ...
huguk
Intelligence Augmented vs Artificial Intelligence. Alex Flamant, IBM Watson
Intelligence Augmented vs Artificial Intelligence. Alex Flamant, IBM Watson
huguk
Streaming Dataflow with Apache Flink
Streaming Dataflow with Apache Flink
huguk
Lambda architecture on Spark, Kafka for real-time large scale ML
Lambda architecture on Spark, Kafka for real-time large scale ML
huguk
Today’s reality Hadoop with Spark- How to select the best Data Science approa...
Today’s reality Hadoop with Spark- How to select the best Data Science approa...
huguk
Jonathon Southam: Venture Capital, Funding & Pitching
Jonathon Southam: Venture Capital, Funding & Pitching
huguk
Signal Media: Real-Time Media & News Monitoring
Signal Media: Real-Time Media & News Monitoring
huguk
Dean Bryen: Scaling The Platform For Your Startup
Dean Bryen: Scaling The Platform For Your Startup
huguk
Peter Karney: Intro to the Digital catapult
Peter Karney: Intro to the Digital catapult
huguk
Cytora: Real-Time Political Risk Analysis
Cytora: Real-Time Political Risk Analysis
huguk
Cubitic: Predictive Analytics
Cubitic: Predictive Analytics
huguk
Bird.i: Earth Observation Data Made Social
Bird.i: Earth Observation Data Made Social
huguk
Aiseedo: Real Time Machine Intelligence
Aiseedo: Real Time Machine Intelligence
huguk
Secrets of Spark's success - Deenar Toraskar, Think Reactive
Secrets of Spark's success - Deenar Toraskar, Think Reactive
huguk
TV Marketing and big data: cat and dog or thick as thieves? Krzysztof Osiewal...
TV Marketing and big data: cat and dog or thick as thieves? Krzysztof Osiewal...
huguk
Hadoop - Looking to the Future By Arun Murthy
Hadoop - Looking to the Future By Arun Murthy
huguk
Plus de huguk
(20)
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
Data Wrangling on Hadoop - Olivier De Garrigues, Trifacta
ether.camp - Hackathon & ether.camp intro
ether.camp - Hackathon & ether.camp intro
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Using Big Data techniques to query and store OpenStreetMap data. Stephen Knox...
Extracting maximum value from data while protecting consumer privacy. Jason ...
Extracting maximum value from data while protecting consumer privacy. Jason ...
Intelligence Augmented vs Artificial Intelligence. Alex Flamant, IBM Watson
Intelligence Augmented vs Artificial Intelligence. Alex Flamant, IBM Watson
Streaming Dataflow with Apache Flink
Streaming Dataflow with Apache Flink
Lambda architecture on Spark, Kafka for real-time large scale ML
Lambda architecture on Spark, Kafka for real-time large scale ML
Today’s reality Hadoop with Spark- How to select the best Data Science approa...
Today’s reality Hadoop with Spark- How to select the best Data Science approa...
Jonathon Southam: Venture Capital, Funding & Pitching
Jonathon Southam: Venture Capital, Funding & Pitching
Signal Media: Real-Time Media & News Monitoring
Signal Media: Real-Time Media & News Monitoring
Dean Bryen: Scaling The Platform For Your Startup
Dean Bryen: Scaling The Platform For Your Startup
Peter Karney: Intro to the Digital catapult
Peter Karney: Intro to the Digital catapult
Cytora: Real-Time Political Risk Analysis
Cytora: Real-Time Political Risk Analysis
Cubitic: Predictive Analytics
Cubitic: Predictive Analytics
Bird.i: Earth Observation Data Made Social
Bird.i: Earth Observation Data Made Social
Aiseedo: Real Time Machine Intelligence
Aiseedo: Real Time Machine Intelligence
Secrets of Spark's success - Deenar Toraskar, Think Reactive
Secrets of Spark's success - Deenar Toraskar, Think Reactive
TV Marketing and big data: cat and dog or thick as thieves? Krzysztof Osiewal...
TV Marketing and big data: cat and dog or thick as thieves? Krzysztof Osiewal...
Hadoop - Looking to the Future By Arun Murthy
Hadoop - Looking to the Future By Arun Murthy
Dernier
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Allon Mureinik
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Roshan Dwivedi
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
V3cube
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
Enterprise Knowledge
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Puma Security, LLC
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Delhi Call girls
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
The Digital Insurer
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
naman860154
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
gurkirankumar98700
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Gabriella Davis
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Safe Software
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Results
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
The Digital Insurer
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
Dernier
(20)
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
WebHDFS at King - May 2014 Hadoop MeetUp
1.
2.
2 How to turbo
charge your data transfers with WebHDFS Andy Done, Data Platform Lead andy.done@king.com
3.
4.
Last time…
5.
Since then…
6.
100 40 Hadoop
7.
1 0.5 Storage
8.
15 10 Events
9.
10 4 ExaSol
10.
11.
2.5 6 Load times
12.
Problem WebHDFS 12
13.
Old way WebHDFS
14.
Old way hadoop fs
–cat /some/path/* | bulk_load my_table WebHDFS
15.
WebHDFS way WebHDFS
16.
WebHDFS way IMPORT INTO
TABLE my_table FROM FILE ‘http://namenode/webhdfs/v1/some/path/file_1’ FILE ‘http://namenode/webhdfs/v1/some/path/file_2’ … FILE ‘http://namenode/webhdfs/v1/some/path/file_n’ WebHDFS
17.
WebHDFS benefits • Simple •
Efficient • Ubiquitous • Parallelisable • Bidirectional • Fast WebHDFS
18.
18 Conclusion WebHDFS
19.
Thank you 19
20.
We're hiring! 20
Télécharger maintenant