Hadoop - Simple. Scalable.
- 14. Infoporn from Yahoo
73 hours
490 TB Shuffling
280 TB Output
4000 Nodes
16 PB Disk Space
32K Cores
64 TB RAM
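To put the cluster totals above in per-machine terms, here is a minimal sketch of the arithmetic. It assumes decimal units (1 PB = 1000 TB, 1 TB = 1000 GB) and reads "32K" as 32,000; neither is stated on the slide, and the variable names are illustrative.

```python
# Cluster totals as quoted on the slide.
NODES = 4000
DISK_PB = 16
CORES = 32_000   # assumption: "32K" means 32,000, not 32,768
RAM_TB = 64


def per_node(total, nodes=NODES):
    """Divide a cluster-wide total evenly across all nodes."""
    return total / nodes


# Assumption: decimal unit conversions (1 PB = 1000 TB, 1 TB = 1000 GB).
disk_tb_per_node = per_node(DISK_PB * 1000)
cores_per_node = per_node(CORES)
ram_gb_per_node = per_node(RAM_TB * 1000)

print(f"{disk_tb_per_node:g} TB disk, {cores_per_node:g} cores, "
      f"{ram_gb_per_node:g} GB RAM per node")
```

Under those assumptions each node averages about 4 TB of disk, 8 cores and 16 GB of RAM, which suggests ordinary servers rather than specialised hardware.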
- 18. Data Storage
Disk capacity has grown far faster
than read speeds. Datasets won't
fit on one disk. The system must
tolerate node failure.
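The gap between capacity and read speed can be made concrete with a rough calculation. The figures below (a 1 TB dataset, 100 MB/s sustained read per disk, 100 disks) are hypothetical and not from the slides; the sketch only shows why spreading data over many disks shortens a full scan.

```python
TB = 10**12  # bytes, decimal units
MB = 10**6   # bytes, decimal units


def scan_seconds(dataset_bytes, disk_bytes_per_sec, disks=1):
    """Time to read the whole dataset when it is split evenly
    across `disks` drives that are all read in parallel."""
    return dataset_bytes / (disk_bytes_per_sec * disks)


# Hypothetical figures chosen for illustration only.
one_disk = scan_seconds(1 * TB, 100 * MB)
hundred_disks = scan_seconds(1 * TB, 100 * MB, disks=100)

print(f"1 disk: {one_disk / 3600:.1f} h, 100 disks: {hundred_disks:.0f} s")
```

With these assumed numbers a single disk needs close to three hours for one pass, while a hundred disks in parallel need under two minutes. More disks also mean more frequent failures, which is why the slide pairs the two requirements.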
- 19. Data Analysis
Combine data from many
machines. Tolerate node failure.
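Combining data from many machines is typically done in Hadoop with the map, shuffle and reduce pattern. The following is a single-process sketch of that pattern as a word count, not Hadoop's actual API: the function names and sample partitions are invented for illustration, each inner list stands in for the data held on one machine, and failure handling is not modelled.

```python
from collections import defaultdict


def map_phase(partition):
    """Emit a (word, 1) pair for every word in one machine's lines."""
    return [(word, 1) for line in partition for word in line.split()]


def shuffle(mapped_outputs):
    """Group values by key across the outputs of all map tasks."""
    groups = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the grouped values to get one count per word."""
    return {key: sum(values) for key, values in groups.items()}


# Hypothetical input: two "machines", each holding a few lines of text.
partitions = [
    ["simple scalable"],
    ["scalable storage", "simple analysis"],
]

counts = reduce_phase(shuffle(map_phase(p) for p in partitions))
print(counts)
```

Because each map task touches only its own partition, a task lost to a node failure can be rerun on another copy of that partition without redoing the rest of the job.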