SlideShare a Scribd company logo
1 of 22
HADOOP
                           Framework and
                           Applications




Prepared by: TEAM HADOOP                   slide1/22
CONTENTS
   WHY   HADOOP?




   INTRODUCTION      TO MapReduce




Prepared by: TEAM HADOOP             slide 2/22
WHAT?
  “... to create building blocks for programmers
  who just happen to have lots of data to
  store, lots of data to analyze, or lots of machines
  to coordinate, and who don‟t have the
  time, the skill, or the inclination to become
  distributed systems experts to build the
  infrastructure to handle it.”
                                           -Tom White

  Source: Hadoop: The Definitive Guide



Prepared by: TEAM HADOOP                        slide 3/22
WHAT?
     Hadoop contains many subprojects:
     Hadoop Common
     Chukwa
     HBase
     ZooKeeper
     Pig
     Zombie
     Hive
     MapReduce

  We will focus on MapReduce



Prepared by: TEAM HADOOP                  slide 4/22
WHO & WHEN?
   Pre-2004 : Cutting and Cafarella develop
    open source projects for web-scale
    indexing, crawling and search.




Prepared by: TEAM HADOOP                slide 5/22
WHO & WHEN?
   2004: Jeffrey Dean and Sanjay
    Ghemawat introduce map reduce model
    used internally at Google.




Prepared by: TEAM HADOOP           slide 6/22
WHO & WHEN?
   2006:Hadoop becomes official Apache
    project, Cutting joins Yahoo!Yahoo
    adopts Hadoop.




Prepared by: TEAM HADOOP            slide 7/22
TRENDS




Prepared by: TEAM HADOOP   slide 8/22
WHO USES IT?




Prepared by: TEAM HADOOP   slide 9/22
Roughly how long to read 1TB
  from a commodity hard disk?




Prepared by: TEAM HADOOP   slide 10/22
Roughly how long to read 1TB
  from a commodity hard disk?


                     Around 4 hours
WITH HADOOP..



                      62 seconds…



Prepared by: TEAM HADOOP              slide 11/22
INTRODUCTION TO MapReduce




   "Break large problem into smaller parts, solve in
   parallel, combine results."



 Prepared by: TEAM HADOOP                              slide 12/22
Typical scenario
   How  many times is the word „IT‟ present?
    You‟ll probably count but in a 30k paged
    document, can you??




Prepared by: TEAM HADOOP                 slide 13/22
Map Reduce Typical Illustration




 Prepared by: TEAM HADOOP    slide 14/22
Map Reduce paradigm

                                 Input




               Output                                   Map




                        Reduce           Shuffle/Sort




Prepared by: TEAM HADOOP                                      slide 15/22
Map Reduce paradigm
   Map:  transforms input record to
    intermediate (key, value) pair




Prepared by: TEAM HADOOP               slide 16/22
Map Reduce paradigm
   Reduce:   transforms all records for given
    key to final output.




Prepared by: TEAM HADOOP                    slide 17/22
Map reduce principles

                                           Move code to data (local
                                                computation)




                  Abstract away fault                                    Allow programs to scale
            tolerance, synchronization, etc.                          transparently w.r.t size of input




Prepared by: TEAM HADOOP                                                                                  slide 18/22
Implementation: Hardware




Prepared by: TEAM HADOOP sroy choudhury7@gmail.com   slide 19/22
Map Reduce: strengths
   Batch,   offline jobs

   Write-once,   read-many across full data
    set

   Usually,
          though not always, simple
    computations

   I/O   bound by disk/network bandwidth


Prepared by: TEAM HADOOP                  slide 20/22
What it‟s not!

  What it‟s not:

   High-performance parallel
    computing, e.g. MPI

   Low-latency    random access relational
    database

   Always   the right solution


Prepared by: TEAM HADOOP                  slide 21/22
THANK YOU!
                           QUESTIONS?




Prepared by: TEAM HADOOP                slide 22/22

More Related Content

Viewers also liked

Zaidan ismail rashid original
Zaidan ismail rashid originalZaidan ismail rashid original
Zaidan ismail rashid originalMuhiss Rahman
 
Bo p, disequlibrium,
Bo p, disequlibrium,Bo p, disequlibrium,
Bo p, disequlibrium,wwgreatmutha
 
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_María Julia Bravo
 
Међумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаМеђумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаTanja Milanović
 
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Lorenc Gordani
 
Chris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationChris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationrachelmcathy
 
22號 周玟伽
22號 周玟伽22號 周玟伽
22號 周玟伽輝 哲
 
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...South Asia Fast Track
 
EDS selection & implementation @ CCC
EDS selection & implementation @ CCCEDS selection & implementation @ CCC
EDS selection & implementation @ CCCMolly Beestrum
 
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesSalon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesecobuild.brussels
 
закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014Vladimir Razbegaev
 
動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720Keiko Morita
 

Viewers also liked (19)

Zaidan ismail rashid original
Zaidan ismail rashid originalZaidan ismail rashid original
Zaidan ismail rashid original
 
Bo p, disequlibrium,
Bo p, disequlibrium,Bo p, disequlibrium,
Bo p, disequlibrium,
 
加拉太書
加拉太書加拉太書
加拉太書
 
Expresiòn oral - Cassany
 Expresiòn oral - Cassany Expresiòn oral - Cassany
Expresiòn oral - Cassany
 
Supersticiones
SupersticionesSupersticiones
Supersticiones
 
La amistad
La amistadLa amistad
La amistad
 
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_Estrategias de ensenanza_cap6 Anijovich Mora 2009_
Estrategias de ensenanza_cap6 Anijovich Mora 2009_
 
Pat7.3 253
Pat7.3 253Pat7.3 253
Pat7.3 253
 
Међумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична везаМеђумолекулске интеракције и водонична веза
Међумолекулске интеракције и водонична веза
 
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
Evolution of the EU institutional framework (in Albanian Language) by Dr Lore...
 
Chris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentationChris Hamilton news:rewired presentation
Chris Hamilton news:rewired presentation
 
22號 周玟伽
22號 周玟伽22號 周玟伽
22號 周玟伽
 
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
Sourajit Aiyer - GSCGI WealthGram, Switzerland - Can the indian elephant move...
 
EDS selection & implementation @ CCC
EDS selection & implementation @ CCCEDS selection & implementation @ CCC
EDS selection & implementation @ CCC
 
disleksia kanak2
 disleksia kanak2 disleksia kanak2
disleksia kanak2
 
Tugas 4
Tugas 4Tugas 4
Tugas 4
 
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch servicesSalon Maison Passive - Enterprise Europe Brussels - Technology Watch services
Salon Maison Passive - Enterprise Europe Brussels - Technology Watch services
 
закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014закон о пдн_последние_изменения_разбегаев_в_авг_2014
закон о пдн_последние_изменения_разбегаев_в_авг_2014
 
動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720動画の作り方から稼ぎ方まで20130720
動画の作り方から稼ぎ方まで20130720
 

Similar to Hadoop and MapReduce

Hadoop live online training
Hadoop live online trainingHadoop live online training
Hadoop live online trainingHarika583
 
Intro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceIntro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceKrishna Sangeeth KS
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map ReduceUrvashi Kataria
 
Learn what is Hadoop-and-BigData
Learn  what is Hadoop-and-BigDataLearn  what is Hadoop-and-BigData
Learn what is Hadoop-and-BigDataThanusha154
 
Foss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopFoss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopAndrea Antonello
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopGERARDO BARBERENA
 
Understanding hadoop
Understanding hadoopUnderstanding hadoop
Understanding hadoopRexRamos9
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins Edureka!
 
Seminar_Report_hadoop
Seminar_Report_hadoopSeminar_Report_hadoop
Seminar_Report_hadoopVarun Narang
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn HadoopSilicon Halton
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduceDavid Gleich
 
2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...Prof. Maulik Trivedi
 
Intermachine Parallelism
Intermachine ParallelismIntermachine Parallelism
Intermachine ParallelismSri Prasanna
 
Hadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopHadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopVictoria López
 

Similar to Hadoop and MapReduce (20)

Hadoop Mapreduce
Hadoop MapreduceHadoop Mapreduce
Hadoop Mapreduce
 
Hadoop ppt2
Hadoop ppt2Hadoop ppt2
Hadoop ppt2
 
Hadoop live online training
Hadoop live online trainingHadoop live online training
Hadoop live online training
 
Intro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and MapreduceIntro to BigData , Hadoop and Mapreduce
Intro to BigData , Hadoop and Mapreduce
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map Reduce
 
Learn what is Hadoop-and-BigData
Learn  what is Hadoop-and-BigDataLearn  what is Hadoop-and-BigData
Learn what is Hadoop-and-BigData
 
mapReduce.pptx
mapReduce.pptxmapReduce.pptx
mapReduce.pptx
 
Foss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi WorkshopFoss4g2016 Geopaparazzi Workshop
Foss4g2016 Geopaparazzi Workshop
 
Hadoop MapReduce
Hadoop MapReduceHadoop MapReduce
Hadoop MapReduce
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to Hadoop
 
Understanding hadoop
Understanding hadoopUnderstanding hadoop
Understanding hadoop
 
Hadoop Seminar Report
Hadoop Seminar ReportHadoop Seminar Report
Hadoop Seminar Report
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins
 
Seminar_Report_hadoop
Seminar_Report_hadoopSeminar_Report_hadoop
Seminar_Report_hadoop
 
Learning How to Learn Hadoop
Learning How to Learn HadoopLearning How to Learn Hadoop
Learning How to Learn Hadoop
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduce
 
Hadoop Internals
Hadoop InternalsHadoop Internals
Hadoop Internals
 
2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...2. Develop a MapReduce program to calculate the frequency of a given word in ...
2. Develop a MapReduce program to calculate the frequency of a given word in ...
 
Intermachine Parallelism
Intermachine ParallelismIntermachine Parallelism
Intermachine Parallelism
 
Hadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoopHadoop, MapReduce and R = RHadoop
Hadoop, MapReduce and R = RHadoop
 

More from Abhishek Dey

Automatic problem generation
Automatic problem generationAutomatic problem generation
Automatic problem generationAbhishek Dey
 
Handling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingHandling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingAbhishek Dey
 
Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Abhishek Dey
 
Cloud computing using Eucalyptus
Cloud computing using EucalyptusCloud computing using Eucalyptus
Cloud computing using EucalyptusAbhishek Dey
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computingAbhishek Dey
 

More from Abhishek Dey (6)

Automatic problem generation
Automatic problem generationAutomatic problem generation
Automatic problem generation
 
Cafaholic ppt
Cafaholic pptCafaholic ppt
Cafaholic ppt
 
Handling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud ComputingHandling High Energy Physics Data using Cloud Computing
Handling High Energy Physics Data using Cloud Computing
 
Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013Big Data Analysis on a Cloud Ecosystem-PATW 2013
Big Data Analysis on a Cloud Ecosystem-PATW 2013
 
Cloud computing using Eucalyptus
Cloud computing using EucalyptusCloud computing using Eucalyptus
Cloud computing using Eucalyptus
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 

Recently uploaded

fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 

Recently uploaded (20)

fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 

Hadoop and MapReduce

  • 1. HADOOP Framework and Applications Prepared by: TEAM HADOOP slide1/22
  • 2. CONTENTS  WHY HADOOP?  INTRODUCTION TO MapReduce Prepared by: TEAM HADOOP slide 2/22
  • 3. WHAT? “... to create building blocks for programmers who just happen to have lots of data to store, lots of data to analyze, or lots of machines to coordinate, and who don‟t have the time, the skill, or the inclination to become distributed systems experts to build the infrastructure to handle it.” -Tom White Source: Hadoop: The Definitive Guide Prepared by: TEAM HADOOP slide 3/22
  • 4. WHAT?  Hadoop contains many subprojects:  Hadoop Common  Chukwa  HBase  ZooKeeper  Pig  Zombie  Hive  MapReduce We will focus on MapReduce Prepared by: TEAM HADOOP slide 4/22
  • 5. WHO & WHEN?  Pre-2004 : Cutting and Cafarella develop open source projects for web-scale indexing, crawling and search. Prepared by: TEAM HADOOP slide 5/22
  • 6. WHO & WHEN?  2004: Jeffrey Dean and Sanjay Ghemawat introduce map reduce model used internally at Google. Prepared by: TEAM HADOOP slide 6/22
  • 7. WHO & WHEN?  2006:Hadoop becomes official Apache project, Cutting joins Yahoo!Yahoo adopts Hadoop. Prepared by: TEAM HADOOP slide 7/22
  • 8. TRENDS Prepared by: TEAM HADOOP slide 8/22
  • 9. WHO USES IT? Prepared by: TEAM HADOOP slide 9/22
  • 10. Roughly how long to read 1TB from a commodity hard disk? Prepared by: TEAM HADOOP slide 10/22
  • 11. Roughly how long to read 1TB from a commodity hard disk? Around 4 hours WITH HADOOP.. 62 seconds… Prepared by: TEAM HADOOP slide 11/22
  • 12. INTRODUCTION TO MapReduce "Break large problem into smaller parts, solve in parallel, combine results." Prepared by: TEAM HADOOP slide 12/22
  • 13. Typical scenario  How many times is the word „IT‟ present? You‟ll probably count but in a 30k paged document, can you?? Prepared by: TEAM HADOOP slide 13/22
  • 14. Map Reduce Typical Illustration Prepared by: TEAM HADOOP slide 14/22
  • 15. Map Reduce paradigm Input Output Map Reduce Shuffle/Sort Prepared by: TEAM HADOOP slide 15/22
  • 16. Map Reduce paradigm  Map: transforms input record to intermediate (key, value) pair Prepared by: TEAM HADOOP slide 16/22
  • 17. Map Reduce paradigm  Reduce: transforms all records for given key to final output. Prepared by: TEAM HADOOP slide 17/22
  • 18. Map reduce principles Move code to data (local computation) Abstract away fault Allow programs to scale tolerance, synchronization, etc. transparently w.r.t size of input Prepared by: TEAM HADOOP slide 18/22
  • 19. Implementation: Hardware Prepared by: TEAM HADOOP sroy choudhury7@gmail.com slide 19/22
  • 20. Map Reduce: strengths  Batch, offline jobs  Write-once, read-many across full data set  Usually, though not always, simple computations  I/O bound by disk/network bandwidth Prepared by: TEAM HADOOP slide 20/22
  • 21. What it‟s not! What it‟s not:  High-performance parallel computing, e.g. MPI  Low-latency random access relational database  Always the right solution Prepared by: TEAM HADOOP slide 21/22
  • 22. THANK YOU! QUESTIONS? Prepared by: TEAM HADOOP slide 22/22