8. Hadoop Architecture
• Hadoop Common - A set of common libraries and utilities used by other
Hadoop modules.
• HDFS - The Hadoop Distributed File System, the default storage layer for Hadoop.
• MapReduce - Executes a wide range of analytic functions by analysing
datasets in parallel before 'reducing' the results. The "Map" job distributes
a query to different nodes, and the "Reduce" job gathers the results and
resolves them into a single value.
• YARN - Present from version 2.0 onwards, YARN (Yet Another Resource
Negotiator) is the cluster management layer of Hadoop. Prior to 2.0,
MapReduce was responsible for cluster management as well as processing.
The inclusion of YARN means you can run multiple applications in Hadoop
(so you're no longer limited to MapReduce), all sharing common cluster
management.
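The Map/Reduce split described above can be sketched in plain Python. This is a toy illustration of the programming model only, not Hadoop's actual Java API; the word-count task and the function names are my own choices:

```python
from collections import defaultdict

def map_phase(documents):
    # "Map" job: each document is processed independently (in Hadoop,
    # potentially on different nodes), emitting (word, 1) pairs.
    for doc in documents:
        for word in doc.split():
            yield (word, 1)

def reduce_phase(pairs):
    # "Reduce" job: gather the emitted pairs, group them by key, and
    # resolve each group into a single value (here, a sum).
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

docs = ["big data", "big clusters"]
print(reduce_phase(map_phase(docs)))  # {'big': 2, 'data': 1, 'clusters': 1}
```

Because each map call touches only its own document, the map phase parallelises naturally across nodes; only the reduce phase needs to see results from all of them.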
9. Hadoop Architecture
• Spark - Used on top of HDFS, Spark promises speeds up to 100 times faster than the two-
step MapReduce function in certain applications. It allows data to be loaded in memory and
queried repeatedly, making it particularly apt for machine learning algorithms.
• Hive - Originally developed by Facebook, Hive is a data warehouse infrastructure built on
top of Hadoop. Hive provides a simple, SQL-like language called HiveQL, whilst
maintaining full support for MapReduce. This means SQL programmers with little prior
Hadoop experience can use the system more easily, and it provides better integration with
certain analytics packages such as Tableau. Hive also provides indexes, making querying
faster.
• HBase - A NoSQL columnar database designed to run on top of HDFS. It is
modelled after Google's BigTable and written in Java, and was designed to provide
BigTable-like capabilities to Hadoop, such as the columnar data storage model and
support for sparse data.
• Flume - Flume collects data (typically logs) from 'agents', which it then aggregates and
moves into Hadoop. In essence, Flume is what takes the data from the source (say a
server or mobile device) and delivers it to Hadoop.
• Mahout - Mahout is a machine learning library. It collects key algorithms for clustering,
classification and collaborative filtering and implements them on top of distributed data
systems such as MapReduce. Mahout primarily set out to collect algorithms for
implementation on the MapReduce model, but has begun implementing them on other
systems that are more efficient for data mining, such as Spark.
• Sqoop - Sqoop is a tool which aids in transferring data from other database systems (such
as relational databases) into Hadoop.
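Spark's speed advantage over two-step MapReduce comes largely from keeping data in memory across repeated queries. The following is a toy plain-Python illustration of that idea, not the real Spark API; the `load_from_storage` function is a hypothetical stand-in for an expensive HDFS read:

```python
def load_from_storage():
    # Hypothetical stand-in for an expensive read from HDFS;
    # we count how many times it is invoked.
    load_from_storage.reads += 1
    return [1, 2, 3, 4, 5]
load_from_storage.reads = 0

# MapReduce-style: each pass over the data re-reads it from storage.
totals_naive = [sum(load_from_storage()) for _ in range(3)]

# Spark-style: load the data into memory once, then query it repeatedly
# (analogous to caching an RDD or DataFrame).
cached = load_from_storage()
totals_cached = [sum(cached) for _ in range(3)]

print(load_from_storage.reads)  # 4 reads: 3 naive passes + 1 to fill the cache
```

For an iterative machine learning algorithm that makes dozens of passes over the same dataset, avoiding the repeated reads is exactly where the claimed speed-ups come from.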