SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
www.edureka.co/hadoop-admin
Which Hadoop Distribution to Choose
www.edureka.co/hadoop-admin
What will you learn today?
 Introduction to Apache Hadoop
 Various Hadoop Distributions
 Cloudera Hadoop Distribution
 A closer look at Hortonworks and MapR
 How to choose a Hadoop Distribution
www.edureka.co/hadoop-admin
Where it all started – Apache Hadoop
The Apache Hadoop is an open source framework that allows distributed processing of
large data sets across clusters of computers
Hadoop introduced a new way to simplify the analysis of large data sets, and in a very short time
reshaped the big data market and have become the synonym for big data
www.edureka.co/hadoop-admin
A closer look at Apache Hadoop
Apache Hadoop includes following modules :
Hadoop Distributed File System (HDFS): A distributed file system that provides access to application data
Hadoop Common: The common utilities that support the other Hadoop modules
Hadoop YARN: A framework for job scheduling and cluster resource management
Hadoop MapReduce: A YARN-based system for parallel processing of large data sets
www.edureka.co/hadoop-admin
Popular Hadoop Distributions
www.edureka.co/hadoop-admin
Popular Hadoop Distributions - Cloudera
Founded by a group of engineers from Yahoo, Google and Facebook Cloudera ranks top in the big
data vendors list for making Hadoop a reliable platform for business use since 2008
www.edureka.co/hadoop-admin
A closer look at Cloudera - CDH
Cloudera Hadoop (CDH) - CDH includes the core elements of Hadoop along with additional components
such as a user interface, security, and integration with a broad range of hardware and software
www.edureka.co/hadoop-admin
A closer look at Cloudera – Cloudera Manager
Cloudera Manager makes administration of your enterprise data hub simple and straightforward, at any
scale. With Cloudera Manager, you can easily deploy and centrally operate the complete Big Data stack
www.edureka.co/hadoop-admin
A closer look at Cloudera – Other Products
Cloudera Express - Cloudera Express is a free download that combines CDH with Cloudera Manager,
which provides robust cluster management capabilities like automated deployment, centralized
administration, monitoring, and diagnostic tools
Cloudera Enterprise - Cloudera Enterprise includes CDH with advanced system management and
data management tools plus dedicated support from Cloudera
Cloudera Director - Cloudera Director extends Cloudera's enterprise data hub architecture to the
cloud, without compromising on security, management, and governance
www.edureka.co/hadoop-admin
Popular Hadoop Distributions - Hortonworks
Founded in 2011, Hortonworks has quickly emerged as one of the leading vendors of Hadoop
Hortonworks Data Platform
www.edureka.co/hadoop-admin
Hortonworks Sandbox
Hortonworks Sandbox lets you get started with Hortonworks Data Platform (HDP) . You can run
Hortonworks Sandbox either in the cloud or on your personal machine.
Hortonworks Sandbox in the CloudHortonworks Sandbox on a VM
www.edureka.co/hadoop-admin
Popular Hadoop Distributions - MapR
Compared to other Hadoop distributions e.g. Cloudera and Hortonworks, MapR takes a different
approach as it uses its own proprietary file system MapRFS
MapR Data Platform
www.edureka.co/hadoop-admin
MapR Products
www.edureka.co/hadoop-admin
Which one to choose ?
www.edureka.co/hadoop-admin
Which one to choose ?
Before selecting the Hadoop Distribution ask yourself which problems you are
trying to solve and what all features you need
www.edureka.co/hadoop-admin
Choosing a Hadoop Distribution
If you are looking for complete
Hadoop stack with all features,
then MapR is the way to go.
But note that MapR enterprise
edition is not free and takes a
different approach than Apache
Hadoop
www.edureka.co/hadoop-admin
Choosing a Hadoop Distribution
If you are looking for complete
Hadoop stack with all features,
then MapR is the way to go.
But note that MapR enterprise
edition is not free and takes a
different approach than Apache
Hadoop
Cloudera is based on 100% open
source Apache Hadoop and has
added its own proprietary tools
Similar to MapR, Cloudera also
provides both free and paid
distribution with extra features
and support
www.edureka.co/hadoop-admin
Choosing a Hadoop Distribution
Social
xyz
Key point in Economical Key point in Social
If you are looking for complete
Hadoop stack with all features,
then MapR is the way to go.
But note that MapR enterprise
edition is not free and takes a
different approach than Apache
Hadoop
Cloudera is based on open source
Apache Hadoop but has added its
own proprietary tools.
Similar to MapR, Cloudera also
provides both free and paid
distribution with extra features
and support
Hortonworks is the only
commercial vendor to provide
complete open source Hadoop.
Hortonworks intentionally not
developed proprietary software
and uses open source softwares
e.g. Ambari, Stinger and Solr
www.edureka.co/hadoop-admin
Why not try them all ?
All Hadoop Distribution vendors provide free (community edition) version, its not a bad idea to try them
all and get an idea how each one of them is different from others
www.edureka.co/hadoop-admin
Survey
Your feedback is vital for us, be it a compliment, a suggestion or a complaint. It helps us to
make your experience better!
Please spare few minutes to take the survey after the webinar.
www.edureka.co/hadoop-admin
Thank You …
Questions/Queries/Feedback
Recording and presentation will be made available to you within 24 hours

Contenu connexe

Tendances

Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
DataWorks Summit
 

Tendances (20)

Internal Hive
Internal HiveInternal Hive
Internal Hive
 
Hadoop Backup and Disaster Recovery
Hadoop Backup and Disaster RecoveryHadoop Backup and Disaster Recovery
Hadoop Backup and Disaster Recovery
 
Introduction to Apache Kudu
Introduction to Apache KuduIntroduction to Apache Kudu
Introduction to Apache Kudu
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration Options
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
Reporting with Oracle Application Express (APEX)
Reporting with Oracle Application Express (APEX)Reporting with Oracle Application Express (APEX)
Reporting with Oracle Application Express (APEX)
 
Database-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable TablespacesDatabase-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable Tablespaces
 
LLAP: long-lived execution in Hive
LLAP: long-lived execution in HiveLLAP: long-lived execution in Hive
LLAP: long-lived execution in Hive
 
Presto Summit 2018 - 09 - Netflix Iceberg
Presto Summit 2018  - 09 - Netflix IcebergPresto Summit 2018  - 09 - Netflix Iceberg
Presto Summit 2018 - 09 - Netflix Iceberg
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsMulti-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
 
Parquet performance tuning: the missing guide
Parquet performance tuning: the missing guideParquet performance tuning: the missing guide
Parquet performance tuning: the missing guide
 
Hit Refresh with Oracle GoldenGate Microservices
Hit Refresh with Oracle GoldenGate MicroservicesHit Refresh with Oracle GoldenGate Microservices
Hit Refresh with Oracle GoldenGate Microservices
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Learn Apache Spark: A Comprehensive Guide
Learn Apache Spark: A Comprehensive GuideLearn Apache Spark: A Comprehensive Guide
Learn Apache Spark: A Comprehensive Guide
 
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real-time dashboards on data streams using Kafka, Druid, and Supe...
 
Hive: Loading Data
Hive: Loading DataHive: Loading Data
Hive: Loading Data
 
Elk with Openstack
Elk with OpenstackElk with Openstack
Elk with Openstack
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & InternalsApache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
 

En vedette

NuVitae Presentation
NuVitae PresentationNuVitae Presentation
NuVitae Presentation
nuvitae
 
Open Source + Big Data = Big Money
Open Source + Big Data = Big Money Open Source + Big Data = Big Money
Open Source + Big Data = Big Money
sogrady
 

En vedette (20)

Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapRHadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
 
Hadoop distributions - ecosystem
Hadoop distributions - ecosystemHadoop distributions - ecosystem
Hadoop distributions - ecosystem
 
Getting involved with Open Source at the ASF
Getting involved with Open Source at the ASFGetting involved with Open Source at the ASF
Getting involved with Open Source at the ASF
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 
Building Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemMLBuilding Custom
Machine Learning Algorithms
with Apache SystemML
Building Custom
Machine Learning Algorithms
with Apache SystemML
 
NuVitae Presentation
NuVitae PresentationNuVitae Presentation
NuVitae Presentation
 
Hadoop Primer
Hadoop PrimerHadoop Primer
Hadoop Primer
 
A Hadoop Primer
A Hadoop PrimerA Hadoop Primer
A Hadoop Primer
 
Open Source + Big Data = Big Money
Open Source + Big Data = Big Money Open Source + Big Data = Big Money
Open Source + Big Data = Big Money
 
Introduction to Hadoop : A bird eye's view | Abhishek Mukherjee
Introduction to Hadoop : A bird eye's view | Abhishek MukherjeeIntroduction to Hadoop : A bird eye's view | Abhishek Mukherjee
Introduction to Hadoop : A bird eye's view | Abhishek Mukherjee
 
Not only SQL - Database Choices
Not only SQL - Database ChoicesNot only SQL - Database Choices
Not only SQL - Database Choices
 
Sqoopコネクタを書いてみた (Hadoopソースコードリーディング第12回 発表資料)
Sqoopコネクタを書いてみた (Hadoopソースコードリーディング第12回 発表資料)Sqoopコネクタを書いてみた (Hadoopソースコードリーディング第12回 発表資料)
Sqoopコネクタを書いてみた (Hadoopソースコードリーディング第12回 発表資料)
 
Hadoop Einführung @codecentric
Hadoop Einführung @codecentricHadoop Einführung @codecentric
Hadoop Einführung @codecentric
 
ODPi is Now Open for Business: Here's What it Means
ODPi is Now Open for Business: Here's What it MeansODPi is Now Open for Business: Here's What it Means
ODPi is Now Open for Business: Here's What it Means
 
CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013CDH5最新情報 #cwt2013
CDH5最新情報 #cwt2013
 
CDHの歴史とCDH5新機能概要 #at_tokuben
CDHの歴史とCDH5新機能概要 #at_tokubenCDHの歴史とCDH5新機能概要 #at_tokuben
CDHの歴史とCDH5新機能概要 #at_tokuben
 
Bigdata.pptx
Bigdata.pptxBigdata.pptx
Bigdata.pptx
 
Limitless Data, Rapid Discovery, Powerful Insight: How to Connect Cloudera to...
Limitless Data, Rapid Discovery, Powerful Insight: How to Connect Cloudera to...Limitless Data, Rapid Discovery, Powerful Insight: How to Connect Cloudera to...
Limitless Data, Rapid Discovery, Powerful Insight: How to Connect Cloudera to...
 
Hadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouseHadoop Architecture Options for Existing Enterprise DataWarehouse
Hadoop Architecture Options for Existing Enterprise DataWarehouse
 
Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub
Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data HubCloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub
Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub
 

Similaire à Which Hadoop Distribution to use: Apache, Cloudera, MapR or HortonWorks?

HadoopDistributions
HadoopDistributionsHadoopDistributions
HadoopDistributions
Demet Aksoy
 

Similaire à Which Hadoop Distribution to use: Apache, Cloudera, MapR or HortonWorks? (20)

HadoopDistributions
HadoopDistributionsHadoopDistributions
HadoopDistributions
 
Hadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersHadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, Providers
 
Cloudera
ClouderaCloudera
Cloudera
 
Hadoop Innovation Summit 2014
Hadoop Innovation Summit 2014Hadoop Innovation Summit 2014
Hadoop Innovation Summit 2014
 
Apresentação Hadoop
Apresentação HadoopApresentação Hadoop
Apresentação Hadoop
 
Brief Introduction about Hadoop and Core Services.
Brief Introduction about Hadoop and Core Services.Brief Introduction about Hadoop and Core Services.
Brief Introduction about Hadoop and Core Services.
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoop
 
Cap 10 ingles
Cap  10 inglesCap  10 ingles
Cap 10 ingles
 
Cap 10 ingles
Cap  10 inglesCap  10 ingles
Cap 10 ingles
 
Big Data and Hadoop Basics
Big Data and Hadoop BasicsBig Data and Hadoop Basics
Big Data and Hadoop Basics
 
Hadoop in a Nutshell
Hadoop in a NutshellHadoop in a Nutshell
Hadoop in a Nutshell
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 
Big Data Hadoop Technology
Big Data Hadoop TechnologyBig Data Hadoop Technology
Big Data Hadoop Technology
 
Unit IV.pdf
Unit IV.pdfUnit IV.pdf
Unit IV.pdf
 
Big Data Training in Amritsar
Big Data Training in AmritsarBig Data Training in Amritsar
Big Data Training in Amritsar
 
Big Data Training in Mohali
Big Data Training in MohaliBig Data Training in Mohali
Big Data Training in Mohali
 
Big Data Training in Ludhiana
Big Data Training in LudhianaBig Data Training in Ludhiana
Big Data Training in Ludhiana
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Hadoop.pptx
Hadoop.pptxHadoop.pptx
Hadoop.pptx
 
Hadoop in action
Hadoop in actionHadoop in action
Hadoop in action
 

Plus de Edureka!

Plus de Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

Which Hadoop Distribution to use: Apache, Cloudera, MapR or HortonWorks?

  • 2. www.edureka.co/hadoop-admin What will you learn today?  Introduction to Apache Hadoop  Various Hadoop Distributions  Cloudera Hadoop Distribution  A closer look at Hortonworks and MapR  How to choose a Hadoop Distribution
  • 3. www.edureka.co/hadoop-admin Where it all started – Apache Hadoop The Apache Hadoop is an open source framework that allows distributed processing of large data sets across clusters of computers Hadoop introduced a new way to simplify the analysis of large data sets, and in a very short time reshaped the big data market and have become the synonym for big data
  • 4. www.edureka.co/hadoop-admin A closer look at Apache Hadoop Apache Hadoop includes following modules : Hadoop Distributed File System (HDFS): A distributed file system that provides access to application data Hadoop Common: The common utilities that support the other Hadoop modules Hadoop YARN: A framework for job scheduling and cluster resource management Hadoop MapReduce: A YARN-based system for parallel processing of large data sets
  • 6. www.edureka.co/hadoop-admin Popular Hadoop Distributions - Cloudera Founded by a group of engineers from Yahoo, Google and Facebook Cloudera ranks top in the big data vendors list for making Hadoop a reliable platform for business use since 2008
  • 7. www.edureka.co/hadoop-admin A closer look at Cloudera - CDH Cloudera Hadoop (CDH) - CDH includes the core elements of Hadoop along with additional components such as a user interface, security, and integration with a broad range of hardware and software
  • 8. www.edureka.co/hadoop-admin A closer look at Cloudera – Cloudera Manager Cloudera Manager makes administration of your enterprise data hub simple and straightforward, at any scale. With Cloudera Manager, you can easily deploy and centrally operate the complete Big Data stack
  • 9. www.edureka.co/hadoop-admin A closer look at Cloudera – Other Products Cloudera Express - Cloudera Express is a free download that combines CDH with Cloudera Manager, which provides robust cluster management capabilities like automated deployment, centralized administration, monitoring, and diagnostic tools Cloudera Enterprise - Cloudera Enterprise includes CDH with advanced system management and data management tools plus dedicated support from Cloudera Cloudera Director - Cloudera Director extends Cloudera's enterprise data hub architecture to the cloud, without compromising on security, management, and governance
  • 10. www.edureka.co/hadoop-admin Popular Hadoop Distributions - Hortonworks Founded in 2011, Hortonworks has quickly emerged as one of the leading vendors of Hadoop Hortonworks Data Platform
  • 11. www.edureka.co/hadoop-admin Hortonworks Sandbox Hortonworks Sandbox lets you get started with Hortonworks Data Platform (HDP) . You can run Hortonworks Sandbox either in the cloud or on your personal machine. Hortonworks Sandbox in the CloudHortonworks Sandbox on a VM
  • 12. www.edureka.co/hadoop-admin Popular Hadoop Distributions - MapR Compared to other Hadoop distributions e.g. Cloudera and Hortonworks, MapR takes a different approach as it uses its own proprietary file system MapRFS MapR Data Platform
  • 15. www.edureka.co/hadoop-admin Which one to choose ? Before selecting the Hadoop Distribution ask yourself which problems you are trying to solve and what all features you need
  • 16. www.edureka.co/hadoop-admin Choosing a Hadoop Distribution If you are looking for complete Hadoop stack with all features, then MapR is the way to go. But note that MapR enterprise edition is not free and takes a different approach than Apache Hadoop
  • 17. www.edureka.co/hadoop-admin Choosing a Hadoop Distribution If you are looking for complete Hadoop stack with all features, then MapR is the way to go. But note that MapR enterprise edition is not free and takes a different approach than Apache Hadoop Cloudera is based on 100% open source Apache Hadoop and has added its own proprietary tools Similar to MapR, Cloudera also provides both free and paid distribution with extra features and support
  • 18. www.edureka.co/hadoop-admin Choosing a Hadoop Distribution Social xyz Key point in Economical Key point in Social If you are looking for complete Hadoop stack with all features, then MapR is the way to go. But note that MapR enterprise edition is not free and takes a different approach than Apache Hadoop Cloudera is based on open source Apache Hadoop but has added its own proprietary tools. Similar to MapR, Cloudera also provides both free and paid distribution with extra features and support Hortonworks is the only commercial vendor to provide complete open source Hadoop. Hortonworks intentionally not developed proprietary software and uses open source softwares e.g. Ambari, Stinger and Solr
  • 19. www.edureka.co/hadoop-admin Why not try them all ? All Hadoop Distribution vendors provide free (community edition) version, its not a bad idea to try them all and get an idea how each one of them is different from others
  • 20. www.edureka.co/hadoop-admin Survey Your feedback is vital for us, be it a compliment, a suggestion or a complaint. It helps us to make your experience better! Please spare few minutes to take the survey after the webinar.
  • 21. www.edureka.co/hadoop-admin Thank You … Questions/Queries/Feedback Recording and presentation will be made available to you within 24 hours