SlideShare une entreprise Scribd logo
1  sur  22
Managing Your Hadoop Clusters
with Apache Ambari
September 19, 2013
Sumit Mohanty @ Hortonworks
Hello!
• Sumit Mohanty
–Committer / PPMC member, Apache Ambari
–Member of Technical Staff @ Hortonworks
–smohanty@hortonworks.com
Page 2
Today, We’ll Go Over…
• Intro
• Features
• Demo
• Architecture
• Futures
• Q & A
Page 3
Ambari: Enterprise Hadoop Operations
Ambari is an open source framework for provisioning,
managing and monitoring Apache Hadoop clusters
HADOOP
Storage & Process
at Scale
AMBARI
PROVISION
MANAGE
MONITOR
AMBARI
WEB
Page 4
Apache Ambari – 100% Open Source!
• Active community
• 50+ Contributors / 30+ Committers
• 170+ Ambari User Group Members
• Steady progress/release cycle
Page 5
Release
Version
Release Date JIRAs
Resolved
0.9.0 Sep 2012 402
1.2.0 Feb 2013 441
1.2.1 Mar 2013 134
1.2.2 Apr 2013 106
1.2.3 Jun 2013 515
1.2.4 Jul 2013 113
1.2.5 Sep 2013 350
1.4.x Oct 2013 580+
 Current Release
 Today’s Demo
Features Today
Provisioning: Simplified deployment across platforms
Managing: Consistent controls across the Stack
Monitoring: Visibility into key cluster metrics
- Single pane of glass for Hadoop & System status
- Pre-configured metrics & alerts
- Single point for cluster operations
- Customize w/o dealing with Hadoop complexities
- Advanced configurations and host controls
- Wizard-driven cluster install experience
- Deploy 10s,100s or 1000s of Hadoop servers
- Cloud, virtual and physical environments
Page 6
New Features
• Add slaves components to hosts
• Stop/Start All Services
• Enable Kerberos Security
• Enable NameNode HA
• Host Status Filtering
Page 7
New Features
• Provision, manage and monitoring Hadoop 2.0 Stack
• HDFS2, YARN, MR2
• HBase
–New Heatmaps specific to Region Servers
Page 8
New Features
• Ambari supports Postgres, Oracle
• Configure Hive and Oozie to use MySQL or Oracle
• Ambari + Redhat GlusterFS Integration
• Deploy / manage cluster with dist. file system other than HDFS
–HCFS: GlusterFS as first implementation
–Pluggability with other HCFS’s
Page 9
Demo
Page 10
Ambari System Architecture
Ambari Server
Host
Agent
gmond
Host
Agent
gmond
Ganglia
Server
Agent
Host
Agent
gmondgmetad
gmond
Ambari
Web
DB
REST
/clusters
Nagios
Server
Agent
Page 11
Ambari Architecture
Page 12
DB
Orchestrator SPI
REST API
Request Dispatcher
Ambari
Web
Ambari
Server
Metrics
AuthProvider
/clusters
/services
/hosts
/workflows/jobs
/users, …
User
Store
java
RDBMS
javascript
RDBM
S
AD/
LDAP
REST API for
integration
Auth
Provider
Cluster
Configurations
Web Client
100% REST
Ambari
Agents ganglia nagios
Alerts
Pluggable
Service
Providersfalcon
Data Mgmt
jmx
python
puppet
Ambari Futures
Extend core capabilities to
include the critical tasks
associated with provisioning
and operating Hadoop clusters.
Cluster Operations
Enable insight into job
performance and reduce the
burden on specialized Hadoop
skills and knowledge.
Job Diagnostics
Expose integration and
customization points so Hadoop
can interoperate with existing
operational tooling.
Extensible Platform
Make Hadoop easier to manage
Page 13
Beyond 1.4
• Ambari Python CLI
• Delete Hosts
• Add Services
• Improved Configuration Mgmt with host-level overrides
• Log Aggregation
• Rack awareness
• Manage Full Stack HA
• Customizable Metric Graphs and Heatmaps
• Capacity Scheduler Usage Reporting
• Job Diagnostics Improvements
• Ambari Server HA
• Finer-grain Ambari user roles
• Cluster Manifest (cluster takeover)
• New Services: Accumulo, Falcon, Flume, Knox, Storm, …
• HDFS Mirroring/Rebalance
• HBase Compaction
Page 14
Host Group Configuration Controls
• Set custom configuration properties at the host level
for one or more hosts
• Important for handing “heterogeneous” clusters
• AMBARI-1509 and AMBARI-1370
HEAPSIZE= 1024
HEAPSIZE= 2048
Page 15
Ambari + Accumulo Integration
• Using Ambari to deploy / manage cluster with
Accumulo
–Google Summer of Code project
–See AMBARI-1930
MapReduce
Hive
Distributed File System
HBasePig
Page 16
Job Diagnostics
• Enhanced swimlane visualizations
• See job DAG with task overlay
• See task scatter plot across jobs
Page 17
Stack Definitions
• Design Goals
–Ambari should be able to support choice of Hadoop stacks
–Ambari should enable adding new components to an existing stack
• Define which Services are available (services)
• Define where to get the packages (repos)
S S S SStack B
repos
services
S S S SStack A
repos
services
S S S S
Stack C
extends
Stack B
repos
services
S
S+
Page 18
Cluster Blueprints
• Perform “Headless Install”
• Export blueprint from cluster
• Boot wizard with blueprint
• Save wizard as blueprint
BLUEPRINT
<stack>
<host>
<service>
<component>
<config>
Ambari
Server
MANIFEST
<host>
<meta>
<configs>
<configs>
<config>
Blueprint
Page 19
APIs, APIs, and More APIs
• Consistent front-end REST API
• Service Provider plugin architecture
“ZERO TOUCH” INSTALLS
“LIGHTS OUT” WORKFLOWS
“BRING MY OWN” SCENARIOS
Blueprint ClusterAmbari
Alarm Decom Inform
Custom Provider
Page 20
Get Involved!
• Project Website
– http://incubator.apache.org/ambari/
• Check out Ambari
– Quick Start Guide
– https://cwiki.apache.org/confluence/display/AMBARI/Quick+Start+Guide
• Mailing Lists
– ambari-user@incubator.apache.org
– ambari-dev@incubator.apache.org
• IRC Chanel
– @apacheambari
• Upcoming Meet-up, Sep 25 (Wednesday)
– http://www.meetup.com/Apache-Ambari-User-Group/
Page 21
Thanks!
• Questions?
Page 22

Contenu connexe

Tendances

Apache CloudStack 4.2: A First Look
Apache CloudStack 4.2: A First LookApache CloudStack 4.2: A First Look
Apache CloudStack 4.2: A First Look
Shanker Balan
 
Apache Ambari - What's New in 1.2.4
Apache Ambari - What's New in 1.2.4Apache Ambari - What's New in 1.2.4
Apache Ambari - What's New in 1.2.4
Hortonworks
 
Bacd zenoss
Bacd zenossBacd zenoss
Bacd zenoss
ke4qqq
 

Tendances (20)

Google Cloud Platform, Compute Engine, and App Engine
Google Cloud Platform, Compute Engine, and App EngineGoogle Cloud Platform, Compute Engine, and App Engine
Google Cloud Platform, Compute Engine, and App Engine
 
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
 
Facebook Retrospective - Big data-world-europe-2012
Facebook Retrospective - Big data-world-europe-2012Facebook Retrospective - Big data-world-europe-2012
Facebook Retrospective - Big data-world-europe-2012
 
Apache CloudStack 4.2: A First Look
Apache CloudStack 4.2: A First LookApache CloudStack 4.2: A First Look
Apache CloudStack 4.2: A First Look
 
Running MongoDB on AWS
Running MongoDB on AWSRunning MongoDB on AWS
Running MongoDB on AWS
 
Apache Ambari - What's New in 1.2.4
Apache Ambari - What's New in 1.2.4Apache Ambari - What's New in 1.2.4
Apache Ambari - What's New in 1.2.4
 
Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2
 
Hive on spark berlin buzzwords
Hive on spark berlin buzzwordsHive on spark berlin buzzwords
Hive on spark berlin buzzwords
 
Scalability at GROU.PS
Scalability at GROU.PSScalability at GROU.PS
Scalability at GROU.PS
 
Bacd zenoss
Bacd zenossBacd zenoss
Bacd zenoss
 
AutoScaling and Drupal
AutoScaling and DrupalAutoScaling and Drupal
AutoScaling and Drupal
 
Openstack heat & How Autoscaling works
Openstack heat & How Autoscaling worksOpenstack heat & How Autoscaling works
Openstack heat & How Autoscaling works
 
Optimizing training on Apache MXNet
Optimizing training on Apache MXNetOptimizing training on Apache MXNet
Optimizing training on Apache MXNet
 
Optimizing costs with spot instances
Optimizing costs with spot instancesOptimizing costs with spot instances
Optimizing costs with spot instances
 
Whats New In GigaSpaces Xap 7.0
Whats New In GigaSpaces Xap 7.0Whats New In GigaSpaces Xap 7.0
Whats New In GigaSpaces Xap 7.0
 
We All Live in a Yellow (Serverless) Submarine
We All Live in a Yellow (Serverless) SubmarineWe All Live in a Yellow (Serverless) Submarine
We All Live in a Yellow (Serverless) Submarine
 
An Introduction to OpenStack Heat
An Introduction to OpenStack HeatAn Introduction to OpenStack Heat
An Introduction to OpenStack Heat
 
Ansible + Hadoop
Ansible + HadoopAnsible + Hadoop
Ansible + Hadoop
 
OpenStack Momentum
OpenStack  MomentumOpenStack  Momentum
OpenStack Momentum
 
Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013
 

En vedette

He student profiles in 2014 15 prospectus page-13
He student profiles in 2014 15 prospectus page-13He student profiles in 2014 15 prospectus page-13
He student profiles in 2014 15 prospectus page-13
bwcelearning
 
How to Build High Value Relationships Under Theory Z
How to Build High Value Relationships Under Theory ZHow to Build High Value Relationships Under Theory Z
How to Build High Value Relationships Under Theory Z
Anne Kohler
 
He student profiles in 2014 15 prospectus page-26
He student profiles in 2014 15 prospectus page-26He student profiles in 2014 15 prospectus page-26
He student profiles in 2014 15 prospectus page-26
bwcelearning
 
02 DesmineralizacióN 2009
02 DesmineralizacióN 200902 DesmineralizacióN 2009
02 DesmineralizacióN 2009
Miguel Neira
 

En vedette (20)

Sadigh Gallery Egyptian Art Extravaganza
Sadigh Gallery Egyptian Art ExtravaganzaSadigh Gallery Egyptian Art Extravaganza
Sadigh Gallery Egyptian Art Extravaganza
 
He student profiles in 2014 15 prospectus page-13
He student profiles in 2014 15 prospectus page-13He student profiles in 2014 15 prospectus page-13
He student profiles in 2014 15 prospectus page-13
 
Role of Computers in HRM _ Abhilasha_Karan_Lavanya_Sanchit
Role of Computers in HRM _ Abhilasha_Karan_Lavanya_SanchitRole of Computers in HRM _ Abhilasha_Karan_Lavanya_Sanchit
Role of Computers in HRM _ Abhilasha_Karan_Lavanya_Sanchit
 
How to Build High Value Relationships Under Theory Z
How to Build High Value Relationships Under Theory ZHow to Build High Value Relationships Under Theory Z
How to Build High Value Relationships Under Theory Z
 
He student profiles in 2014 15 prospectus page-26
He student profiles in 2014 15 prospectus page-26He student profiles in 2014 15 prospectus page-26
He student profiles in 2014 15 prospectus page-26
 
Sadigh Gallery Holiday Art Deals 2015
Sadigh Gallery Holiday Art Deals 2015Sadigh Gallery Holiday Art Deals 2015
Sadigh Gallery Holiday Art Deals 2015
 
MongoDB: An Introduction - July 2011
MongoDB:  An Introduction - July 2011MongoDB:  An Introduction - July 2011
MongoDB: An Introduction - July 2011
 
Gripshort
GripshortGripshort
Gripshort
 
Cronograma sierra 2015-2016.revfinal
Cronograma sierra 2015-2016.revfinalCronograma sierra 2015-2016.revfinal
Cronograma sierra 2015-2016.revfinal
 
Presentación Crowdfunding para proyectos de innovación cultural
Presentación Crowdfunding para proyectos de innovación culturalPresentación Crowdfunding para proyectos de innovación cultural
Presentación Crowdfunding para proyectos de innovación cultural
 
Explain explain
Explain explainExplain explain
Explain explain
 
Enfermedad inflamatoria pélvica - CICAT-SALUD
Enfermedad inflamatoria pélvica - CICAT-SALUDEnfermedad inflamatoria pélvica - CICAT-SALUD
Enfermedad inflamatoria pélvica - CICAT-SALUD
 
Crowdfunding para proyectos de innovación cultural. Karraskan/ColaBoraBora
Crowdfunding para proyectos de innovación cultural. Karraskan/ColaBoraBoraCrowdfunding para proyectos de innovación cultural. Karraskan/ColaBoraBora
Crowdfunding para proyectos de innovación cultural. Karraskan/ColaBoraBora
 
Формирование бюджета Николаевского сельского поселения на 2016 год
Формирование бюджета Николаевского сельского поселения на 2016 годФормирование бюджета Николаевского сельского поселения на 2016 год
Формирование бюджета Николаевского сельского поселения на 2016 год
 
Diploma in mechanical cadd certificate
Diploma in mechanical cadd certificateDiploma in mechanical cadd certificate
Diploma in mechanical cadd certificate
 
PEDIATRIA: Liquidos y electrolitos
PEDIATRIA: Liquidos y electrolitosPEDIATRIA: Liquidos y electrolitos
PEDIATRIA: Liquidos y electrolitos
 
02 DesmineralizacióN 2009
02 DesmineralizacióN 200902 DesmineralizacióN 2009
02 DesmineralizacióN 2009
 
pediatria
  pediatria  pediatria
pediatria
 
Introduction to PgBench
Introduction to PgBenchIntroduction to PgBench
Introduction to PgBench
 
Asociación Española de Crowdfunding
Asociación Española de CrowdfundingAsociación Española de Crowdfunding
Asociación Española de Crowdfunding
 

Similaire à Ambari hadoop-ops-meetup-2013-09-19.final

Ambari Meetup: Ambari Futures
Ambari Meetup: Ambari FuturesAmbari Meetup: Ambari Futures
Ambari Meetup: Ambari Futures
Hortonworks
 
Apache Ambari - What's New in 1.6.0
Apache Ambari - What's New in 1.6.0Apache Ambari - What's New in 1.6.0
Apache Ambari - What's New in 1.6.0
Hortonworks
 
Self-Service Provisioning and Hadoop Management with Apache Ambari
Self-Service Provisioning and  Hadoop Management with Apache AmbariSelf-Service Provisioning and  Hadoop Management with Apache Ambari
Self-Service Provisioning and Hadoop Management with Apache Ambari
DataWorks Summit
 
Managing 2000 Node Cluster with Ambari
Managing 2000 Node Cluster with AmbariManaging 2000 Node Cluster with Ambari
Managing 2000 Node Cluster with Ambari
DataWorks Summit
 

Similaire à Ambari hadoop-ops-meetup-2013-09-19.final (20)

Managing your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache AmbariManaging your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache Ambari
 
Managing your Hadoop Clusters with Ambari
Managing your Hadoop Clusters with AmbariManaging your Hadoop Clusters with Ambari
Managing your Hadoop Clusters with Ambari
 
Accumulo Summit 2014: Monitoring Apache Accumulo
Accumulo Summit 2014: Monitoring Apache AccumuloAccumulo Summit 2014: Monitoring Apache Accumulo
Accumulo Summit 2014: Monitoring Apache Accumulo
 
Ambari Meetup: Ambari Futures
Ambari Meetup: Ambari FuturesAmbari Meetup: Ambari Futures
Ambari Meetup: Ambari Futures
 
Ambari Meetup: What's New in Ambari
Ambari Meetup: What's New in AmbariAmbari Meetup: What's New in Ambari
Ambari Meetup: What's New in Ambari
 
Apache Ambari - What's New in 1.6.0
Apache Ambari - What's New in 1.6.0Apache Ambari - What's New in 1.6.0
Apache Ambari - What's New in 1.6.0
 
Self-Service Provisioning and Hadoop Management with Apache Ambari
Self-Service Provisioning and  Hadoop Management with Apache AmbariSelf-Service Provisioning and  Hadoop Management with Apache Ambari
Self-Service Provisioning and Hadoop Management with Apache Ambari
 
MySQL in the Hosted Cloud
MySQL in the Hosted CloudMySQL in the Hosted Cloud
MySQL in the Hosted Cloud
 
Managing 2000 Node Cluster with Ambari
Managing 2000 Node Cluster with AmbariManaging 2000 Node Cluster with Ambari
Managing 2000 Node Cluster with Ambari
 
Apache Ambari BOF - Overview - Hadoop Summit 2013
Apache Ambari BOF - Overview - Hadoop Summit 2013Apache Ambari BOF - Overview - Hadoop Summit 2013
Apache Ambari BOF - Overview - Hadoop Summit 2013
 
Simplified Cluster Operation and Troubleshooting
Simplified Cluster Operation and TroubleshootingSimplified Cluster Operation and Troubleshooting
Simplified Cluster Operation and Troubleshooting
 
Simplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & TroubleshootingSimplified Cluster Operation & Troubleshooting
Simplified Cluster Operation & Troubleshooting
 
CloudStack / Saltstack lightning talk at DevOps Amsterdam
CloudStack / Saltstack lightning talk at DevOps AmsterdamCloudStack / Saltstack lightning talk at DevOps Amsterdam
CloudStack / Saltstack lightning talk at DevOps Amsterdam
 
Apache Ambari: Simplified Hadoop Cluster Operation & Troubleshooting
Apache Ambari: Simplified Hadoop Cluster Operation & TroubleshootingApache Ambari: Simplified Hadoop Cluster Operation & Troubleshooting
Apache Ambari: Simplified Hadoop Cluster Operation & Troubleshooting
 
Hortonworks Technical Workshop: Apache Ambari
Hortonworks Technical Workshop:   Apache AmbariHortonworks Technical Workshop:   Apache Ambari
Hortonworks Technical Workshop: Apache Ambari
 
Databases in the Hosted Cloud
Databases in the Hosted CloudDatabases in the Hosted Cloud
Databases in the Hosted Cloud
 
Webinar - DreamObjects/Ceph Case Study
Webinar - DreamObjects/Ceph Case StudyWebinar - DreamObjects/Ceph Case Study
Webinar - DreamObjects/Ceph Case Study
 
2015 zData Inc. - Apache Ambari Overview
2015 zData Inc. - Apache Ambari Overview2015 zData Inc. - Apache Ambari Overview
2015 zData Inc. - Apache Ambari Overview
 
An Overview of Ambari
An Overview of AmbariAn Overview of Ambari
An Overview of Ambari
 
Hadoop and OpenStack
Hadoop and OpenStackHadoop and OpenStack
Hadoop and OpenStack
 

Plus de Chris Westin

Mysql proxy presentation_yahoo
Mysql proxy presentation_yahooMysql proxy presentation_yahoo
Mysql proxy presentation_yahoo
Chris Westin
 

Plus de Chris Westin (20)

Data torrent meetup-productioneng
Data torrent meetup-productionengData torrent meetup-productioneng
Data torrent meetup-productioneng
 
Cluster management and automation with cloudera manager
Cluster management and automation with cloudera managerCluster management and automation with cloudera manager
Cluster management and automation with cloudera manager
 
Building low latency java applications with ehcache
Building low latency java applications with ehcacheBuilding low latency java applications with ehcache
Building low latency java applications with ehcache
 
SDN/OpenFlow #lspe
SDN/OpenFlow #lspeSDN/OpenFlow #lspe
SDN/OpenFlow #lspe
 
cfengine3 at #lspe
cfengine3 at #lspecfengine3 at #lspe
cfengine3 at #lspe
 
mongodb-aggregation-may-2012
mongodb-aggregation-may-2012mongodb-aggregation-may-2012
mongodb-aggregation-may-2012
 
Nimbula lspe-2012-04-19
Nimbula lspe-2012-04-19Nimbula lspe-2012-04-19
Nimbula lspe-2012-04-19
 
mongodb-brief-intro-february-2012
mongodb-brief-intro-february-2012mongodb-brief-intro-february-2012
mongodb-brief-intro-february-2012
 
Stingray - Riverbed Technology
Stingray - Riverbed TechnologyStingray - Riverbed Technology
Stingray - Riverbed Technology
 
MongoDB's New Aggregation framework
MongoDB's New Aggregation frameworkMongoDB's New Aggregation framework
MongoDB's New Aggregation framework
 
Replication and replica sets
Replication and replica setsReplication and replica sets
Replication and replica sets
 
Architecting a Scale Out Cloud Storage Solution
Architecting a Scale Out Cloud Storage SolutionArchitecting a Scale Out Cloud Storage Solution
Architecting a Scale Out Cloud Storage Solution
 
FlashCache
FlashCacheFlashCache
FlashCache
 
Large Scale Cacti
Large Scale CactiLarge Scale Cacti
Large Scale Cacti
 
Practical Replication June-2011
Practical Replication June-2011Practical Replication June-2011
Practical Replication June-2011
 
MongoDB: An Introduction - june-2011
MongoDB:  An Introduction - june-2011MongoDB:  An Introduction - june-2011
MongoDB: An Introduction - june-2011
 
Ganglia Overview-v2
Ganglia Overview-v2Ganglia Overview-v2
Ganglia Overview-v2
 
MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011MongoDB Aggregation MongoSF May 2011
MongoDB Aggregation MongoSF May 2011
 
Mysql Proxy Presentation Yahoo
Mysql Proxy Presentation YahooMysql Proxy Presentation Yahoo
Mysql Proxy Presentation Yahoo
 
Mysql proxy presentation_yahoo
Mysql proxy presentation_yahooMysql proxy presentation_yahoo
Mysql proxy presentation_yahoo
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Dernier (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 

Ambari hadoop-ops-meetup-2013-09-19.final

  • 1. Managing Your Hadoop Clusters with Apache Ambari September 19, 2013 Sumit Mohanty @ Hortonworks
  • 2. Hello! • Sumit Mohanty –Committer / PPMC member, Apache Ambari –Member of Technical Staff @ Hortonworks –smohanty@hortonworks.com Page 2
  • 3. Today, We’ll Go Over… • Intro • Features • Demo • Architecture • Futures • Q & A Page 3
  • 4. Ambari: Enterprise Hadoop Operations Ambari is an open source framework for provisioning, managing and monitoring Apache Hadoop clusters HADOOP Storage & Process at Scale AMBARI PROVISION MANAGE MONITOR AMBARI WEB Page 4
  • 5. Apache Ambari – 100% Open Source! • Active community • 50+ Contributors / 30+ Committers • 170+ Ambari User Group Members • Steady progress/release cycle Page 5 Release Version Release Date JIRAs Resolved 0.9.0 Sep 2012 402 1.2.0 Feb 2013 441 1.2.1 Mar 2013 134 1.2.2 Apr 2013 106 1.2.3 Jun 2013 515 1.2.4 Jul 2013 113 1.2.5 Sep 2013 350 1.4.x Oct 2013 580+  Current Release  Today’s Demo
  • 6. Features Today Provisioning: Simplified deployment across platforms Managing: Consistent controls across the Stack Monitoring: Visibility into key cluster metrics - Single pane of glass for Hadoop & System status - Pre-configured metrics & alerts - Single point for cluster operations - Customize w/o dealing with Hadoop complexities - Advanced configurations and host controls - Wizard-driven cluster install experience - Deploy 10s,100s or 1000s of Hadoop servers - Cloud, virtual and physical environments Page 6
  • 7. New Features • Add slaves components to hosts • Stop/Start All Services • Enable Kerberos Security • Enable NameNode HA • Host Status Filtering Page 7
  • 8. New Features • Provision, manage and monitoring Hadoop 2.0 Stack • HDFS2, YARN, MR2 • HBase –New Heatmaps specific to Region Servers Page 8
  • 9. New Features • Ambari supports Postgres, Oracle • Configure Hive and Oozie to use MySQL or Oracle • Ambari + Redhat GlusterFS Integration • Deploy / manage cluster with dist. file system other than HDFS –HCFS: GlusterFS as first implementation –Pluggability with other HCFS’s Page 9
  • 11. Ambari System Architecture Ambari Server Host Agent gmond Host Agent gmond Ganglia Server Agent Host Agent gmondgmetad gmond Ambari Web DB REST /clusters Nagios Server Agent Page 11
  • 12. Ambari Architecture Page 12 DB Orchestrator SPI REST API Request Dispatcher Ambari Web Ambari Server Metrics AuthProvider /clusters /services /hosts /workflows/jobs /users, … User Store java RDBMS javascript RDBM S AD/ LDAP REST API for integration Auth Provider Cluster Configurations Web Client 100% REST Ambari Agents ganglia nagios Alerts Pluggable Service Providersfalcon Data Mgmt jmx python puppet
  • 13. Ambari Futures Extend core capabilities to include the critical tasks associated with provisioning and operating Hadoop clusters. Cluster Operations Enable insight into job performance and reduce the burden on specialized Hadoop skills and knowledge. Job Diagnostics Expose integration and customization points so Hadoop can interoperate with existing operational tooling. Extensible Platform Make Hadoop easier to manage Page 13
  • 14. Beyond 1.4 • Ambari Python CLI • Delete Hosts • Add Services • Improved Configuration Mgmt with host-level overrides • Log Aggregation • Rack awareness • Manage Full Stack HA • Customizable Metric Graphs and Heatmaps • Capacity Scheduler Usage Reporting • Job Diagnostics Improvements • Ambari Server HA • Finer-grain Ambari user roles • Cluster Manifest (cluster takeover) • New Services: Accumulo, Falcon, Flume, Knox, Storm, … • HDFS Mirroring/Rebalance • HBase Compaction Page 14
  • 15. Host Group Configuration Controls • Set custom configuration properties at the host level for one or more hosts • Important for handing “heterogeneous” clusters • AMBARI-1509 and AMBARI-1370 HEAPSIZE= 1024 HEAPSIZE= 2048 Page 15
  • 16. Ambari + Accumulo Integration • Using Ambari to deploy / manage cluster with Accumulo –Google Summer of Code project –See AMBARI-1930 MapReduce Hive Distributed File System HBasePig Page 16
  • 17. Job Diagnostics • Enhanced swimlane visualizations • See job DAG with task overlay • See task scatter plot across jobs Page 17
  • 18. Stack Definitions • Design Goals –Ambari should be able to support choice of Hadoop stacks –Ambari should enable adding new components to an existing stack • Define which Services are available (services) • Define where to get the packages (repos) S S S SStack B repos services S S S SStack A repos services S S S S Stack C extends Stack B repos services S S+ Page 18
  • 19. Cluster Blueprints • Perform “Headless Install” • Export blueprint from cluster • Boot wizard with blueprint • Save wizard as blueprint BLUEPRINT <stack> <host> <service> <component> <config> Ambari Server MANIFEST <host> <meta> <configs> <configs> <config> Blueprint Page 19
  • 20. APIs, APIs, and More APIs • Consistent front-end REST API • Service Provider plugin architecture “ZERO TOUCH” INSTALLS “LIGHTS OUT” WORKFLOWS “BRING MY OWN” SCENARIOS Blueprint ClusterAmbari Alarm Decom Inform Custom Provider Page 20
  • 21. Get Involved! • Project Website – http://incubator.apache.org/ambari/ • Check out Ambari – Quick Start Guide – https://cwiki.apache.org/confluence/display/AMBARI/Quick+Start+Guide • Mailing Lists – ambari-user@incubator.apache.org – ambari-dev@incubator.apache.org • IRC Chanel – @apacheambari • Upcoming Meet-up, Sep 25 (Wednesday) – http://www.meetup.com/Apache-Ambari-User-Group/ Page 21