SlideShare une entreprise Scribd logo
1  sur  46
Harnessing Data-in-Motion with
Hortonworks DataFlow
HDF 2.0: Apache Ambari Integration
Ali Bajwa
Principal Partner
Solutions Engineer
Yolanda Davis
Senior Software
Engineer
Recap
HDF 2.0: Apache NiFi, Apache MiNiFi,
Apache Kafka, Apache Storm, Apache
Ambari
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Hortonworks DataFlow Manages Data-in-Motion
Core
InfrastructureSources
 Constrained
 High-latency
 Localized context
 Hybrid – cloud / on-premises
 Low-latency
 Global context
Regional
Infrastructure
Apache NiFi, Apache MiNiFi, Apache Kafka, Apache Storm are trademarks of the Apache Software Foundation
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
What is Apache Ambari?
Health Checks, Alerts
Stacks, Views
Lifecycle controls, Rolling
Restarts, Decommission/Re-
commission
Host Groups, Versioning,
Compare, Revert,
Recommendations, Security
Setup
Install Wizard (UI),
Blueprints (API)
Config
Management
ExtensibilityMonitoring
Service
Management
Cluster
Provisioning
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Apache Ambari Integration in HDF 2.0
• Deployment
• Configuration
• Debugging
• Monitoring
Ease of Deployment
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
Deploy Apache NiFi through:
– Apache Ambari install wizard or
– Operationalize via blueprints automation
Choose which nodes Apache NiFi can
be installed on
So either:
– Choose Apache NiFi hosts at time of
cluster install
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
– Or add Apache NiFi to existing host (post-cluster install) and then start it
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
–Can configure which user/group Apache NiFi runs as (when cluster installed or when
Apache NiFi is added to existing cluster)
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
 Can remove Apache NiFi service from Apache Ambari
– Note: this does not remove bits
 Can stop/start/configure Apache NiFi across cluster via:
– Apache Ambari UI or
– Apache Ambari REST API
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
 Can stop/start Apache NiFi on individual hosts via Apache Ambari UI and also via
REST API
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Deployment
 Quicklink for easy access to Apache
NiFi UI
– URL dynamically determined based on:
• whether SSL enabled and
• which ports user configured
Demo #1: Deploy HDF 2.0
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Demo #1: Deploy HDF 2.0
•Deploy HDF cluster using Apache Ambari Install Wizard:
http://docs.hortonworks.com/HDPDocuments/HDF2/HDF-2.0.0/bk_ambari-installation/content/ch03s01.html
–Install Apache Ambari
–Install HDF mpack
–Open Apache Ambari WebUI
–Run through Install Wizard
Ease of Configuration
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
 Configure once via Apache Ambari (instead of repeating config on each node)
 Support for most Apache NiFi config files (e.g. nifi.properties, bootstrap.conf etc)
 Assistance for admin during Apache NiFi configuration:
– Help text displayed on hover with property descriptions
– Checkboxes instead of true/false values
– User friendly labels and default values
– ‘Computed’ values can be automatically handled (e.g. node address)
 Get benefit of Apache Ambari config features:
– Config history:
• diff between versions, revert to older version etc
– Update configs via Apache Ambari REST API
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
•Manage host specific configs using Config groups feature
–e.g. to bump memory for Apache NiFi on one of the nodes, click Override
button:
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
 ...create new config group and add desired host to it
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
 ‘Common’ configs grouped together and exposed in first config section
(‘Advanced nifi-ambari-config’) to allow configuration of:
– Ports (nonSSL, SSL, protocol)
– Initial and max memory (Xms, Xmx)
– Repo default dir locations (provenance, content, db, flow file)
– ‘Internal’ dir location - contains files NiFi will write to
• ‘conf’ subdir for flow/tar.gz, authorizations.xml
• ‘state’ subdir for internal state
• Can change subdir names by prefixing the desired subdir name with
‘{nifi_internal_dir}/’
– Sensitive props key (used to encrypt sensitive property values)
– Zookeeper znode for nifi
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
 nifi.properties exposed under ‘Advanced nifi-properties’ as key/value pairs with
helptext
– values replaced by Apache Ambari shown surrounded by double braces e.g.{{ }} but
can be overridden by end user
– Properties can be updated or added via ‘Custom nifi-properties’ and will get
written to all nodes
– ‘Computed’ values handled:
• ‘Node address’ fields replaced with each host's own FQDN
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
 Other property based config files exposed as jinja templates
– Values replaced by Apache Ambari shown surrounded by double braces e.g. {{ }} but can be overridden by
end user
– Properties can be added/updated and will get written to all nodes
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Configuration
Other xml based config files also exposed as jinja templates
– Values replaced by Apache Ambari shown surrounded by double braces e.g. {{ }} but can be overridden
– Elements can be updated/added and will get written to all nodes
– Config files written out with
either 0400 or 0600 perm
• Why?
• Some property files contain
plaintext passwords
Ease of Debugging
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Debugging
 Logsearch integration for ease of visualizing/debugging Apache NiFi logs w/o connecting via SSH to
hosts (e.g nifi_app.log, nifi_user.log, nifi_bootstrap.log)
 By default, monitors FATAL,ERROR,WARN messages (for all HDF services)
29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Debugging
 Can view/drill into errors at component level or host level
 Can filter errors based on severity (fatal, error, warn, info, debug, trace)
 Can exclude ‘noisy’ messages to find the needle in the haystack
Ease of Monitoring
31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
Apache NiFi Service check: Used to ensure that Apache NiFi UI is up. Can also be
invoked via REST API for automation
32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
Apache NiFi Alert: Host level Apache Ambari alert when Apache NiFi process
goes down
– Can temporarily be disabled by turning on maintenance mode
33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
Alerts tab allows:
– Disabling and Configuring alerts (e.g. change interval)
34 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
 Can setup email or
SNMP notifications to
admins
35 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
 Apache Ambari Metrics (AMS) integration
– Service autocreates Apache Ambari reporting task, pointing to AMS collector host/port (autodetected)
36 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
This is achieved by setting up an initial flow.xml (configurable via Amari)...
Tip: can auto-deploy any Apache NiFi flow by pasting its flow.xml here
37 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
...and passing arguments via bootstrap.conf
 Ensures Apache NiFi autodetects updated location of AMS collector (in case it’s moved)
38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
 Results in AMS dashboard
for Apache NiFi metrics,
such as:
– Flowfiles sent/received
– MBs read/written
– JVM usage/thread counts
39 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
 Dashboard widgets can:
– be drilled into to see
results from last 1,2,4
hours, day, week etc
– export metrics data as csv
or JSON
40 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Ease of Monitoring
 Same metrics can be viewed in Grafana dashboard
– Access via quicklink under ‘Ambari metrics’ service
– Allows users to easily create custom dashboards for each component
41 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Summary - Apache NiFi in Apache Ambari
• HDF available via management pack (mpack) that defines required services, stack
definitions etc
• Wizard Driven Setup
• Provides access to Apache NiFi configurations, start/stop/restart
• Has a Reporting Task for Apache Ambari Metrics
• In Cluster Mode by Default
• Leverages Zookeeper on HDF for State Management (not Embedded Zookeeper)
• Some Stack Advisor configurations (Ranger integration, Apache NiFi Certificate
Authority)
• Audit logging via Apache Ambari Infra (Internal Solr)
* You cannot install Apache Ambari to manage an HDF cluster on a system where
HDP is already installed (requires a dedicated instance)
Demo #2: Explore HDF 2.0
43 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Demo #2: Explore HDF 2.0
 Explore HDF 2.0
– Apache NiFi configurations
– Configuration groups
– Metrics
– Logsearch integration
– Service checks
– Alerts
Try it out!
45 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Try it out!
 Deploy HDF cluster using either:
– a) Install Wizard: http://docs.hortonworks.com/HDPDocuments/HDF2/HDF-2.0.0/bk_ambari-installation/content/ch03s01.html
– b) Blueprints: https://community.hortonworks.com/articles/56849/automate-deployment-of-hdf-20-clusters-using-ambar.html
* Reminder: You cannot install Apache Ambari to manage an HDF cluster on a system where HDP is
already installed (requires a dedicated instance)
46 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions? https://community.hortonworks.com/
Hortonworks Community Connection:
Data Ingestion and Streaming

Contenu connexe

Tendances

Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghaiYifeng Jiang
 
ODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doHortonworks
 
Log Analytics Optimization
Log Analytics OptimizationLog Analytics Optimization
Log Analytics OptimizationHortonworks
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHortonworks
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks
 
Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016alanfgates
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto MeetupHortonworks
 
Apache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, ScaleApache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, ScaleHortonworks
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Hortonworks
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariHortonworks
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFiHortonworks
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Hortonworks
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseHortonworks
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseDataWorks Summit
 
Introduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability MeetupIntroduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability MeetupSaptak Sen
 
Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS Hortonworks
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiDataWorks Summit
 
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsDelivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsHortonworks
 

Tendances (20)

Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghai
 
ODPi 101: Who we are, What we do
ODPi 101: Who we are, What we doODPi 101: Who we are, What we do
ODPi 101: Who we are, What we do
 
Log Analytics Optimization
Log Analytics OptimizationLog Analytics Optimization
Log Analytics Optimization
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1
 
Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016Hive ACID Apache BigData 2016
Hive ACID Apache BigData 2016
 
Apache NiFi Crash Course Intro
Apache NiFi Crash Course IntroApache NiFi Crash Course Intro
Apache NiFi Crash Course Intro
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
 
Apache NiFi Toronto Meetup
Apache NiFi Toronto MeetupApache NiFi Toronto Meetup
Apache NiFi Toronto Meetup
 
Apache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, ScaleApache Hive 2.0; SQL, Speed, Scale
Apache Hive 2.0; SQL, Speed, Scale
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 
Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4 Apache Ambari - What's New in 2.4
Apache Ambari - What's New in 2.4
 
Double Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSenseDouble Your Hadoop Hardware Performance with SmartSense
Double Your Hadoop Hardware Performance with SmartSense
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Introduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability MeetupIntroduction to Apache NiFi - Seattle Scalability Meetup
Introduction to Apache NiFi - Seattle Scalability Meetup
 
Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS Hortonworks Data Cloud for AWS
Hortonworks Data Cloud for AWS
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsDelivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
 

En vedette

Apache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesApache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesIsheeta Sanghi
 
Real-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiReal-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiManish Gupta
 
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks
 
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNHortonworks
 
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDBHow to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDBHortonworks
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseHortonworks
 
Building large scale applications in yarn with apache twill
Building large scale applications in yarn with apache twillBuilding large scale applications in yarn with apache twill
Building large scale applications in yarn with apache twillHenry Saputra
 
Harnessing the power of YARN with Apache Twill
Harnessing the power of YARN with Apache TwillHarnessing the power of YARN with Apache Twill
Harnessing the power of YARN with Apache TwillTerence Yim
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambariHortonworks
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with DataSeth Familian
 
Apache Hive on ACID
Apache Hive on ACIDApache Hive on ACID
Apache Hive on ACIDHortonworks
 
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHortonworks
 
Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Apache Apex
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Cloudera, Inc.
 
node-crate: node.js and big data
 node-crate: node.js and big data node-crate: node.js and big data
node-crate: node.js and big dataStefan Thies
 
Integrating NiFi and Apex
Integrating NiFi and ApexIntegrating NiFi and Apex
Integrating NiFi and ApexBryan Bende
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarImpetus Technologies
 

En vedette (20)

Apache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesApache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup Slides
 
Real-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFiReal-Time Data Flows with Apache NiFi
Real-Time Data Flows with Apache NiFi
 
Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4Hortonworks Data In Motion Series Part 4
Hortonworks Data In Motion Series Part 4
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2
 
Integrating Apache Spark and NiFi for Data Lakes
Integrating Apache Spark and NiFi for Data LakesIntegrating Apache Spark and NiFi for Data Lakes
Integrating Apache Spark and NiFi for Data Lakes
 
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARN
 
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDBHow to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
Dataflow with Apache NiFi - Crash Course - HS16SJ
Dataflow with Apache NiFi - Crash Course - HS16SJDataflow with Apache NiFi - Crash Course - HS16SJ
Dataflow with Apache NiFi - Crash Course - HS16SJ
 
Building large scale applications in yarn with apache twill
Building large scale applications in yarn with apache twillBuilding large scale applications in yarn with apache twill
Building large scale applications in yarn with apache twill
 
Harnessing the power of YARN with Apache Twill
Harnessing the power of YARN with Apache TwillHarnessing the power of YARN with Apache Twill
Harnessing the power of YARN with Apache Twill
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambari
 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
 
Apache Hive on ACID
Apache Hive on ACIDApache Hive on ACID
Apache Hive on ACID
 
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
 
Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex Integrating Apache NiFi and Apache Apex
Integrating Apache NiFi and Apache Apex
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
 
node-crate: node.js and big data
 node-crate: node.js and big data node-crate: node.js and big data
node-crate: node.js and big data
 
Integrating NiFi and Apex
Integrating NiFi and ApexIntegrating NiFi and Apex
Integrating NiFi and Apex
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 

Similaire à Hortonworks Data In Motion Series Part 3 - HDF Ambari

Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariJayush Luniya
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Hortonworks
 
Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariJayush Luniya
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariDataWorks Summit
 
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureHortonworks
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiJoe Percivall
 
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop EcosystemApache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop EcosystemBryan Bende
 
Future of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveFuture of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveAldrin Piri
 
Apache Phoenix Query Server PhoenixCon2016
Apache Phoenix Query Server PhoenixCon2016Apache Phoenix Query Server PhoenixCon2016
Apache Phoenix Query Server PhoenixCon2016Josh Elser
 
Hadoop in adtech
Hadoop in adtechHadoop in adtech
Hadoop in adtechYuta Imai
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataWorks Summit
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiAldrin Piri
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsTimothy Spann
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityAccumulo Summit
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoopGergely Devenyi
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Clusterahortonworks
 

Similaire à Hortonworks Data In Motion Series Part 3 - HDF Ambari (20)

Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache AmbariManaging Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
 
Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1Apache Ambari - What's New in 2.1
Apache Ambari - What's New in 2.1
 
Manage Add-on Services in Apache Ambari
Manage Add-on Services in Apache AmbariManage Add-on Services in Apache Ambari
Manage Add-on Services in Apache Ambari
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache Ambari
 
Apache Ambari: Past, Present, Future
Apache Ambari: Past, Present, FutureApache Ambari: Past, Present, Future
Apache Ambari: Past, Present, Future
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
Apache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop EcosystemApache NiFi in the Hadoop Ecosystem
Apache NiFi in the Hadoop Ecosystem
 
Future of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveFuture of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep Dive
 
Row/Column- Level Security in SQL for Apache Spark
Row/Column- Level Security in SQL for Apache SparkRow/Column- Level Security in SQL for Apache Spark
Row/Column- Level Security in SQL for Apache Spark
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Apache Phoenix Query Server PhoenixCon2016
Apache Phoenix Query Server PhoenixCon2016Apache Phoenix Query Server PhoenixCon2016
Apache Phoenix Query Server PhoenixCon2016
 
Hadoop in adtech
Hadoop in adtechHadoop in adtech
Hadoop in adtech
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration Options
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & Community
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoop
 
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache AmbariStreamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Cluster
 

Plus de Hortonworks

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 

Plus de Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Dernier

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Dernier (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Hortonworks Data In Motion Series Part 3 - HDF Ambari

  • 1. Harnessing Data-in-Motion with Hortonworks DataFlow HDF 2.0: Apache Ambari Integration Ali Bajwa Principal Partner Solutions Engineer Yolanda Davis Senior Software Engineer
  • 2. Recap HDF 2.0: Apache NiFi, Apache MiNiFi, Apache Kafka, Apache Storm, Apache Ambari
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hortonworks DataFlow Manages Data-in-Motion Core InfrastructureSources  Constrained  High-latency  Localized context  Hybrid – cloud / on-premises  Low-latency  Global context Regional Infrastructure Apache NiFi, Apache MiNiFi, Apache Kafka, Apache Storm are trademarks of the Apache Software Foundation
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved What is Apache Ambari? Health Checks, Alerts Stacks, Views Lifecycle controls, Rolling Restarts, Decommission/Re- commission Host Groups, Versioning, Compare, Revert, Recommendations, Security Setup Install Wizard (UI), Blueprints (API) Config Management ExtensibilityMonitoring Service Management Cluster Provisioning
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Apache Ambari Integration in HDF 2.0 • Deployment • Configuration • Debugging • Monitoring
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment Deploy Apache NiFi through: – Apache Ambari install wizard or – Operationalize via blueprints automation Choose which nodes Apache NiFi can be installed on So either: – Choose Apache NiFi hosts at time of cluster install
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment – Or add Apache NiFi to existing host (post-cluster install) and then start it
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment –Can configure which user/group Apache NiFi runs as (when cluster installed or when Apache NiFi is added to existing cluster)
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment  Can remove Apache NiFi service from Apache Ambari – Note: this does not remove bits  Can stop/start/configure Apache NiFi across cluster via: – Apache Ambari UI or – Apache Ambari REST API
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment  Can stop/start Apache NiFi on individual hosts via Apache Ambari UI and also via REST API
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Deployment  Quicklink for easy access to Apache NiFi UI – URL dynamically determined based on: • whether SSL enabled and • which ports user configured
  • 13. Demo #1: Deploy HDF 2.0
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Demo #1: Deploy HDF 2.0 •Deploy HDF cluster using Apache Ambari Install Wizard: http://docs.hortonworks.com/HDPDocuments/HDF2/HDF-2.0.0/bk_ambari-installation/content/ch03s01.html –Install Apache Ambari –Install HDF mpack –Open Apache Ambari WebUI –Run through Install Wizard
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration  Configure once via Apache Ambari (instead of repeating config on each node)  Support for most Apache NiFi config files (e.g. nifi.properties, bootstrap.conf etc)  Assistance for admin during Apache NiFi configuration: – Help text displayed on hover with property descriptions – Checkboxes instead of true/false values – User friendly labels and default values – ‘Computed’ values can be automatically handled (e.g. node address)  Get benefit of Apache Ambari config features: – Config history: • diff between versions, revert to older version etc – Update configs via Apache Ambari REST API
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration •Manage host specific configs using Config groups feature –e.g. to bump memory for Apache NiFi on one of the nodes, click Override button:
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration  ...create new config group and add desired host to it
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration  ‘Common’ configs grouped together and exposed in first config section (‘Advanced nifi-ambari-config’) to allow configuration of: – Ports (nonSSL, SSL, protocol) – Initial and max memory (Xms, Xmx) – Repo default dir locations (provenance, content, db, flow file) – ‘Internal’ dir location - contains files NiFi will write to • ‘conf’ subdir for flow/tar.gz, authorizations.xml • ‘state’ subdir for internal state • Can change subdir names by prefixing the desired subdir name with ‘{nifi_internal_dir}/’ – Sensitive props key (used to encrypt sensitive property values) – Zookeeper znode for nifi
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration  nifi.properties exposed under ‘Advanced nifi-properties’ as key/value pairs with helptext – values replaced by Apache Ambari shown surrounded by double braces e.g.{{ }} but can be overridden by end user – Properties can be updated or added via ‘Custom nifi-properties’ and will get written to all nodes – ‘Computed’ values handled: • ‘Node address’ fields replaced with each host's own FQDN
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration  Other property based config files exposed as jinja templates – Values replaced by Apache Ambari shown surrounded by double braces e.g. {{ }} but can be overridden by end user – Properties can be added/updated and will get written to all nodes
  • 25. 25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 26. 26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Configuration Other xml based config files also exposed as jinja templates – Values replaced by Apache Ambari shown surrounded by double braces e.g. {{ }} but can be overridden – Elements can be updated/added and will get written to all nodes – Config files written out with either 0400 or 0600 perm • Why? • Some property files contain plaintext passwords
  • 28. 28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Debugging  Logsearch integration for ease of visualizing/debugging Apache NiFi logs w/o connecting via SSH to hosts (e.g nifi_app.log, nifi_user.log, nifi_bootstrap.log)  By default, monitors FATAL,ERROR,WARN messages (for all HDF services)
  • 29. 29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Debugging  Can view/drill into errors at component level or host level  Can filter errors based on severity (fatal, error, warn, info, debug, trace)  Can exclude ‘noisy’ messages to find the needle in the haystack
  • 31. 31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring Apache NiFi Service check: Used to ensure that Apache NiFi UI is up. Can also be invoked via REST API for automation
  • 32. 32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring Apache NiFi Alert: Host level Apache Ambari alert when Apache NiFi process goes down – Can temporarily be disabled by turning on maintenance mode
  • 33. 33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring Alerts tab allows: – Disabling and Configuring alerts (e.g. change interval)
  • 34. 34 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring  Can setup email or SNMP notifications to admins
  • 35. 35 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring  Apache Ambari Metrics (AMS) integration – Service autocreates Apache Ambari reporting task, pointing to AMS collector host/port (autodetected)
  • 36. 36 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring This is achieved by setting up an initial flow.xml (configurable via Amari)... Tip: can auto-deploy any Apache NiFi flow by pasting its flow.xml here
  • 37. 37 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring ...and passing arguments via bootstrap.conf  Ensures Apache NiFi autodetects updated location of AMS collector (in case it’s moved)
  • 38. 38 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring  Results in AMS dashboard for Apache NiFi metrics, such as: – Flowfiles sent/received – MBs read/written – JVM usage/thread counts
  • 39. 39 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring  Dashboard widgets can: – be drilled into to see results from last 1,2,4 hours, day, week etc – export metrics data as csv or JSON
  • 40. 40 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Ease of Monitoring  Same metrics can be viewed in Grafana dashboard – Access via quicklink under ‘Ambari metrics’ service – Allows users to easily create custom dashboards for each component
  • 41. 41 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Summary - Apache NiFi in Apache Ambari • HDF available via management pack (mpack) that defines required services, stack definitions etc • Wizard Driven Setup • Provides access to Apache NiFi configurations, start/stop/restart • Has a Reporting Task for Apache Ambari Metrics • In Cluster Mode by Default • Leverages Zookeeper on HDF for State Management (not Embedded Zookeeper) • Some Stack Advisor configurations (Ranger integration, Apache NiFi Certificate Authority) • Audit logging via Apache Ambari Infra (Internal Solr) * You cannot install Apache Ambari to manage an HDF cluster on a system where HDP is already installed (requires a dedicated instance)
  • 42. Demo #2: Explore HDF 2.0
  • 43. 43 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Demo #2: Explore HDF 2.0  Explore HDF 2.0 – Apache NiFi configurations – Configuration groups – Metrics – Logsearch integration – Service checks – Alerts
  • 45. 45 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Try it out!  Deploy HDF cluster using either: – a) Install Wizard: http://docs.hortonworks.com/HDPDocuments/HDF2/HDF-2.0.0/bk_ambari-installation/content/ch03s01.html – b) Blueprints: https://community.hortonworks.com/articles/56849/automate-deployment-of-hdf-20-clusters-using-ambar.html * Reminder: You cannot install Apache Ambari to manage an HDF cluster on a system where HDP is already installed (requires a dedicated instance)
  • 46. 46 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Questions? https://community.hortonworks.com/ Hortonworks Community Connection: Data Ingestion and Streaming