SlideShare une entreprise Scribd logo
1  sur  47
1 © Hortonworks Inc. 2011 – 2017. All Rights Reserved1 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Boost Apache Hadoop Hardware
Performance 2X with SmartSense
Paul Codding
Product Management Director
2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hortonworks Connected Data Platforms and Solutions
Data Services
Hortonworks Solutions
Enterprise Data
Warehouse Optimization
Cyber Security and
Threat Management
Internet of Things
and Streaming Analytics
Data Center
Hortonworks Data Suite
HDFHDP
Hortonworks
Connection
Cloud
Hortonworks Data Cloud
AWS HDInsight
Hortonworks Connection
Enablement Subscription
SmartSense™
Premier Operational Support
Educational Services
Professional Services
Community Connection
3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hortonworks Connection Ensures Success of Your Big Data Journey
4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
5 Reasons Why You Need More Than Just Open Source Software
The open source community doesn’t ensure everything works together and is
certified for the data center and cloud platforms you rely on. Hortonworks does.1
The unprecedented pace of open source innovation is both a benefit
and a challenge. Hortonworks can help; it’s what we do.2
Your enterprise needs more than just support for the latest open source
versions. Hortonworks supports and maintains the versions you rely on.3
The community doesn’t ensure that consistent security, governance, and
operations are built in. Hortonworks takes enterprise needs seriously.4
The community is not responsible for your success with open source
technologies and tools. Hortonworks success is built on your success.5
5 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Increase Performance
Prevent Issues
Accelerate Case Resolution
Understand Your Cluster
6 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Issue: YARN @ capacity, struggling to add
more use cases
Before SmartSense
 Could only run 500 jobs concurrently
 1100 jobs would be pending waiting for
resources at peak hours
7 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
After Applying only 3 SmartSense Recommendations
They can now run 1200 concurrent jobs
...with only 350 waiting jobs at peak hours
Issue: YARN @ capacity, struggling to add
more use cases
Before SmartSense
 Could only run 500 jobs concurrently
 1100 jobs would be pending waiting for
resources at peak hours
8 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
After Applying only 3 SmartSense Recommendations
They can now run 1200 concurrent jobs
...with only 350 waiting jobs at peak hours
Issue: YARN @ capacity, struggling to add
more use cases
Before SmartSense
 Could only run 500 jobs concurrently
 1100 jobs would be pending waiting for
resources at peak hours
With SmartSense = 2X Throughput Improvement
9 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hardware ($$$)
Hadoop Performance
• Type of CPU & Core Count
• Type & Amount of Memory
• Type & Number of Disks
10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hardware ($$$)
Operating System
Hadoop Performance
• Type of CPU & Core Count
• Type & Amount of Memory
• Type & Number of Disks
• Kernel Configuration
• Disk Mount/Tuning
• Network Configuration
11 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hardware ($$$)
Operating System
Hadoop Daemons
Hadoop Performance
• Type of CPU & Core Count
• Type & Amount of Memory
• Type & Number of Disks
• Kernel Configuration
• Disk Mount/Tuning
• Network Configuration
• YARN/MR/Tez Memory Configuration
• HDFS Configuration
• ZooKeeper Configuration
12 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hardware ($$$)
Operating System
Hadoop Daemons
Hadoop Performance
• Type of CPU & Core Count
• Type & Amount of Memory
• Type & Number of Disks
• Kernel Configuration
• Disk Mount/Tuning
• Network Configuration
• YARN/MR/Tez Memory Configuration
• HDFS Configuration
• ZooKeeper Configuration
13 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
What we do
A M B A R I
O P S S m a r t S e n s e
S E R V E R
B U N D L E
G AT E WAY
S m a r t S e n s e
A n a l y t i c s
S m a r t S e n s e
S E RV I C E
Collection Diagnostic
Information
Secure & Send Analyze & Recommend
14 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hardware ($$$)
Operating System
Hadoop Daemons
Hadoop Performance
• Type of CPU & Core Count
• Type & Amount of Memory
• Type & Number of Disks
• Kernel Configuration
• Disk Mount/Tuning
• Network Configuration
• YARN/MR/Tez Memory Configuration
• HDFS Configuration
• ZooKeeper Configuration
15 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Containers
Unit of allocation for memory and compute
Scheduler Configuration
 Minimum Container Size
 Maximum Container Size
YARN NodeManager Configuration
 How much memory can be used by YARN on each cluster node
YARN Cluster
1
2
3
4
5
6
7
64 GB
16 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Containers
Unit of allocation for memory and compute
Scheduler Configuration
 Minimum Container Size: 5GB
 Maximum Container Size: 35GB
YARN NodeManager Configuration
 How much memory can be used by YARN on each cluster node
– 35GB
YARN Cluster
1
2
3
4
5
6
7
64 GB
17 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
5
YARN ClusterApplication YARN Scheduler
I need 5 2GB
containers
Min: 5GB
Max: 35GB
201
2
3
4
5
6
7
18 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Application YARN Scheduler
I need 5 2GB
containers
Min: 5GB
Max: 35GB
5
5
5
5
5
YARN Cluster
5
201
2
3
4
5
6
7
19 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Application YARN Scheduler
I need 5 2GB
containers
Min: 5GB
Max: 35GB
Application is taking 25GB of resources when it only needs 10GB
5
5
5
5
5
YARN Cluster
5
201
2
3
4
5
6
7
20 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Gatorade: $2.50
Machine only takes Cash
EXACT CHANGE REQUIRED!
21 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Gatorade: $2.50
Machine only takes Cash
EXACT CHANGE REQUIRED!
Minimum Withdrawal: $20
Maximum Withdrawal: $500
22 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
YARN Memory Configuration
Containers
Unit of allocation for memory and compute
Scheduler Configuration
 Minimum Container Size: 2GB vs 5GB
 Maximum Container Size: 10GB vs 35GB
YARN NodeManager Configuration
 How much memory can be used by YARN on each cluster node
– 56GB vs 35GB
YARN Cluster
1
2
3
4
5
6
7
64 GB
23 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
After Applying only 3 SmartSense Recommendations
They can now run 1200 concurrent jobs
...with only 350 waiting jobs at peak hours
Issue: YARN @ capacity, struggling to add
more use cases
Before SmartSense
 Could only run 500 jobs concurrently
 1100 jobs would be pending waiting for
resources at peak hours
With SmartSense = 2X Throughput Improvement
24 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Increase Performance
Prevent Issues
Accelerate Case Resolution
Understand Your Cluster
25 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Support Cases by Type
Configuration
Environment
Education
No Response
Product Defect
Unreproducible
Use Case Advice
Works as Designed
Other
SmartSense Today – Prevent Issues
26 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Support Cases by Type
Configuration
Environment
Education
No Response
Product Defect
Unreproducible
Use Case Advice
Works as Designed
Other
SmartSense Today – Prevent Issues
27 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Support Cases by Type
Configuration
Environment
Education
No Response
Product Defect
Unreproducible
Use Case Advice
Works as Designed
Other
SmartSense Today – Prevent Issues
30% of support cases are configuration issues—
this is where SmartSense adds incredible value
28 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Prevent Issues
SmartSense analyzes Bundles for configuration issues – recommendations are produced and
made available for each cluster in the Hortonworks Support Portal
Recommendations prevent operational issues, and improve performance and overall cluster
throughput.
29 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Prevent Issues
SmartSense analyzes Bundles for configuration issues – recommendations are produced and
made available for each cluster in the Hortonworks Support Portal
Recommendations prevent operational issues, and improve performance and overall cluster
throughput.
30 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Increase Performance
Prevent Issues
Accelerate Case Resolution
Understand Your Cluster
31 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Accelerate Case Resolution
SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture
diagnostic information for specific services and hosts into a single “Bundle” that’s automatically
uploaded to Hortonworks Support.
Significantly reduces the back-and-forth nature of troubleshooting issues.
A M B A R I
O P S
H O R TO N WO R K S
S U P P O R T
S U P P O R T
C A S E
S m a r t S e n s e
S E R V E R
B U N D L E
G AT E WAY
32 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Accelerate Case Resolution
SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture
diagnostic information for specific services and hosts into a single “Bundle” that’s automatically
uploaded to Hortonworks Support.
Significantly reduces the back-and-forth nature of troubleshooting issues.
A M B A R I
O P S
H O R TO N WO R K S
S U P P O R T
S U P P O R T
C A S E
S m a r t S e n s e
S E R V E R
B U N D L E
G AT E WAY
33 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Accelerate Case Resolution
SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture
diagnostic information for specific services and hosts into a single “Bundle” that’s automatically
uploaded to Hortonworks Support.
Significantly reduces the back-and-forth nature of troubleshooting issues.
A M B A R I
O P S
H O R TO N WO R K S
S U P P O R T
S U P P O R T
C A S E
S m a r t S e n s e
S E R V E R
B U N D L E
G AT E WAY
34 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Accelerate Case Resolution
SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture
diagnostic information for specific services and hosts into a single “Bundle” that’s automatically
uploaded to Hortonworks Support.
Significantly reduces the back-and-forth nature of troubleshooting issues.
A M B A R I
O P S
H O R TO N WO R K S
S U P P O R T
S U P P O R T
C A S E
S m a r t S e n s e
S E R V E R
B U N D L E
G AT E WAY
35 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Data Capture Architecture
L A N D I N G Z O N E
S E RV E R
G AT E WAY
A M B A R I
A G E N T A G E N T
A G E N TA G E N TA G E N T
A G E N T
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
S m a r t S e n s e
A n a l y t i c s
36 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Data Capture Architecture
L A N D I N G Z O N E
S E RV E R
G AT E WAY
A M B A R I
A G E N T A G E N T
A G E N TA G E N TA G E N T
A G E N T
B U N D L E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
Agent to Server: TLS
Bundle: AES 256/RSA 1024
Landing Zone: SOC2 Certified
S m a r t S e n s e
A n a l y t i c s
37 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Data Capture Architecture
L A N D I N G Z O N E
S E RV E R
G AT E WAY
A M B A R I
A G E N T A G E N T
A G E N TA G E N TA G E N T
A G E N T
B U N D L E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
Agent to Server: TLS
Bundle: AES 256/RSA 1024
Server to Gateway: TLS
Landing Zone: SOC2 Certified
S m a r t S e n s e
A n a l y t i c s
38 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Data Capture Architecture
L A N D I N G Z O N E
S E RV E R
G AT E WAY
A M B A R I
A G E N T A G E N T
A G E N TA G E N TA G E N T
A G E N T
B U N D L E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
Agent to Server: TLS
Bundle: AES 256/RSA 1024
Server to Gateway: TLS
Landing Zone: SOC2 Certified
Gateway to Landing Zone: HTTPS (TLS 1.2) or SFTP (AES)
S m a r t S e n s e
A n a l y t i c s
39 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Increase Performance
Prevent Issues
Accelerate Case Resolution
Understand Your Cluster
40 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
“Who’s creating all of these small files in HDFS!?”
“What are my top 10 most active users, and longest running
jobs?”
“How much should I charge users for their cluster resource
use?”
SmartSense Today – Understand Your Cluster
41 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Understand Your Cluster
Chargeback Reporting
42 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Understand Your Cluster
Chargeback Reporting
HDFS Dashboards
43 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
SmartSense Today – Understand Your Cluster
Chargeback Reporting
HDFS Dashboards
YARN Dashboards
44 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Impact of Hortonworks SmartSense
0
200
400
600
800
1000
1200
1400
Without
SmartSense
With
SmartSense
Concurrent Jobs
B U N D L E
2X Throughput
Improvement
Address 30% of
Issues
Configuration Issues
Avoid 10% of
Sev1 Issues
Production Down
Single-Bundle
Case Resolution
25% of the Time
SmartSense
Troubleshooting Bundle
45 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Questions
46 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hortonworks Connected Data Platforms and Solutions
Data Services
Hortonworks Solutions
Enterprise Data
Warehouse Optimization
Cyber Security and
Threat Management
Internet of Things
and Streaming Analytics
Data Center
Hortonworks Data Suite
HDFHDP
Hortonworks
Connection
Cloud
Hortonworks Data Cloud
AWS HDInsight
Hortonworks Connection
Enablement Subscription
SmartSense™
Premier Operational Support
Educational Services
Professional Services
Community Connection
© DataWorks Summit and Hadoop Summit 2017. All Rights Reserved47
DataWorks Summit 2017
http://dataworkssummit.com

Contenu connexe

Tendances

Tendances (20)

Edw Optimization Solution
Edw Optimization Solution Edw Optimization Solution
Edw Optimization Solution
 
Enabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARNEnabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARN
 
What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
Row/Column- Level Security in SQL for Apache Spark
Row/Column- Level Security in SQL for Apache SparkRow/Column- Level Security in SQL for Apache Spark
Row/Column- Level Security in SQL for Apache Spark
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Hadoop and Spark – Perfect Together
Hadoop and Spark – Perfect TogetherHadoop and Spark – Perfect Together
Hadoop and Spark – Perfect Together
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the Cloud
 
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache AmbariStreamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
 
Hive present-and-feature-shanghai
Hive present-and-feature-shanghaiHive present-and-feature-shanghai
Hive present-and-feature-shanghai
 
Scaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC IsilonScaling real time streaming architectures with HDF and Dell EMC Isilon
Scaling real time streaming architectures with HDF and Dell EMC Isilon
 
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
Enabling Apache Zeppelin and Spark for Data Science in the EnterpriseEnabling Apache Zeppelin and Spark for Data Science in the Enterprise
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
 
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
 
Running Zeppelin in Enterprise
Running Zeppelin in EnterpriseRunning Zeppelin in Enterprise
Running Zeppelin in Enterprise
 
Authoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using SliderAuthoring and Hosting Applications on YARN using Slider
Authoring and Hosting Applications on YARN using Slider
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 

En vedette

En vedette (20)

Transformation Processing Smackdown; Spark vs Hive vs Pig
Transformation Processing Smackdown; Spark vs Hive vs PigTransformation Processing Smackdown; Spark vs Hive vs Pig
Transformation Processing Smackdown; Spark vs Hive vs Pig
 
Dive into Spark Streaming
Dive into Spark StreamingDive into Spark Streaming
Dive into Spark Streaming
 
Enabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical EnterpriseEnabling the Real Time Analytical Enterprise
Enabling the Real Time Analytical Enterprise
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
 
Squeezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile PhonesSqueezing Deep Learning Into Mobile Phones
Squeezing Deep Learning Into Mobile Phones
 
67 Weeks of TensorFlow
67 Weeks of TensorFlow67 Weeks of TensorFlow
67 Weeks of TensorFlow
 
SAS - Hortonworks: Creating the Omnichannel Experience in Retail webinar marc...
SAS - Hortonworks: Creating the Omnichannel Experience in Retail webinar marc...SAS - Hortonworks: Creating the Omnichannel Experience in Retail webinar marc...
SAS - Hortonworks: Creating the Omnichannel Experience in Retail webinar marc...
 
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data AnalyticsTop 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
 
A Multi Colored YARN
A Multi Colored YARNA Multi Colored YARN
A Multi Colored YARN
 
Hive - 1455: Cloud Storage
Hive - 1455: Cloud StorageHive - 1455: Cloud Storage
Hive - 1455: Cloud Storage
 
Dynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPDynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDP
 
How Universities Use Big Data to Transform Education
How Universities Use Big Data to Transform EducationHow Universities Use Big Data to Transform Education
How Universities Use Big Data to Transform Education
 
The path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial ServicesThe path to a Modern Data Architecture in Financial Services
The path to a Modern Data Architecture in Financial Services
 
Image Classification Done Simply using Keras and TensorFlow
Image Classification Done Simply using Keras and TensorFlow Image Classification Done Simply using Keras and TensorFlow
Image Classification Done Simply using Keras and TensorFlow
 
Architecting a Next Generation Data Platform
Architecting a Next Generation Data PlatformArchitecting a Next Generation Data Platform
Architecting a Next Generation Data Platform
 
Design in Tech Report 2017
Design in Tech Report 2017Design in Tech Report 2017
Design in Tech Report 2017
 
ELSA France "Teaching is us!"
ELSA France "Teaching is us!" ELSA France "Teaching is us!"
ELSA France "Teaching is us!"
 
Getting involved with Open Source at the ASF
Getting involved with Open Source at the ASFGetting involved with Open Source at the ASF
Getting involved with Open Source at the ASF
 
Hortonworks technical workshop operations with ambari
Hortonworks technical workshop   operations with ambariHortonworks technical workshop   operations with ambari
Hortonworks technical workshop operations with ambari
 
Everything You Wanted to Know About LinkedIn, but Were Too Embarrassed to Ask
Everything You Wanted to Know About LinkedIn, but Were Too Embarrassed to AskEverything You Wanted to Know About LinkedIn, but Were Too Embarrassed to Ask
Everything You Wanted to Know About LinkedIn, but Were Too Embarrassed to Ask
 

Similaire à Double Your Hadoop Hardware Performance with SmartSense

Similaire à Double Your Hadoop Hardware Performance with SmartSense (20)

Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
 
Powering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the CloudPowering Big Data Success On-Prem and in the Cloud
Powering Big Data Success On-Prem and in the Cloud
 
Hadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise HadoopHadoop Present - Open Enterprise Hadoop
Hadoop Present - Open Enterprise Hadoop
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.Oracle Database Appliance, ODA, X7-2 portfolio.
Oracle Database Appliance, ODA, X7-2 portfolio.
 
Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...
 
Hadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop SummitHadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop Summit
 
Enterprise Data Science at Scale
Enterprise Data Science at ScaleEnterprise Data Science at Scale
Enterprise Data Science at Scale
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 
Taming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop ManagementTaming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop Management
 
Enterprise data science at scale
Enterprise data science at scaleEnterprise data science at scale
Enterprise data science at scale
 
In-Place analytics with Unified Data Access
In-Place analytics with Unified Data AccessIn-Place analytics with Unified Data Access
In-Place analytics with Unified Data Access
 
Storm Demo Talk - Denver Apr 2015
Storm Demo Talk - Denver Apr 2015Storm Demo Talk - Denver Apr 2015
Storm Demo Talk - Denver Apr 2015
 
Enterprise Data Science at Scale @ Princeton, NJ 14-Nov-2017
Enterprise Data Science at Scale @ Princeton, NJ 14-Nov-2017Enterprise Data Science at Scale @ Princeton, NJ 14-Nov-2017
Enterprise Data Science at Scale @ Princeton, NJ 14-Nov-2017
 
Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 

Plus de Hortonworks

Plus de Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Double Your Hadoop Hardware Performance with SmartSense

  • 1. 1 © Hortonworks Inc. 2011 – 2017. All Rights Reserved1 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Boost Apache Hadoop Hardware Performance 2X with SmartSense Paul Codding Product Management Director
  • 2. 2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hortonworks Connected Data Platforms and Solutions Data Services Hortonworks Solutions Enterprise Data Warehouse Optimization Cyber Security and Threat Management Internet of Things and Streaming Analytics Data Center Hortonworks Data Suite HDFHDP Hortonworks Connection Cloud Hortonworks Data Cloud AWS HDInsight Hortonworks Connection Enablement Subscription SmartSense™ Premier Operational Support Educational Services Professional Services Community Connection
  • 3. 3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hortonworks Connection Ensures Success of Your Big Data Journey
  • 4. 4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved 5 Reasons Why You Need More Than Just Open Source Software The open source community doesn’t ensure everything works together and is certified for the data center and cloud platforms you rely on. Hortonworks does.1 The unprecedented pace of open source innovation is both a benefit and a challenge. Hortonworks can help; it’s what we do.2 Your enterprise needs more than just support for the latest open source versions. Hortonworks supports and maintains the versions you rely on.3 The community doesn’t ensure that consistent security, governance, and operations are built in. Hortonworks takes enterprise needs seriously.4 The community is not responsible for your success with open source technologies and tools. Hortonworks success is built on your success.5
  • 5. 5 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Increase Performance Prevent Issues Accelerate Case Resolution Understand Your Cluster
  • 6. 6 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Issue: YARN @ capacity, struggling to add more use cases Before SmartSense  Could only run 500 jobs concurrently  1100 jobs would be pending waiting for resources at peak hours
  • 7. 7 © Hortonworks Inc. 2011 – 2017. All Rights Reserved After Applying only 3 SmartSense Recommendations They can now run 1200 concurrent jobs ...with only 350 waiting jobs at peak hours Issue: YARN @ capacity, struggling to add more use cases Before SmartSense  Could only run 500 jobs concurrently  1100 jobs would be pending waiting for resources at peak hours
  • 8. 8 © Hortonworks Inc. 2011 – 2017. All Rights Reserved After Applying only 3 SmartSense Recommendations They can now run 1200 concurrent jobs ...with only 350 waiting jobs at peak hours Issue: YARN @ capacity, struggling to add more use cases Before SmartSense  Could only run 500 jobs concurrently  1100 jobs would be pending waiting for resources at peak hours With SmartSense = 2X Throughput Improvement
  • 9. 9 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hardware ($$$) Hadoop Performance • Type of CPU & Core Count • Type & Amount of Memory • Type & Number of Disks
  • 10. 10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hardware ($$$) Operating System Hadoop Performance • Type of CPU & Core Count • Type & Amount of Memory • Type & Number of Disks • Kernel Configuration • Disk Mount/Tuning • Network Configuration
  • 11. 11 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hardware ($$$) Operating System Hadoop Daemons Hadoop Performance • Type of CPU & Core Count • Type & Amount of Memory • Type & Number of Disks • Kernel Configuration • Disk Mount/Tuning • Network Configuration • YARN/MR/Tez Memory Configuration • HDFS Configuration • ZooKeeper Configuration
  • 12. 12 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hardware ($$$) Operating System Hadoop Daemons Hadoop Performance • Type of CPU & Core Count • Type & Amount of Memory • Type & Number of Disks • Kernel Configuration • Disk Mount/Tuning • Network Configuration • YARN/MR/Tez Memory Configuration • HDFS Configuration • ZooKeeper Configuration
  • 13. 13 © Hortonworks Inc. 2011 – 2017. All Rights Reserved What we do A M B A R I O P S S m a r t S e n s e S E R V E R B U N D L E G AT E WAY S m a r t S e n s e A n a l y t i c s S m a r t S e n s e S E RV I C E Collection Diagnostic Information Secure & Send Analyze & Recommend
  • 14. 14 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hardware ($$$) Operating System Hadoop Daemons Hadoop Performance • Type of CPU & Core Count • Type & Amount of Memory • Type & Number of Disks • Kernel Configuration • Disk Mount/Tuning • Network Configuration • YARN/MR/Tez Memory Configuration • HDFS Configuration • ZooKeeper Configuration
  • 15. 15 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Containers Unit of allocation for memory and compute Scheduler Configuration  Minimum Container Size  Maximum Container Size YARN NodeManager Configuration  How much memory can be used by YARN on each cluster node YARN Cluster 1 2 3 4 5 6 7 64 GB
  • 16. 16 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Containers Unit of allocation for memory and compute Scheduler Configuration  Minimum Container Size: 5GB  Maximum Container Size: 35GB YARN NodeManager Configuration  How much memory can be used by YARN on each cluster node – 35GB YARN Cluster 1 2 3 4 5 6 7 64 GB
  • 17. 17 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration 5 YARN ClusterApplication YARN Scheduler I need 5 2GB containers Min: 5GB Max: 35GB 201 2 3 4 5 6 7
  • 18. 18 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Application YARN Scheduler I need 5 2GB containers Min: 5GB Max: 35GB 5 5 5 5 5 YARN Cluster 5 201 2 3 4 5 6 7
  • 19. 19 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Application YARN Scheduler I need 5 2GB containers Min: 5GB Max: 35GB Application is taking 25GB of resources when it only needs 10GB 5 5 5 5 5 YARN Cluster 5 201 2 3 4 5 6 7
  • 20. 20 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Gatorade: $2.50 Machine only takes Cash EXACT CHANGE REQUIRED!
  • 21. 21 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Gatorade: $2.50 Machine only takes Cash EXACT CHANGE REQUIRED! Minimum Withdrawal: $20 Maximum Withdrawal: $500
  • 22. 22 © Hortonworks Inc. 2011 – 2017. All Rights Reserved YARN Memory Configuration Containers Unit of allocation for memory and compute Scheduler Configuration  Minimum Container Size: 2GB vs 5GB  Maximum Container Size: 10GB vs 35GB YARN NodeManager Configuration  How much memory can be used by YARN on each cluster node – 56GB vs 35GB YARN Cluster 1 2 3 4 5 6 7 64 GB
  • 23. 23 © Hortonworks Inc. 2011 – 2017. All Rights Reserved After Applying only 3 SmartSense Recommendations They can now run 1200 concurrent jobs ...with only 350 waiting jobs at peak hours Issue: YARN @ capacity, struggling to add more use cases Before SmartSense  Could only run 500 jobs concurrently  1100 jobs would be pending waiting for resources at peak hours With SmartSense = 2X Throughput Improvement
  • 24. 24 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Increase Performance Prevent Issues Accelerate Case Resolution Understand Your Cluster
  • 25. 25 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Support Cases by Type Configuration Environment Education No Response Product Defect Unreproducible Use Case Advice Works as Designed Other SmartSense Today – Prevent Issues
  • 26. 26 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Support Cases by Type Configuration Environment Education No Response Product Defect Unreproducible Use Case Advice Works as Designed Other SmartSense Today – Prevent Issues
  • 27. 27 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Support Cases by Type Configuration Environment Education No Response Product Defect Unreproducible Use Case Advice Works as Designed Other SmartSense Today – Prevent Issues 30% of support cases are configuration issues— this is where SmartSense adds incredible value
  • 28. 28 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Prevent Issues SmartSense analyzes Bundles for configuration issues – recommendations are produced and made available for each cluster in the Hortonworks Support Portal Recommendations prevent operational issues, and improve performance and overall cluster throughput.
  • 29. 29 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Prevent Issues SmartSense analyzes Bundles for configuration issues – recommendations are produced and made available for each cluster in the Hortonworks Support Portal Recommendations prevent operational issues, and improve performance and overall cluster throughput.
  • 30. 30 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Increase Performance Prevent Issues Accelerate Case Resolution Understand Your Cluster
  • 31. 31 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Accelerate Case Resolution SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture diagnostic information for specific services and hosts into a single “Bundle” that’s automatically uploaded to Hortonworks Support. Significantly reduces the back-and-forth nature of troubleshooting issues. A M B A R I O P S H O R TO N WO R K S S U P P O R T S U P P O R T C A S E S m a r t S e n s e S E R V E R B U N D L E G AT E WAY
  • 32. 32 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Accelerate Case Resolution SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture diagnostic information for specific services and hosts into a single “Bundle” that’s automatically uploaded to Hortonworks Support. Significantly reduces the back-and-forth nature of troubleshooting issues. A M B A R I O P S H O R TO N WO R K S S U P P O R T S U P P O R T C A S E S m a r t S e n s e S E R V E R B U N D L E G AT E WAY
  • 33. 33 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Accelerate Case Resolution SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture diagnostic information for specific services and hosts into a single “Bundle” that’s automatically uploaded to Hortonworks Support. Significantly reduces the back-and-forth nature of troubleshooting issues. A M B A R I O P S H O R TO N WO R K S S U P P O R T S U P P O R T C A S E S m a r t S e n s e S E R V E R B U N D L E G AT E WAY
  • 34. 34 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Accelerate Case Resolution SmartSense provides Hadoop Operators with an Ambari Integrated tool to quickly capture diagnostic information for specific services and hosts into a single “Bundle” that’s automatically uploaded to Hortonworks Support. Significantly reduces the back-and-forth nature of troubleshooting issues. A M B A R I O P S H O R TO N WO R K S S U P P O R T S U P P O R T C A S E S m a r t S e n s e S E R V E R B U N D L E G AT E WAY
  • 35. 35 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Data Capture Architecture L A N D I N G Z O N E S E RV E R G AT E WAY A M B A R I A G E N T A G E N T A G E N TA G E N TA G E N T A G E N T WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E S m a r t S e n s e A n a l y t i c s
  • 36. 36 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Data Capture Architecture L A N D I N G Z O N E S E RV E R G AT E WAY A M B A R I A G E N T A G E N T A G E N TA G E N TA G E N T A G E N T B U N D L E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E Agent to Server: TLS Bundle: AES 256/RSA 1024 Landing Zone: SOC2 Certified S m a r t S e n s e A n a l y t i c s
  • 37. 37 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Data Capture Architecture L A N D I N G Z O N E S E RV E R G AT E WAY A M B A R I A G E N T A G E N T A G E N TA G E N TA G E N T A G E N T B U N D L E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E Agent to Server: TLS Bundle: AES 256/RSA 1024 Server to Gateway: TLS Landing Zone: SOC2 Certified S m a r t S e n s e A n a l y t i c s
  • 38. 38 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Data Capture Architecture L A N D I N G Z O N E S E RV E R G AT E WAY A M B A R I A G E N T A G E N T A G E N TA G E N TA G E N T A G E N T B U N D L E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E Agent to Server: TLS Bundle: AES 256/RSA 1024 Server to Gateway: TLS Landing Zone: SOC2 Certified Gateway to Landing Zone: HTTPS (TLS 1.2) or SFTP (AES) S m a r t S e n s e A n a l y t i c s
  • 39. 39 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Increase Performance Prevent Issues Accelerate Case Resolution Understand Your Cluster
  • 40. 40 © Hortonworks Inc. 2011 – 2017. All Rights Reserved “Who’s creating all of these small files in HDFS!?” “What are my top 10 most active users, and longest running jobs?” “How much should I charge users for their cluster resource use?” SmartSense Today – Understand Your Cluster
  • 41. 41 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Understand Your Cluster Chargeback Reporting
  • 42. 42 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Understand Your Cluster Chargeback Reporting HDFS Dashboards
  • 43. 43 © Hortonworks Inc. 2011 – 2017. All Rights Reserved SmartSense Today – Understand Your Cluster Chargeback Reporting HDFS Dashboards YARN Dashboards
  • 44. 44 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Impact of Hortonworks SmartSense 0 200 400 600 800 1000 1200 1400 Without SmartSense With SmartSense Concurrent Jobs B U N D L E 2X Throughput Improvement Address 30% of Issues Configuration Issues Avoid 10% of Sev1 Issues Production Down Single-Bundle Case Resolution 25% of the Time SmartSense Troubleshooting Bundle
  • 45. 45 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Questions
  • 46. 46 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hortonworks Connected Data Platforms and Solutions Data Services Hortonworks Solutions Enterprise Data Warehouse Optimization Cyber Security and Threat Management Internet of Things and Streaming Analytics Data Center Hortonworks Data Suite HDFHDP Hortonworks Connection Cloud Hortonworks Data Cloud AWS HDInsight Hortonworks Connection Enablement Subscription SmartSense™ Premier Operational Support Educational Services Professional Services Community Connection
  • 47. © DataWorks Summit and Hadoop Summit 2017. All Rights Reserved47 DataWorks Summit 2017 http://dataworkssummit.com

Notes de l'éditeur

  1. TALK TRACK Hortonworks Powers the Future of Data: data-in-motion, data-at-rest, and Modern Data Applications. [NEXT SLIDE]
  2. TALK TRACK At the highest level, this is what Hortonworks offers our customers. As I describe the various components, remember that all of these individual pieces interact to make your Data Plane work. The bottom of the diagram shows our unique flexibility with regards to implementation: in the data center or the cloud. Up to the left, we also offer pre-configured solutions that deliver the tech in optimal bundles to meet specific business needs that we see most often: Enterprise Data Warehouse Optimization Cyber Security and Threat Management Internet of Things and Streaming Analytics Of course, if none of these pre-configured solutions is what you need, we do customized solutions. At the center of all of this is Hortonworks Connection, which includes: Subscription Services Hortonworks SmartSense, which uses Hadoop machine learning for proactive recommendations to tune your Hadoop cluster. Premier Operational Support Educational Services Professional Servies, and Hortonworks Community Connection. [NEXT SLIDE]
  3. Hortonworks enables the people, processes, technology to all work together for your maximum benefit.
  4. We know the platform, we ensure it works together, and we’ll be there when something goes wrong to fix it for you. We are your voice to the community and we are able to make the broad cross-cutting changes Enterprises require. You don’t get that without Hortonworks Support.
  5. Calculations are a moving target – what was best practice last year is not this year Awareness of new features
  6. [CLOSE]
  7. TALK TRACK At the highest level, this is what Hortonworks offers our customers. As I describe the various components, remember that all of these individual pieces interact to make your Data Plane work. The bottom of the diagram shows our unique flexibility with regards to implementation: in the data center or the cloud. Up to the left, we also offer pre-configured solutions that deliver the tech in optimal bundles to meet specific business needs that we see most often: Enterprise Data Warehouse Optimization Cyber Security and Threat Management Internet of Things and Streaming Analytics Of course, if none of these pre-configured solutions is what you need, we do customized solutions. At the center of all of this is Hortonworks Connection, which includes: Subscription Services Hortonworks SmartSense, which uses Hadoop machine learning for proactive recommendations to tune your Hadoop cluster. Premier Operational Support Educational Services Professional Servies, and Hortonworks Community Connection. [NEXT SLIDE]