SlideShare une entreprise Scribd logo
1  sur  32
1
Automating Cloud Cluster Deployment:
Beyond the Book
Bill Havanki, Cloudera
2
Agenda
Who am I?
Whirlwind introduction to the Cloud
How things take time
Automation techniques and tools
Cluster usage patterns from automation
Dragons
3
Bill Havanki
• Working for Cloudera ~4 years now
• Worked in US government
contracting for ~15 years before that
• Originally from New Jersey
• Wrote a book about Hadoop and
Cloud
Who am I?
4
What is Hadoop?
You probably already know. MOVING ON
5
What is the Cloud?
"The cloud" is a large set of computing resources made available by a cloud
provider for customers to use and control for general purposes.
instances
networking
storage
AWS
Google
MS Azure
starting
stopping
modifying
Hadoop!
6
What is Hadoop in the Cloud?
Hadoop in the Cloud means running Hadoop clusters on resources offered by a
cloud provider.
7
Instances Networking Storage
Key cloud resource concepts
c = a + b
8
Resource Creation
Easy! Through web consoles, API, or CLI!
https://www.youtube.com/watch?v=0OIBMBteXcI
9
Manually is fine for
trying things out,
but you want
automation for real
work.
Faster, Must Go Faster
cluster admins
analysts
10
Why does it take so loooooooong
Cloud provider operations take time
• launching the manager instance
• launching the worker instances
• patching / updating as necessary
11
Why does it take so loooooooong
Hadoop setup operations take time
• installing Java and Hadoop
• setting up cluster accounts
• configuring HDFS and YARN and …
• formatting HDFS
• establishing SSH tunnels or SOCKS proxies or …
• Hive? Installing Hive, setting up the metastore, …
• Spark? Installing Spark, …
12
Custom images
Bake in the common setup work for instances.
• OS updates
• Java and Hadoop installation
• account creation (and SSH key configuration)
• most of Hadoop configuration
13
Images can be
created manually,
but tools like Packer
automate the
process.
Automate image creation
14
{
"builders": [{
"type": "amazon-ebs",
"instance_type": "m4.large",
"region": "us-east-1",
"source_ami": "ami-6d1c2007",
...
}],
"provisioners": [
{
"type": "shell",
"script": "prov/packages.sh"
},
{
"type": "shell",
"script": "prov/users.sh"
},
...
]
}
Example Packer template JSON
15
AWS CLI
pip install awscli
Google Cloud SDK
yum install google-cloud-sdk
apt-get install google-cloud-sdk
Azure CLI
curl -L
https://aka.ms/InstallAzureCli |
bash
apt-get install azure-cli
Hands-on automation using provider CLIs and SDKs
aws ec2 run-instances
--image-id ami-6d1c2007
--instance-type m4.large
--key-name mypair
gcloud compute instances
create manager-1
--image-family centos-7
--machine-type
n1-highcpu-4
az vm create –n manager-1
–g my-resource-group
--image CentOS
--size Standard_D4_v3
16
Creating clusters
is one thing.
Growing and
shrinking is
another.
Just when you thought you were out …
17
Monitors show you the current conditions
Monitor cluster metrics like HDFS usage and YARN / compute load
18
Alerts activate when thresholds are crossed
Create alarm conditions for when metrics cross thresholds that indicate
overutilization … or underutilization
Alarms can send messages and trigger actions, like adding or removing instances
19
New patterns
Automation makes things faster, which leads to new ways of working.
20
Standardized cluster types
Build out images, each with the optimal mix of services for a type of work
• log / IoT ingest and ETL
• heavy back-end analysis
• live web or application support
Maintain images with the latest and greatest services and tools.
Version control them for repeatability and testability.
21
Automated cluster elasticity
Use those metrics and alarms to adjust cluster sizes based on demand.
• end of quarter
• long weekend
• holiday rush
Save money and remain responsive.
22
Transient clusters
Don’t run clusters unless they are in use. Create and destroy them based on
need.
• save cloud costs
• reduce administrative and troubleshooting burden
• reduce cluster variations
• shift between regions … or providers, more easily
• improve security by reducing sharing
23
Self-service clusters
Let cluster users run the automation.
• no organizational bottlenecks
• higher customer satisfaction
24
Ready to bang out some scripts?
You’ll want to use an automation framework of course.
25
BUT
26
It’s tougher than it looks
• How will you manage credentials?
• How will you track asynchronous tasks?
• What do you do when the provider has a problem?
• How will you retry tasks?
• Do you need to set up networks or security rules?
27
It’s tougher than it looks
• Do you need to span providers?
• Will you need a UI?
• Will others want to use the automation?
• How will you set up databases?
• High availability? TLS? Kerberos?
28
Think about existing cluster automation tools
Some are provider specific.
29
Think about existing cluster automation tools
Some are not provider specific.
30
Where to go from here?
• identify your own opportunities for automation
• make some images, by hand if you like
• try Packer
• learn your cloud provider’s API and CLI
• write some automation tools
• learn about and try existing cluster automation tools
31
mh2c.com/strata17
32
Thank you
Bill Havanki, Cloudera
bhavanki@cloudera.com

Contenu connexe

Tendances

Tendances (20)

Building Node.js applications for Microsoft Azure cloud
Building Node.js applications for Microsoft Azure cloudBuilding Node.js applications for Microsoft Azure cloud
Building Node.js applications for Microsoft Azure cloud
 
AWS re:Invent 2016 Recap: What Happened, What It Means
AWS re:Invent 2016 Recap: What Happened, What It MeansAWS re:Invent 2016 Recap: What Happened, What It Means
AWS re:Invent 2016 Recap: What Happened, What It Means
 
Optimizing AWS Economics
Optimizing AWS EconomicsOptimizing AWS Economics
Optimizing AWS Economics
 
無伺服器架構和Containers on AWS入門
無伺服器架構和Containers on AWS入門 無伺服器架構和Containers on AWS入門
無伺服器架構和Containers on AWS入門
 
AWS Meet-up Atlanta: AWS Economics
AWS Meet-up Atlanta: AWS EconomicsAWS Meet-up Atlanta: AWS Economics
AWS Meet-up Atlanta: AWS Economics
 
Webinar: Securely Configuring and Mining AWS CloudTrail
Webinar: Securely Configuring and Mining AWS CloudTrailWebinar: Securely Configuring and Mining AWS CloudTrail
Webinar: Securely Configuring and Mining AWS CloudTrail
 
SoCal NodeJS Meetup 20170215_aws_lambda
SoCal NodeJS Meetup 20170215_aws_lambdaSoCal NodeJS Meetup 20170215_aws_lambda
SoCal NodeJS Meetup 20170215_aws_lambda
 
Using Amazon CloudWatch Events, AWS Lambda and Spark Streaming to Process E...
Using Amazon CloudWatch Events,  AWS Lambda and Spark Streaming  to Process E...Using Amazon CloudWatch Events,  AWS Lambda and Spark Streaming  to Process E...
Using Amazon CloudWatch Events, AWS Lambda and Spark Streaming to Process E...
 
Architecting on The Cloud
Architecting on The CloudArchitecting on The Cloud
Architecting on The Cloud
 
AWS meetup「Apache Spark on EMR」
AWS meetup「Apache Spark on EMR」AWS meetup「Apache Spark on EMR」
AWS meetup「Apache Spark on EMR」
 
AWS re:Invent 2016: Chalk Talk: Succeeding at Infrastructure-as-Code (GPSCT312)
AWS re:Invent 2016: Chalk Talk: Succeeding at Infrastructure-as-Code (GPSCT312)AWS re:Invent 2016: Chalk Talk: Succeeding at Infrastructure-as-Code (GPSCT312)
AWS re:Invent 2016: Chalk Talk: Succeeding at Infrastructure-as-Code (GPSCT312)
 
Cloud computing - an insight into "how does it really work ?"
Cloud computing - an insight into "how does it really work ?" Cloud computing - an insight into "how does it really work ?"
Cloud computing - an insight into "how does it really work ?"
 
Azure and web sites hackaton deck
Azure and web sites hackaton deckAzure and web sites hackaton deck
Azure and web sites hackaton deck
 
Webinar - Big Data: Let's SMACK - Jorg Schad
Webinar - Big Data: Let's SMACK - Jorg SchadWebinar - Big Data: Let's SMACK - Jorg Schad
Webinar - Big Data: Let's SMACK - Jorg Schad
 
Introduction to Amazon Web Services (AWS)
Introduction to Amazon Web Services (AWS)Introduction to Amazon Web Services (AWS)
Introduction to Amazon Web Services (AWS)
 
10 minutes fun with Cloud API comparison
10 minutes fun with Cloud API comparison10 minutes fun with Cloud API comparison
10 minutes fun with Cloud API comparison
 
MongoDB at community engine
MongoDB at community engineMongoDB at community engine
MongoDB at community engine
 
Using Packer to Migrate XenServer Infrastructure to CloudStack
Using Packer to Migrate XenServer Infrastructure to CloudStackUsing Packer to Migrate XenServer Infrastructure to CloudStack
Using Packer to Migrate XenServer Infrastructure to CloudStack
 
Downtime is not an option - day 2 operations - Jörg Schad
Downtime is not an option - day 2 operations -  Jörg SchadDowntime is not an option - day 2 operations -  Jörg Schad
Downtime is not an option - day 2 operations - Jörg Schad
 
Deep Dive on Amazon S3 (May 2016)
Deep Dive on Amazon S3 (May 2016)Deep Dive on Amazon S3 (May 2016)
Deep Dive on Amazon S3 (May 2016)
 

Similaire à Automating Cloud Cluster Deployment: Beyond the Book

Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
Girish Kalamati
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
MvkZ
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
MvkZ
 
Real world cloud formation feb 2014 final
Real world cloud formation feb 2014 finalReal world cloud formation feb 2014 final
Real world cloud formation feb 2014 final
Howard Glynn
 

Similaire à Automating Cloud Cluster Deployment: Beyond the Book (20)

DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
 
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
 
Tips and Tricks for Running Container Workloads on AWS
Tips and Tricks for Running Container Workloads on AWSTips and Tricks for Running Container Workloads on AWS
Tips and Tricks for Running Container Workloads on AWS
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
 
7 Common Questions About a Cloud Management Platform
7 Common Questions About a Cloud Management Platform7 Common Questions About a Cloud Management Platform
7 Common Questions About a Cloud Management Platform
 
Practical Cloud & Workflow Orchestration
Practical Cloud & Workflow OrchestrationPractical Cloud & Workflow Orchestration
Practical Cloud & Workflow Orchestration
 
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
 
Cloud-Native DevOps: Simplifying application lifecycle management with AWS | ...
Cloud-Native DevOps: Simplifying application lifecycle management with AWS | ...Cloud-Native DevOps: Simplifying application lifecycle management with AWS | ...
Cloud-Native DevOps: Simplifying application lifecycle management with AWS | ...
 
Application Lifecycle Management on AWS
Application Lifecycle Management on AWSApplication Lifecycle Management on AWS
Application Lifecycle Management on AWS
 
Is Serverless The New Swiss Cheese? - AWS Seattle User Group
Is Serverless The New Swiss Cheese? - AWS Seattle User GroupIs Serverless The New Swiss Cheese? - AWS Seattle User Group
Is Serverless The New Swiss Cheese? - AWS Seattle User Group
 
Head in the clouds
Head in the cloudsHead in the clouds
Head in the clouds
 
02-WhyCloud.pdf
02-WhyCloud.pdf02-WhyCloud.pdf
02-WhyCloud.pdf
 
Power of Azure Devops
Power of Azure DevopsPower of Azure Devops
Power of Azure Devops
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
 
Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)Big datatraining.in devops-part2 (1)
Big datatraining.in devops-part2 (1)
 
Managing Your Cloud Assets
Managing Your Cloud AssetsManaging Your Cloud Assets
Managing Your Cloud Assets
 
Real world cloud formation feb 2014 final
Real world cloud formation feb 2014 finalReal world cloud formation feb 2014 final
Real world cloud formation feb 2014 final
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Automating Cloud Cluster Deployment: Beyond the Book

  • 1. 1 Automating Cloud Cluster Deployment: Beyond the Book Bill Havanki, Cloudera
  • 2. 2 Agenda Who am I? Whirlwind introduction to the Cloud How things take time Automation techniques and tools Cluster usage patterns from automation Dragons
  • 3. 3 Bill Havanki • Working for Cloudera ~4 years now • Worked in US government contracting for ~15 years before that • Originally from New Jersey • Wrote a book about Hadoop and Cloud Who am I?
  • 4. 4 What is Hadoop? You probably already know. MOVING ON
  • 5. 5 What is the Cloud? "The cloud" is a large set of computing resources made available by a cloud provider for customers to use and control for general purposes. instances networking storage AWS Google MS Azure starting stopping modifying Hadoop!
  • 6. 6 What is Hadoop in the Cloud? Hadoop in the Cloud means running Hadoop clusters on resources offered by a cloud provider.
  • 7. 7 Instances Networking Storage Key cloud resource concepts c = a + b
  • 8. 8 Resource Creation Easy! Through web consoles, API, or CLI! https://www.youtube.com/watch?v=0OIBMBteXcI
  • 9. 9 Manually is fine for trying things out, but you want automation for real work. Faster, Must Go Faster cluster admins analysts
  • 10. 10 Why does it take so loooooooong Cloud provider operations take time • launching the manager instance • launching the worker instances • patching / updating as necessary
  • 11. 11 Why does it take so loooooooong Hadoop setup operations take time • installing Java and Hadoop • setting up cluster accounts • configuring HDFS and YARN and … • formatting HDFS • establishing SSH tunnels or SOCKS proxies or … • Hive? Installing Hive, setting up the metastore, … • Spark? Installing Spark, …
  • 12. 12 Custom images Bake in the common setup work for instances. • OS updates • Java and Hadoop installation • account creation (and SSH key configuration) • most of Hadoop configuration
  • 13. 13 Images can be created manually, but tools like Packer automate the process. Automate image creation
  • 14. 14 { "builders": [{ "type": "amazon-ebs", "instance_type": "m4.large", "region": "us-east-1", "source_ami": "ami-6d1c2007", ... }], "provisioners": [ { "type": "shell", "script": "prov/packages.sh" }, { "type": "shell", "script": "prov/users.sh" }, ... ] } Example Packer template JSON
  • 15. 15 AWS CLI pip install awscli Google Cloud SDK yum install google-cloud-sdk apt-get install google-cloud-sdk Azure CLI curl -L https://aka.ms/InstallAzureCli | bash apt-get install azure-cli Hands-on automation using provider CLIs and SDKs aws ec2 run-instances --image-id ami-6d1c2007 --instance-type m4.large --key-name mypair gcloud compute instances create manager-1 --image-family centos-7 --machine-type n1-highcpu-4 az vm create –n manager-1 –g my-resource-group --image CentOS --size Standard_D4_v3
  • 16. 16 Creating clusters is one thing. Growing and shrinking is another. Just when you thought you were out …
  • 17. 17 Monitors show you the current conditions Monitor cluster metrics like HDFS usage and YARN / compute load
  • 18. 18 Alerts activate when thresholds are crossed Create alarm conditions for when metrics cross thresholds that indicate overutilization … or underutilization Alarms can send messages and trigger actions, like adding or removing instances
  • 19. 19 New patterns Automation makes things faster, which leads to new ways of working.
  • 20. 20 Standardized cluster types Build out images, each with the optimal mix of services for a type of work • log / IoT ingest and ETL • heavy back-end analysis • live web or application support Maintain images with the latest and greatest services and tools. Version control them for repeatability and testability.
  • 21. 21 Automated cluster elasticity Use those metrics and alarms to adjust cluster sizes based on demand. • end of quarter • long weekend • holiday rush Save money and remain responsive.
  • 22. 22 Transient clusters Don’t run clusters unless they are in use. Create and destroy them based on need. • save cloud costs • reduce administrative and troubleshooting burden • reduce cluster variations • shift between regions … or providers, more easily • improve security by reducing sharing
  • 23. 23 Self-service clusters Let cluster users run the automation. • no organizational bottlenecks • higher customer satisfaction
  • 24. 24 Ready to bang out some scripts? You’ll want to use an automation framework of course.
  • 26. 26 It’s tougher than it looks • How will you manage credentials? • How will you track asynchronous tasks? • What do you do when the provider has a problem? • How will you retry tasks? • Do you need to set up networks or security rules?
  • 27. 27 It’s tougher than it looks • Do you need to span providers? • Will you need a UI? • Will others want to use the automation? • How will you set up databases? • High availability? TLS? Kerberos?
  • 28. 28 Think about existing cluster automation tools Some are provider specific.
  • 29. 29 Think about existing cluster automation tools Some are not provider specific.
  • 30. 30 Where to go from here? • identify your own opportunities for automation • make some images, by hand if you like • try Packer • learn your cloud provider’s API and CLI • write some automation tools • learn about and try existing cluster automation tools
  • 32. 32 Thank you Bill Havanki, Cloudera bhavanki@cloudera.com