SlideShare a Scribd company logo
1 of 35
Download to read offline
AWS Well-Architected
Framework
Operational Excellence Pillar
I’m Jonathan LaCour, CTO of Reliam
● Technologist
● Programmer
● Cloud Strategist
● Bourbon Junkie
Nice to meet you!
Hello there
Reliam is an AWS certified consulting and managed
services provider based in Southern California.
Serving customers globally from startups to enterprise, Reliam’s certified solutions
architects and engineers incorporate AWS best practices including the Well
Architected Framework to advise companies on workload migration, architecture
and optimization to drive rapid adoption of AWS services and high customer
satisfaction.
Reliam’s obsessive customer focus, coupled with operational excellence, expert
technical solutions, industry-leading SLAs, and proven strategies & best practices,
delivers on our promise to each customer to ensure their continued success
throughout the entire lifecycle of their technology journey.
Question
How is your company currently using AWS?
Agenda
● Introduction to Well-Architected
● Design Principles for Operational Excellence
● Areas of Operational Excellence
○ Preparation
○ Operation
○ Evolution
● Q&A
The Well-Architected Framework
AWS Well
Architected
The Five Pillars
● Operational Excellence
● Security
● Reliability
● Performance Efficiency
● Cost Optimization
AWS Well
Architected
Benefits
● Build and deploy faster
● Lower or mitigate risk
● Make informed decisions
● Learn AWS best practices
Reliam’s Insights from Well-Architected Reviews
Review
Free
Remediation
$10,000 for 40 hours
$5,000
AWS Service Credit
Reliam Accelerated Well-Architected Review Package
Operational Excellence Pillar
Operational
Excellence
Design Principles
Perform operations as code
Annotated documentation
Frequent, small, reversible changes
Refine ops procedures frequently
Anticipate failure
Learn from all operational failures
Perform operations as code
Software Engineering Practices
● Automated testing
● CI/CD pipelines
● Version control
● Code review and standards
Operations as Code
● Everything is software
● Bring software practices to
operations and infrastructure
● De-risk, ensure consistency
Question
Has your organization adopted Infrastructure as Code?
Annotated documentation
On-Prem Environments
● Manual documentation
● Prone to error
● Docs drift from reality
● Operational agility suffers
Cloud Environments
● Automated documentation
● Useful to humans & systems
● Docs reflect reality
● Operational agility improves
Frequent, small, reversible changes
Traditional Approach
● Software releases are large,
high-risk bundles of changes
● Agile practitioners bundle
many “sprints” into a release
● Systems are monolithic
Continuous Approach
● Change is the new normal
● Systems are composed of
small, focused components
● All changes are designed to
be quickly reversible
Question
What is your typical release cadence?
Refine operations procedures frequently
Software Engineering Procedures
● Regular cadence of
“retrospective” meetings
● Improvements progressively
integrated
Operations Procedures
● Regular cadence of “game
days” and associated retros
● Improvements progressively
integrated
Question
Does your organization have regular “Game Days?”
Anticipate failure
Typical Operations Teams
● Reactive approach to failure
● Post-mortem exercises after
failures, if at all
● Problems usually discovered
in production
Operationally Excellent Teams
● Proactive approach to failure
● Pre-mortem exercises
● Test, validate, & measure
scenarios in Game Days
● Problems usually anticipated
Question
Do you regularly schedule “pre-mortem” meetings?
Learn from all operational failures
Evolution Requires Sharing
● Drive change by sharing
● Involve product, marketing,
and finance in improvements
● Establish a culture of
continuous evolution
Operational
Excellence
Focus Areas
Preparation
Operation
Evolution
Focus: Preparation
Operational Priorities
Successful operations teams are enlightened operations teams.
● Experts on their workloads
● Aware of shared business goals
● Clearly understand their role
● Grasp of regulatory and compliance constraints
Proper prioritization without context is impossible.
Question
Do you feel that your operations teams are enlightened?
Focus: Preparation
Design for Operations
Intentionally consider deployment, updates, and operations by design.
● Everything as code
● Structured CI/CD pipelines
● Shared libraries of common tools and templates
● Obsessive observability – data, data, data!
Empower yourself to act quickly during incidents.
Focus: Preparation
Operational Readiness
Technology is important, but so are process and procedure.
● Accurate documentation – checklists, runbooks, and playbooks
● Trained, right-sized team… no shortcuts!
● Governance to control readiness
Codify process and procedure with AWS: resource tags, event triggers, AWS
Systems Manager Run Command, Lambda, CloudWatch Events, etc.
Question
Does your operations team have documented procedures?
Focus: Operation
Understanding Operational Health
Operational excellence requires immediately available, accurate insight into
key metrics that are aligned with business requirements.
● Performance, cost, availability, latency, etc.
● Collect and aggregate data
● Implement dashboards and alerting
AWS provides CloudWatch, Amazon ElasticSearch with Kibana, and many
other tools to enable your understanding of operational health.
Focus: Operation
Responding to Events
Armed with key metrics, alerting, and dashboards, your team can respond to
events with confidence.
● Consider business impact when prioritizing
● Script responses through operations as code, leveraging data
● Implement automated rollbacks to known good versions
● Embrace AWS auto-scaling
After navigating an incident, always perform root cause analysis and a full
post-mortem.
Focus: Evolution
Learning from Experience
The greatest indicator of success for ops teams? A passion for learning.
● Every incident is an opportunity
● Encourage ops teams to analyze, experiment, and improve
● AWS provides extensive platform to enable
Be sure to pull in all parts of the business to add differing points of view,
surfacing new opportunities for improvement.
Question
How does your organization view operational events?
Focus: Evolution
Share Learnings
Many companies have multiple product and operations teams. Share your
lessons broadly to drive a culture of improvement.
● Leverage AWS platform for sharing best practices, such as
CloudFormation templates, Chef Cookbooks, and Lambda functions.
● Use AWS IAM to define permissions for controlled access.
Evolution isn’t a localized process.
Summary
AWS WAF is a powerful collection of best practices
WAF Program Partners like Reliam can help accelerate your journey
Operational Excellence Pillar
● Design Principles
● Focus Areas
○ Preparation
○ Operation
○ Evolution
Q&A
Thanks for Attending!

More Related Content

What's hot

What's hot (20)

Introduction to AWS Security
Introduction to AWS SecurityIntroduction to AWS Security
Introduction to AWS Security
 
Cloud Migration Workshop
Cloud Migration WorkshopCloud Migration Workshop
Cloud Migration Workshop
 
Introduction to AWS Cost Management
Introduction to AWS Cost ManagementIntroduction to AWS Cost Management
Introduction to AWS Cost Management
 
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
 
Deploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control TowerDeploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control Tower
 
Azure Migrate
Azure MigrateAzure Migrate
Azure Migrate
 
Simplify & Standardise your migration to AWS with a Migration Landing Zone
Simplify & Standardise your migration to AWS with a Migration Landing ZoneSimplify & Standardise your migration to AWS with a Migration Landing Zone
Simplify & Standardise your migration to AWS with a Migration Landing Zone
 
AWS Business Essentials
AWS Business EssentialsAWS Business Essentials
AWS Business Essentials
 
Immersion Day - Well Architected Workshop - June 2019
Immersion Day - Well Architected Workshop - June 2019Immersion Day - Well Architected Workshop - June 2019
Immersion Day - Well Architected Workshop - June 2019
 
Introduction to AWS Organizations
Introduction to AWS OrganizationsIntroduction to AWS Organizations
Introduction to AWS Organizations
 
Monitoring in Azure
Monitoring in AzureMonitoring in Azure
Monitoring in Azure
 
Well-Architected Bootcamp
Well-Architected BootcampWell-Architected Bootcamp
Well-Architected Bootcamp
 
Fundamentals of AWS Security
Fundamentals of AWS SecurityFundamentals of AWS Security
Fundamentals of AWS Security
 
AWS Cloud Cost Optimization
AWS Cloud Cost OptimizationAWS Cloud Cost Optimization
AWS Cloud Cost Optimization
 
Azure migration
Azure migrationAzure migration
Azure migration
 
Building the business case for AWS
Building the business case for AWSBuilding the business case for AWS
Building the business case for AWS
 
So you want to be Well-Architected?
So you want to be Well-Architected?So you want to be Well-Architected?
So you want to be Well-Architected?
 
Simplify & Standardise Your Migration to AWS with a Migration Landing Zone
Simplify & Standardise Your Migration to AWS with a Migration Landing ZoneSimplify & Standardise Your Migration to AWS with a Migration Landing Zone
Simplify & Standardise Your Migration to AWS with a Migration Landing Zone
 
AWS Control Tower
AWS Control TowerAWS Control Tower
AWS Control Tower
 
Living the AWS Well Architected Framework
Living the AWS Well Architected FrameworkLiving the AWS Well Architected Framework
Living the AWS Well Architected Framework
 

Similar to AWS Well-Architected Framework: Operational Excellence Pillar

AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
Amazon Web Services
 
A Crash Course in Building Site Reliability
A Crash Course in Building Site ReliabilityA Crash Course in Building Site Reliability
A Crash Course in Building Site Reliability
Acquia
 

Similar to AWS Well-Architected Framework: Operational Excellence Pillar (20)

Demystifying Devops - Uday kumar
Demystifying Devops - Uday kumarDemystifying Devops - Uday kumar
Demystifying Devops - Uday kumar
 
Fundamentals of Agile
Fundamentals of AgileFundamentals of Agile
Fundamentals of Agile
 
DevOps Primer : Presented by Uday Kumar
DevOps Primer : Presented by Uday KumarDevOps Primer : Presented by Uday Kumar
DevOps Primer : Presented by Uday Kumar
 
Agile ncr2016 ppt
Agile ncr2016 pptAgile ncr2016 ppt
Agile ncr2016 ppt
 
Dev ops != Dev+Ops
Dev ops != Dev+OpsDev ops != Dev+Ops
Dev ops != Dev+Ops
 
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdfADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
ADDO_2020-Driving-Digital-Transformation-through-CloudOps-and-SRE.pdf
 
The Release Manager is Dead. Long Live the Release Manager!
The Release Manager is Dead. Long Live the Release Manager!The Release Manager is Dead. Long Live the Release Manager!
The Release Manager is Dead. Long Live the Release Manager!
 
The Release Manager is Dead. Long Live the Release Manager.
The Release Manager is Dead. Long Live the Release Manager.The Release Manager is Dead. Long Live the Release Manager.
The Release Manager is Dead. Long Live the Release Manager.
 
[WSO2Con USA 2018] Winning Strategy For Enterprise Integration to Empower Dig...
[WSO2Con USA 2018] Winning Strategy For Enterprise Integration to Empower Dig...[WSO2Con USA 2018] Winning Strategy For Enterprise Integration to Empower Dig...
[WSO2Con USA 2018] Winning Strategy For Enterprise Integration to Empower Dig...
 
What is DevOps? What is DevOps CoE?
What is DevOps? What is DevOps CoE? What is DevOps? What is DevOps CoE?
What is DevOps? What is DevOps CoE?
 
Unleashing change impact mining for sap dev ops
Unleashing change impact mining for sap dev opsUnleashing change impact mining for sap dev ops
Unleashing change impact mining for sap dev ops
 
Singlepoint AWS Well-Architected Review
Singlepoint AWS Well-Architected ReviewSinglepoint AWS Well-Architected Review
Singlepoint AWS Well-Architected Review
 
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
 
Powering Safe Launch @ Scale (Feature Flags, Targeting, Experimentation)
Powering Safe Launch @ Scale (Feature Flags, Targeting, Experimentation)Powering Safe Launch @ Scale (Feature Flags, Targeting, Experimentation)
Powering Safe Launch @ Scale (Feature Flags, Targeting, Experimentation)
 
Principle 11 needs to go! by Ken France at #AgileIndia2019
Principle 11 needs to go! by Ken France at #AgileIndia2019Principle 11 needs to go! by Ken France at #AgileIndia2019
Principle 11 needs to go! by Ken France at #AgileIndia2019
 
A Comprehensive Step-by-Step Guide for Designing an Agile-Friendly Automation...
A Comprehensive Step-by-Step Guide for Designing an Agile-Friendly Automation...A Comprehensive Step-by-Step Guide for Designing an Agile-Friendly Automation...
A Comprehensive Step-by-Step Guide for Designing an Agile-Friendly Automation...
 
Agile testing
Agile testingAgile testing
Agile testing
 
Best Practices for a Repeatable Shift-Left Commitment
Best Practices for a Repeatable Shift-Left CommitmentBest Practices for a Repeatable Shift-Left Commitment
Best Practices for a Repeatable Shift-Left Commitment
 
A Crash Course in Building Site Reliability
A Crash Course in Building Site ReliabilityA Crash Course in Building Site Reliability
A Crash Course in Building Site Reliability
 
XP Practices as Scaffolding for Breakthrough Companies
XP Practices as Scaffolding for Breakthrough CompaniesXP Practices as Scaffolding for Breakthrough Companies
XP Practices as Scaffolding for Breakthrough Companies
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

AWS Well-Architected Framework: Operational Excellence Pillar

  • 2. I’m Jonathan LaCour, CTO of Reliam ● Technologist ● Programmer ● Cloud Strategist ● Bourbon Junkie Nice to meet you! Hello there
  • 3. Reliam is an AWS certified consulting and managed services provider based in Southern California. Serving customers globally from startups to enterprise, Reliam’s certified solutions architects and engineers incorporate AWS best practices including the Well Architected Framework to advise companies on workload migration, architecture and optimization to drive rapid adoption of AWS services and high customer satisfaction. Reliam’s obsessive customer focus, coupled with operational excellence, expert technical solutions, industry-leading SLAs, and proven strategies & best practices, delivers on our promise to each customer to ensure their continued success throughout the entire lifecycle of their technology journey.
  • 4. Question How is your company currently using AWS?
  • 5. Agenda ● Introduction to Well-Architected ● Design Principles for Operational Excellence ● Areas of Operational Excellence ○ Preparation ○ Operation ○ Evolution ● Q&A
  • 7. AWS Well Architected The Five Pillars ● Operational Excellence ● Security ● Reliability ● Performance Efficiency ● Cost Optimization
  • 8. AWS Well Architected Benefits ● Build and deploy faster ● Lower or mitigate risk ● Make informed decisions ● Learn AWS best practices
  • 9. Reliam’s Insights from Well-Architected Reviews
  • 10. Review Free Remediation $10,000 for 40 hours $5,000 AWS Service Credit Reliam Accelerated Well-Architected Review Package
  • 12. Operational Excellence Design Principles Perform operations as code Annotated documentation Frequent, small, reversible changes Refine ops procedures frequently Anticipate failure Learn from all operational failures
  • 13. Perform operations as code Software Engineering Practices ● Automated testing ● CI/CD pipelines ● Version control ● Code review and standards Operations as Code ● Everything is software ● Bring software practices to operations and infrastructure ● De-risk, ensure consistency
  • 14. Question Has your organization adopted Infrastructure as Code?
  • 15. Annotated documentation On-Prem Environments ● Manual documentation ● Prone to error ● Docs drift from reality ● Operational agility suffers Cloud Environments ● Automated documentation ● Useful to humans & systems ● Docs reflect reality ● Operational agility improves
  • 16. Frequent, small, reversible changes Traditional Approach ● Software releases are large, high-risk bundles of changes ● Agile practitioners bundle many “sprints” into a release ● Systems are monolithic Continuous Approach ● Change is the new normal ● Systems are composed of small, focused components ● All changes are designed to be quickly reversible
  • 17. Question What is your typical release cadence?
  • 18. Refine operations procedures frequently Software Engineering Procedures ● Regular cadence of “retrospective” meetings ● Improvements progressively integrated Operations Procedures ● Regular cadence of “game days” and associated retros ● Improvements progressively integrated
  • 19. Question Does your organization have regular “Game Days?”
  • 20. Anticipate failure Typical Operations Teams ● Reactive approach to failure ● Post-mortem exercises after failures, if at all ● Problems usually discovered in production Operationally Excellent Teams ● Proactive approach to failure ● Pre-mortem exercises ● Test, validate, & measure scenarios in Game Days ● Problems usually anticipated
  • 21. Question Do you regularly schedule “pre-mortem” meetings?
  • 22. Learn from all operational failures Evolution Requires Sharing ● Drive change by sharing ● Involve product, marketing, and finance in improvements ● Establish a culture of continuous evolution
  • 24. Focus: Preparation Operational Priorities Successful operations teams are enlightened operations teams. ● Experts on their workloads ● Aware of shared business goals ● Clearly understand their role ● Grasp of regulatory and compliance constraints Proper prioritization without context is impossible.
  • 25. Question Do you feel that your operations teams are enlightened?
  • 26. Focus: Preparation Design for Operations Intentionally consider deployment, updates, and operations by design. ● Everything as code ● Structured CI/CD pipelines ● Shared libraries of common tools and templates ● Obsessive observability – data, data, data! Empower yourself to act quickly during incidents.
  • 27. Focus: Preparation Operational Readiness Technology is important, but so are process and procedure. ● Accurate documentation – checklists, runbooks, and playbooks ● Trained, right-sized team… no shortcuts! ● Governance to control readiness Codify process and procedure with AWS: resource tags, event triggers, AWS Systems Manager Run Command, Lambda, CloudWatch Events, etc.
  • 28. Question Does your operations team have documented procedures?
  • 29. Focus: Operation Understanding Operational Health Operational excellence requires immediately available, accurate insight into key metrics that are aligned with business requirements. ● Performance, cost, availability, latency, etc. ● Collect and aggregate data ● Implement dashboards and alerting AWS provides CloudWatch, Amazon ElasticSearch with Kibana, and many other tools to enable your understanding of operational health.
  • 30. Focus: Operation Responding to Events Armed with key metrics, alerting, and dashboards, your team can respond to events with confidence. ● Consider business impact when prioritizing ● Script responses through operations as code, leveraging data ● Implement automated rollbacks to known good versions ● Embrace AWS auto-scaling After navigating an incident, always perform root cause analysis and a full post-mortem.
  • 31. Focus: Evolution Learning from Experience The greatest indicator of success for ops teams? A passion for learning. ● Every incident is an opportunity ● Encourage ops teams to analyze, experiment, and improve ● AWS provides extensive platform to enable Be sure to pull in all parts of the business to add differing points of view, surfacing new opportunities for improvement.
  • 32. Question How does your organization view operational events?
  • 33. Focus: Evolution Share Learnings Many companies have multiple product and operations teams. Share your lessons broadly to drive a culture of improvement. ● Leverage AWS platform for sharing best practices, such as CloudFormation templates, Chef Cookbooks, and Lambda functions. ● Use AWS IAM to define permissions for controlled access. Evolution isn’t a localized process.
  • 34. Summary AWS WAF is a powerful collection of best practices WAF Program Partners like Reliam can help accelerate your journey Operational Excellence Pillar ● Design Principles ● Focus Areas ○ Preparation ○ Operation ○ Evolution