TechEvent 2019: Chaos Engineering - here we go; Lothar Wieske - Trivadis

•

1 j'aime•103 vues

Trivadis

Chaos Engineering - here we go; Lothar Wieske - Trivadis TechEvent 2019

Technologie

news.trivadis.com/blog@lwieske
Chaos Engineering
Here We Go
Lothar

Lothar
I am solutions architect and digital disruptor.
Since 2009, I work at the intersection between
cloud and analytics. Digital disruption is coming
to ever more sectors and I want to understand its
technological, societal and economical impacts.
Before 2009, I managed large project budgets,
turned to an architect later on and built a digital
radiology and migrated the Miles & More.
@lwieske news.trivadis.com/blog

“The cloud isn’t a place, it’s a
way of doing IT.”
Michael Dell

Cloud native technologies empower organizations to
build and run scalable applications in modern,
dynamic environments such as public, private, and
hybrid clouds. Containers, service meshes,
microservices, immutable infrastructure, and
declarative APIs exemplify this approach.

2012: Netflix Open Sourced Chaos Monkey.
2016: Netflix Completed Transition To a 100% AWS Infrastructure
Cloud Changed the Way Netflix Runs the Company

Netflix Handled Amazon Maintenance
Update
• Amazon performed a major maintenance update at the end of September 2014 in order to patch a
security vulnerability in a Xen hypervisor affecting about 10% of their global fleet of cloud servers.
• Netflix has a long history of using their Simian army - Chaos Monkey, Gorilla and Kong – to force
reboots of their servers in order to see how the overall system reacts and what can be done to
improve resilience. The problem this time was that the operation would affect some of their
database servers, more exactly 218 Cassandra nodes. It is one thing to perform a live restart of a
server streaming a video, and it is a lot more difficult to do the same to a stateful database.
• Out of our 2700+ production Cassandra nodes, 218 were rebooted.
• 22 Cassandra nodes were on hardware that did not reboot successfully.
• They were detected and replaced with minimal human intervention.
• Netflix experienced 0 downtime that weekend.

Infrastructure
Switching
Application
People
ToolsTools
Tools
Chaos
Engineering
Team
Security
Red
Team

Apache License 2.0, https://commons.wikimedia.org/w/index.php?curid=63503083

PRINCIPLES OF CHAOS ENGINEERING
• The following principles describe an ideal application of Chaos Engineering, applied to the processes
of experimentation described above. The degree to which these principles are pursued strongly
correlates to the confidence we can have in a distributed system at scale.
• Build a Hypothesis around Steady State Behavior
• Vary Real-world Events
• Run Experiments in Production
• Automate Experiments to Run Continuously
• Minimize Blast Radius
• Experimenting in production has the potential to cause unnecessary customer pain. While there
must be an allowance for some short-term negative impact, it is the responsibility and obligation of
the Chaos Engineer to ensure the fallout from experiments are minimized and contained.

Chaos Engineering Is Not Just Tools.
Culture Is Part Of Your System.
Complexity Is Part Of Your System.
Testing In Production? Yes You Can!
You Should Chaos Engineer Everything Cloud
and Microservices – Among Others

Integration Workshops
Orientation Workshops Elaboration Workshops Conception Workshops
Cloud Native
Leadership
Cloud Native
Apps
Cloud Native
Architectures
Teams & Skills
DevOps
Cloud Native
Data
Cloud Native
Journey
Cloud Native Landscape
Walkthrough
Cloud Native
Security
Cloud Native
Lighthouse

TechEvent 2019: Chaos Engineering - here we go; Lothar Wieske - Trivadis

Recommandé

GOTO Amsterdam 2017 - Enterprise Fast LaneChristian Deger

Going Cloud Native - It Takes a PlatformChip Childers

Avoiding Cloud OutageNati Shalom

MAKING MONEY from openstackHui Cheng

Red Hat Summit - What are your digital foundations?Eric D. Schabell

The new stack isn’t a stack: Fragmentation and terraforming  the service layerDonnie Berkholz

App Dev in the Cloud: Not my circus, not my monkeys...Eric D. Schabell

Simplifying The Cloud Top 10 Questions By SMBsSun Digital, Inc.

Recommandé

GOTO Amsterdam 2017 - Enterprise Fast LaneChristian Deger

Going Cloud Native - It Takes a PlatformChip Childers

Avoiding Cloud OutageNati Shalom

MAKING MONEY from openstackHui Cheng

Red Hat Summit - What are your digital foundations?Eric D. Schabell

The new stack isn’t a stack: Fragmentation and terraforming  the service layerDonnie Berkholz

App Dev in the Cloud: Not my circus, not my monkeys...Eric D. Schabell

Simplifying The Cloud Top 10 Questions By SMBsSun Digital, Inc.

Red Hat Summit - Discover the foundations of digital transformationEric D. Schabell

Redefining The Hybrid Cloud: Rackspace And The EMC FederationKenneth Hui

Can we hack open source #cloud platforms to help reduce emissions?Tom Raftery

Cloud 2.0: Containers, Microservices and Cloud HybridizationMark Hinkle

Vps server 2RicoVierra08

Pulling Back the Curtain –CloudStack in Private and Community CloudsChip Childers

The Cloud Native Journey with Simon ElishaChloe Jackson

Breaking through the CloudsAndy Piper

Data Strategy – What Does an Enterprise Data Cloud Mean for Your Agency?scoopnewsgroup

Ultimate AppDev Stack is Cloud SuiteEric D. Schabell

What is cloud native and why should I care?LibbySchulze

Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseRandy Bias

History of Data-Centric Transformationscoopnewsgroup

10 predictions for cloud native in 2021Cheryl Hung

Cloud Computing and Open SourceJohn Willis

Red Hat Forum Poland 2019 - 3 Pitfalls Everyone Should Avoid with Hybrid Mult...Eric D. Schabell

CloudCamp Chicago May 2014CloudCamp Chicago

CloudCamp Chicago - Big Data & Cloud May 2015 - All SlidesCloudCamp Chicago

OpenStack & the Evolving Cloud EcosystemMark Voelker

Microservices in the cloud at AutoScout24Christian Deger

Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your CloudMark Hinkle

Kubernetes and Container Technologies from Cloud Native Computing FoundationCloud Standards Customer Council

Contenu connexe

Tendances

Red Hat Summit - Discover the foundations of digital transformationEric D. Schabell

Redefining The Hybrid Cloud: Rackspace And The EMC FederationKenneth Hui

Can we hack open source #cloud platforms to help reduce emissions?Tom Raftery

Cloud 2.0: Containers, Microservices and Cloud HybridizationMark Hinkle

Vps server 2RicoVierra08

Pulling Back the Curtain –CloudStack in Private and Community CloudsChip Childers

The Cloud Native Journey with Simon ElishaChloe Jackson

Breaking through the CloudsAndy Piper

Data Strategy – What Does an Enterprise Data Cloud Mean for Your Agency?scoopnewsgroup

Ultimate AppDev Stack is Cloud SuiteEric D. Schabell

What is cloud native and why should I care?LibbySchulze

Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseRandy Bias

History of Data-Centric Transformationscoopnewsgroup

10 predictions for cloud native in 2021Cheryl Hung

Cloud Computing and Open SourceJohn Willis

Red Hat Forum Poland 2019 - 3 Pitfalls Everyone Should Avoid with Hybrid Mult...Eric D. Schabell

CloudCamp Chicago May 2014CloudCamp Chicago

CloudCamp Chicago - Big Data & Cloud May 2015 - All SlidesCloudCamp Chicago

OpenStack & the Evolving Cloud EcosystemMark Voelker

Microservices in the cloud at AutoScout24Christian Deger

Tendances (20)

Red Hat Summit - Discover the foundations of digital transformation

Redefining The Hybrid Cloud: Rackspace And The EMC Federation

Can we hack open source #cloud platforms to help reduce emissions?

Cloud 2.0: Containers, Microservices and Cloud Hybridization

Vps server 2

Pulling Back the Curtain –CloudStack in Private and Community Clouds

The Cloud Native Journey with Simon Elisha

Breaking through the Clouds

Data Strategy – What Does an Enterprise Data Cloud Mean for Your Agency?

Ultimate AppDev Stack is Cloud Suite

What is cloud native and why should I care?

Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise

History of Data-Centric Transformation

10 predictions for cloud native in 2021

Cloud Computing and Open Source

Red Hat Forum Poland 2019 - 3 Pitfalls Everyone Should Avoid with Hybrid Mult...

CloudCamp Chicago May 2014

CloudCamp Chicago - Big Data & Cloud May 2015 - All Slides

OpenStack & the Evolving Cloud Ecosystem

Microservices in the cloud at AutoScout24

Similaire à TechEvent 2019: Chaos Engineering - here we go; Lothar Wieske - Trivadis

Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your CloudMark Hinkle

Kubernetes and Container Technologies from Cloud Native Computing FoundationCloud Standards Customer Council

Disaster recovery solutions and datacentre replacementsOVHcloud

Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...Mark Hinkle

The Cloud Revolution - Philippines Cloud SummitRandy Bias

Cloud computing.pptxandrewbourget

Docker?!?! But I'm a SysAdminDocker, Inc.

CWIN17 london becoming cloud native part 2 - guy martin dockerCapgemini

Ca technology exchange virtualizationrsravi

FLUX - Crash Course in Cloud 2.0 Mark Hinkle

Cloud ComputingKishor Satpathy

Server Virtualization and Cloud Computing: Four Hidden Impacts on ...webhostingguy

How Cloud Computing will change how you and your team will run ITPeter HJ van Eijk

Developing Hybrid Cloud ApplicationsDaniel Berg

Technology insights: Decision Science PlatformDecision Science Community

OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingMark Hinkle

Navigating Cloud and Multi-CloudAdvanced Technology Consulting (ATC)

Pm440 Presentation Black Cloudguesta946d0

[OpenStack Days Korea 2016] An SDN Pioneer's Vision of NetworkingOpenStack Korea Community

Introduction to Chaos EngineeringRaymond Adrian (Rad) Butalid

Similaire à TechEvent 2019: Chaos Engineering - here we go; Lothar Wieske - Trivadis (20)

Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud

Kubernetes and Container Technologies from Cloud Native Computing Foundation

Disaster recovery solutions and datacentre replacements

Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...

The Cloud Revolution - Philippines Cloud Summit

Cloud computing.pptx

Docker?!?! But I'm a SysAdmin

CWIN17 london becoming cloud native part 2 - guy martin docker

Ca technology exchange virtualization

FLUX - Crash Course in Cloud 2.0

Cloud Computing

Server Virtualization and Cloud Computing: Four Hidden Impacts on ...

How Cloud Computing will change how you and your team will run IT

Developing Hybrid Cloud Applications

Technology insights: Decision Science Platform

OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing

Navigating Cloud and Multi-Cloud

Pm440 Presentation Black Cloud

[OpenStack Days Korea 2016] An SDN Pioneer's Vision of Networking

Introduction to Chaos Engineering

Plus de Trivadis

Azure Days 2019: Azure Chatbot Development for Airline Irregularities (Remco ...Trivadis

Azure Days 2019: Trivadis Azure Foundation – Das Fundament für den ... (Nisan...Trivadis

Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)Trivadis

Azure Days 2019: Master the Move to Azure (Konrad Brunner)Trivadis

Azure Days 2019: Keynote Azure Switzerland – Status Quo und Ausblick (Primo A...Trivadis

Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Trivadis

Azure Days 2019: Get Connected with Azure API Management (Gerry Keune & Stefa...Trivadis

Azure Days 2019: Infrastructure as Code auf Azure (Jonas Wanninger & Daniel H...Trivadis

Azure Days 2019: Wie bringt man eine Data Analytics Plattform in die Cloud? (...Trivadis

Azure Days 2019: Azure@Helsana: Die Erweiterung von Dynamics CRM mit Azure Po...Trivadis

TechEvent 2019: Kundenstory - Kein Angebot, kein Auftrag – Wie Du ein individ...Trivadis

TechEvent 2019: Oracle Database Appliance M/L - Erfahrungen und Erfolgsmethod...Trivadis

TechEvent 2019: Security 101 für Web Entwickler; Roland Krüger - TrivadisTrivadis

TechEvent 2019: Trivadis & Swisscom Partner Angebote; Konrad Häfeli, Markus O...Trivadis

TechEvent 2019: DBaaS from Swisscom Cloud powered by Trivadis; Konrad Häfeli ...Trivadis

TechEvent 2019: Status of the partnership Trivadis and EDB - Comparing Postgr...Trivadis

TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...Trivadis

TechEvent 2019: Kundenstory - Vom Hauptmann zu Köpenick zum Polizisten 2020 -...Trivadis

TechEvent 2019: Vom Rechenzentrum in die Oracle Cloud - Übertragungsmethoden;...Trivadis

TechEvent 2019: The sleeping Power of Data; Eberhard Lösch - TrivadisTrivadis

Plus de Trivadis (20)

Azure Days 2019: Azure Chatbot Development for Airline Irregularities (Remco ...

Azure Days 2019: Trivadis Azure Foundation – Das Fundament für den ... (Nisan...

Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)

Azure Days 2019: Master the Move to Azure (Konrad Brunner)

Azure Days 2019: Keynote Azure Switzerland – Status Quo und Ausblick (Primo A...

Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)

Azure Days 2019: Get Connected with Azure API Management (Gerry Keune & Stefa...

Azure Days 2019: Infrastructure as Code auf Azure (Jonas Wanninger & Daniel H...

Azure Days 2019: Wie bringt man eine Data Analytics Plattform in die Cloud? (...

Azure Days 2019: Azure@Helsana: Die Erweiterung von Dynamics CRM mit Azure Po...

TechEvent 2019: Kundenstory - Kein Angebot, kein Auftrag – Wie Du ein individ...

TechEvent 2019: Oracle Database Appliance M/L - Erfahrungen und Erfolgsmethod...

TechEvent 2019: Security 101 für Web Entwickler; Roland Krüger - Trivadis

TechEvent 2019: Trivadis & Swisscom Partner Angebote; Konrad Häfeli, Markus O...

TechEvent 2019: DBaaS from Swisscom Cloud powered by Trivadis; Konrad Häfeli ...

TechEvent 2019: Status of the partnership Trivadis and EDB - Comparing Postgr...

TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...

TechEvent 2019: Kundenstory - Vom Hauptmann zu Köpenick zum Polizisten 2020 -...

TechEvent 2019: Vom Rechenzentrum in die Oracle Cloud - Übertragungsmethoden;...

TechEvent 2019: The sleeping Power of Data; Eberhard Lösch - Trivadis

Dernier

Google AI Hackathon: LLM based Evaluator for RAGSujit Pal

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

Understanding the Laravel MVC ArchitecturePixlogix Infotech

A Call to Action for Generative AI in 2024Results

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Dernier (20)

Google AI Hackathon: LLM based Evaluator for RAG

The Codex of Business Writing Software for Real-World Solutions 2.pptx

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Finology Group – Insurtech Innovation Award 2024

Presentation on how to chat with PDF using ChatGPT code interpreter

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

[2024]Digital Global Overview Report 2024 Meltwater.pdf

08448380779 Call Girls In Civil Lines Women Seeking Men

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...

Maximizing Board Effectiveness 2024 Webinar.pptx

Understanding the Laravel MVC Architecture

A Call to Action for Generative AI in 2024

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Data Cloud, More than a CDP by Matt Robison

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

My Hashitalk Indonesia April 2024 Presentation

Unblocking The Main Thread Solving ANRs and Frozen Frames

08448380779 Call Girls In Friends Colony Women Seeking Men

Breaking the Kubernetes Kill Chain: Host Path Mount

TechEvent 2019: Chaos Engineering - here we go; Lothar Wieske - Trivadis

1. news.trivadis.com/blog@lwieske Chaos Engineering Here We Go Lothar

2. Lothar I am solutions architect and digital disruptor. Since 2009, I work at the intersection between cloud and analytics. Digital disruption is coming to ever more sectors and I want to understand its technological, societal and economical impacts. Before 2009, I managed large project budgets, turned to an architect later on and built a digital radiology and migrated the Miles & More. @lwieske news.trivadis.com/blog

3. Cloud Computing and Cloud Native

4. “The cloud isn’t a place, it’s a way of doing IT.” Michael Dell

5. Cloud native technologies empower organizations to build and run scalable applications in modern, dynamic environments such as public, private, and hybrid clouds. Containers, service meshes, microservices, immutable infrastructure, and declarative APIs exemplify this approach.

6. Chaos Engineering

7. Werner Vogels Adrian Cockcroft

8. 2012: Netflix Open Sourced Chaos Monkey. 2016: Netflix Completed Transition To a 100% AWS Infrastructure Cloud Changed the Way Netflix Runs the Company

9. Netflix Handled Amazon Maintenance Update • Amazon performed a major maintenance update at the end of September 2014 in order to patch a security vulnerability in a Xen hypervisor affecting about 10% of their global fleet of cloud servers. • Netflix has a long history of using their Simian army - Chaos Monkey, Gorilla and Kong – to force reboots of their servers in order to see how the overall system reacts and what can be done to improve resilience. The problem this time was that the operation would affect some of their database servers, more exactly 218 Cassandra nodes. It is one thing to perform a live restart of a server streaming a video, and it is a lot more difficult to do the same to a stateful database. • Out of our 2700+ production Cassandra nodes, 218 were rebooted. • 22 Cassandra nodes were on hardware that did not reboot successfully. • They were detected and replaced with minimal human intervention. • Netflix experienced 0 downtime that weekend.

10. Infrastructure Switching Application People ToolsTools Tools Chaos Engineering Team Security Red Team

11. Apache License 2.0, https://commons.wikimedia.org/w/index.php?curid=63503083

12. PRINCIPLES OF CHAOS ENGINEERING • The following principles describe an ideal application of Chaos Engineering, applied to the processes of experimentation described above. The degree to which these principles are pursued strongly correlates to the confidence we can have in a distributed system at scale. • Build a Hypothesis around Steady State Behavior • Vary Real-world Events • Run Experiments in Production • Automate Experiments to Run Continuously • Minimize Blast Radius • Experimenting in production has the potential to cause unnecessary customer pain. While there must be an allowance for some short-term negative impact, it is the responsibility and obligation of the Chaos Engineer to ensure the fallout from experiments are minimized and contained.

13.

14. Chaos Engineering Is Not Just Tools. Culture Is Part Of Your System. Complexity Is Part Of Your System. Testing In Production? Yes You Can! You Should Chaos Engineer Everything Cloud and Microservices – Among Others

15. Integration Workshops Orientation Workshops Elaboration Workshops Conception Workshops Cloud Native Leadership Cloud Native Apps Cloud Native Architectures Teams & Skills DevOps Cloud Native Data Cloud Native Journey Cloud Native Landscape Walkthrough Cloud Native Security Cloud Native Lighthouse