Chicago DevOps Meetup Nov2019

What is Keptn?
Michael Villiger, Dynatrace
@mikevilliger
@keptnProject
https://keptn.sh
https://github.com/keptn

Industry is pushing “Autonomous Cloud Management”

MTTI
Mean Time to Innovation
MTTR
Mean Time to Remediate
4.8 days
4 hours
~ 10min
12.5 days 2 days ~ 1 hour
The reality and the evidence support the need for ACM!
https://dynatrace.ai/acsurvey
Only < 5% are “Cloud Native”

Building Blocks for ACM/Cloud Natives!
Goals: Increase Quality & Level of Automation; Increase Speed & Reduce Costs
Pillars: AUTOMATE QUALITY, AUTOMATE DEPLOYMENT, AUTOMATE MONITORING, AUTOMATE OPERATIONS
Building blocks: Automated Testing, Continuous Performance, Auto Quality Gates, Feature Flagging, Adaptive Scaling, Auto Roll-Back, Canary Releases, Blue/Green Deployments, Auto-Remediation
Strategically Used as Pipeline Feature

That is why we are building Keptn
Because cloud native delivery and operations are a BIG challenge for enterprises!
Cloud Native

Which problem does Keptn solve?
CI/CD Pipeline
• This example: 350+ lines
• Information about
• Target platform (k8s, …)
• Environments (dev, hardening, …)
• Tools (Terraform, Helm, hey, …)
• Process (build, deploy, test, evaluate, …)
Pipelines seem to be becoming our future unmanageable legacy code!

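pipeline {
    stages {
        stage('Deploy to dev namespace') {
            steps {
                container('helm') {
                    // deployment commands elided on the slide
                }
            }
        }
        stage('Run tests') {
            steps {
                container('hey') {
                    // test commands elided on the slide
                }
            }
        }
        stage('Evaluate performance') {
            steps {
                container('curl') {
                    // evaluation call elided on the slide
                }
            }
        }
        stage('Deploy to staging') {
            // A declarative pipeline cannot wrap a stage in `if`; `when` expresses the same condition
            when { expression { evaluation.passed } }
            steps {
                // promotion steps elided on the slide
            }
        }
    }
}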
[Figure: a wall of near-identical pipeline definitions using tools such as jmeter, kustomize and selenium]
1 Service = 1 Pipeline
1 Project = x Pipelines
n Teams = n*x Pipelines

Challenge
• Add hardening stage?
• Use different tool for deployment?
• Add notifications to all steps?
• Enforce manual approval before promoting to production for a period of time?

How does Keptn solve that?
Keptn enables you to
• Define application delivery and operations processes declaratively
• Use predefined CloudEvents to separate the process from the tools
• Integrate and easily switch between different tools

Declarative delivery flow: Shipyard
keptn: v1
type: shipyard
stages:
  - name: "dev"
    deployment: "direct"
    test: "functional"
    promotion: "automatic"
  - name: "hardening"
    deployment: "blue-green"
    approval: "manual"
    test: "performance"
    promotion: "manual"
  - name: "prod"
    release: "canary"
    interval: "10m"
    increase: "20%"
Shipyard specifies STAGES and WHAT TO DO in these stages
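One way to read a Shipyard is as plain data that a control plane walks stage by stage. The following is a minimal Python sketch of that idea, not Keptn's implementation: the `SHIPYARD` dict is a simplified copy of the stages on the slide, and `STEP_ORDER` and `plan_delivery` are hypothetical names introduced only for illustration.

```python
# Simplified, hypothetical model of the slide's Shipyard as plain Python data.
SHIPYARD = {
    "stages": [
        {"name": "dev", "deployment": "direct",
         "test": "functional", "promotion": "automatic"},
        {"name": "hardening", "deployment": "blue-green",
         "test": "performance", "promotion": "manual"},
        {"name": "prod", "release": "canary"},
    ]
}

# Assumed order in which the step keys are processed inside one stage.
STEP_ORDER = ("deployment", "test", "release", "promotion")


def plan_delivery(shipyard):
    """Return a flat list of (stage, step, strategy) tuples in execution order."""
    plan = []
    for stage in shipyard["stages"]:
        for step in STEP_ORDER:
            if step in stage:
                plan.append((stage["name"], step, stage[step]))
    return plan


if __name__ == "__main__":
    for stage, step, strategy in plan_delivery(SHIPYARD):
        print(f"{stage}: {step} -> {strategy}")
```

In this sketch, adding a stage means adding one more dict to the list; no pipeline code has to change.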

Standardized way of communication: CloudEvents
CloudEvent
Events decouple WHAT from WHO and provide context information
{
  "type": "sh.keptn.event.start-deploy",
  "specversion": "0.2",
  "source": "https://github.com/keptn/keptn/cli",
  "id": "49ac0dec-a83b-4bc1-9dc0-1f050c7e781b",
  "time": "2019-06-07T07:02:15.64489Z",
  "contenttype": "application/json",
  "shkeptncontext": "49ac0dec-a83b-4bc1-9dc0-1f050c7e789b",
  "data": {
    "project": "bookshop",
    "stage": "dev",
    "service": "details",
    "image": "bookshop/details:0.9.1"
  }
}
[Diagram: CloudEvents exchanged between services through a PubSub provider]
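To make the event envelope concrete, here is a small Python sketch that builds a dict with the same fields as the slide's example and serializes it to JSON. `make_event` is a hypothetical helper, not part of any Keptn SDK, and the comment on `shkeptncontext` reflects an assumption about its purpose (correlating the events of one run).

```python
import json
import uuid
from datetime import datetime, timezone


def make_event(event_type, project, stage, service, image, context=None):
    """Build a CloudEvent-style dict with the same fields as the slide's example."""
    return {
        "type": event_type,
        "specversion": "0.2",
        "source": "https://github.com/keptn/keptn/cli",
        "id": str(uuid.uuid4()),
        "time": datetime.now(timezone.utc).isoformat(),
        "contenttype": "application/json",
        # Assumption: one context id is shared by all events of a delivery run.
        "shkeptncontext": context or str(uuid.uuid4()),
        "data": {
            "project": project,
            "stage": stage,
            "service": service,
            "image": image,
        },
    }


if __name__ == "__main__":
    event = make_event("sh.keptn.event.start-deploy",
                       "bookshop", "dev", "details", "bookshop/details:0.9.1")
    print(json.dumps(event, indent=2))
```

Because the payload names only the project, stage, service and image, any tool that understands the event type can react to it without knowing who sent it.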

Declarative tool definition: Uniform
keptn: v1
type: uniform
services:
  - name: "slack-trail"
    image: "keptn/slack:1.0"
    env:
      - name: "SLACK_WEBHOOK"
        value: "https://hooks.sl..."
    topics:
      - "*"
  - name: "deploy-svc"
    image: "argo/argocd-svc:1.3"
    topics:
      - "start_deploy"
  - name: "performance-test"
    image: "keptn/jmeter:0.7"
    topics:
      - "start_test"
  ...
Uniform specifies WHO reacts to which EVENTS
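The topic matching a Uniform implies can be sketched in a few lines of Python. This is an illustration under assumptions, not Keptn's routing code: `UNIFORM` copies only the service names and topics from the slide, `subscribers` is a hypothetical helper, and `"*"` is assumed to mean "every topic".

```python
# Service names and topics copied from the slide; everything else is omitted.
UNIFORM = {
    "services": [
        {"name": "slack-trail", "topics": ["*"]},
        {"name": "deploy-svc", "topics": ["start_deploy"]},
        {"name": "performance-test", "topics": ["start_test"]},
    ]
}


def subscribers(uniform, topic):
    """Return the names of all services that should receive an event on `topic`."""
    return [
        service["name"]
        for service in uniform["services"]
        # Assumption: "*" subscribes a service to every topic.
        if "*" in service["topics"] or topic in service["topics"]
    ]


if __name__ == "__main__":
    for topic in ("start_deploy", "start_test"):
        print(topic, "->", subscribers(UNIFORM, topic))
```

Swapping the deployment tool then amounts to changing one entry in the data, which is the point the slide makes about separating process from tools.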

How does Keptn solve that?
Challenge → Challenge accepted
• Add hardening stage? → Add stage in Shipyard.
• Use different tool for deployment? → Switch tool in Uniform.
• Add notifications to all steps? → Add tool in Uniform on all events.
• Enforce manual approval before promoting to production for a period of time? → Change approval in Shipyard.

Example: Application Delivery
SLI.yml
SLO.yml

Second: Continuous Delivery is NOT ENOUGH!

Continuous Delivery – Launch control
Launch operations are supervised and
controlled from several control rooms (also
known as firing rooms). The controllers are
in control of pre-launch checks, the booster
and spacecraft. Once the rocket has cleared
the launch tower (usually within the first
10–15 seconds), control is switched
over to the Mission Control Center.

Continuous Operations – Mission Control
A mission control center (MCC, sometimes
called a flight control center or operations
center) is a facility that manages space flights,
usually from the point of launch until landing
or the end of the mission. It is part of
the ground segment of spacecraft operations.
A staff of flight controllers and other support
personnel monitor all aspects of the mission
using telemetry, and send commands to the
vehicle using ground stations

Quote: “We spend more time in manual communication than remediating issues”
Second: Keptn has Continuous Operations at its Core!
[Diagram: incident phases ENGAGE, TRIAGE, FIND & ASSEMBLE, RESOLVE, RESTORE, with manual communication and number of issues compared before and after]
BEFORE: mostly manual
AFTER: mostly automated

Mission Control
“Automated Operations” = Day 2 Ops
Launch Control
“Continuous Deployment” = Day 1 Ops
keptn accelerates building autonomous clouds
Event-driven runbook automation
Production problems can be automatically remediated in real-time by executing runbooks that require no manual intervention.
Self-healing blue/green deployments
Deployments that follow the “Operations as Code” paradigm automatically remediate problems and get your deployment pipeline working again in under a minute.
Automated multistage unbreakable delivery pipelines
GitOps-enabled delivery pipelines with automated quality gates support automated testing and monitoring-as-a-service.

Designed for modern applications
GitOps-based collaboration
All keptn workflows are based on the GitOps paradigm.
Operator patterns for all logic components
Logic components can be reused for other operational tasks.
Monitoring and operations as code
Developer-friendly definition of monitoring and operational tasks.
Built on and for Kubernetes
Built for modern cloud-native environments.
Event-driven and serverless
Powerful with a minimal resource footprint.
Pluggable tooling
All tools leveraged by keptn can be replaced based on your tool preferences.

Example: Automated Operations
SLI.yml
SLO.yml
REM.yml
Git provider
Monitoring provider
(1) Add operation instructions
(2) Store & version files
(3) Setup & configure monitoring
(4) Monitor services
(5) Detect issues based on SLO
(6) Alert Keptn
(7) Find remediation action
(8) Execute remediation action
(9) Receive monitoring feedback (success)
Example: CPU exhausted! → Scale up → Cope with load
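Steps (5) to (8) of this flow can be sketched as a small detect-and-remediate loop. The Python below is a toy illustration only: the slide does not show the SLO.yml or REM.yml formats, so `SLOS`, `REMEDIATIONS`, the indicator name `cpu_usage_percent` and the `scale_up` action are all hypothetical stand-ins.

```python
# Hypothetical stand-ins for SLO.yml (upper bounds) and REM.yml (issue -> action).
SLOS = {"cpu_usage_percent": 90.0}
REMEDIATIONS = {"cpu_usage_percent": "scale_up"}


def detect_issues(sli_values, slos):
    """Step 5: return the indicators whose measured value breaks its objective."""
    return [name for name, limit in slos.items()
            if sli_values.get(name, 0.0) > limit]


def scale_up(service):
    """Step 8 (toy version): add one replica to the in-memory service record."""
    service["replicas"] += 1


ACTIONS = {"scale_up": scale_up}


def remediate(service, sli_values):
    """Steps 5 to 8: detect issues, find the remediation action, execute it."""
    executed = []
    for issue in detect_issues(sli_values, SLOS):
        action_name = REMEDIATIONS.get(issue)
        if action_name is None:
            continue  # no remediation defined for this issue
        ACTIONS[action_name](service)
        executed.append(action_name)
    return executed


if __name__ == "__main__":
    carts = {"name": "carts", "replicas": 1}
    print(remediate(carts, {"cpu_usage_percent": 97.0}), carts)
```

Step (9) would then re-read the indicators and confirm that the objective is met again; that feedback check is left out here to keep the sketch short.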

Delivery/Operations Personas
Developers
Defines
remediation
actions
SRE
Service Level
Objectives
Defines stages
and process
DevOps
Provides Service
Level Indicators
Defines tools

Key features of Keptn
• Message-driven control plane for delivery/operations: declarative
• Standardized CloudEvents for communication
• GitOps built-in
• Configuration versioned in git; Upstreams to remote git (GitHub, GitLab, BitBucket, …)
• Enables automated operations (self-healing)
• SLIs, SLOs, Remediation Actions
• Works well in multi-stage, multi-cluster scenarios
• Supports non-k8s applications
• Observability built-in

Observability built-in: Keptn’s bridge

Roadmap
• Extend and collaborate on CloudEvent specification
• Enable easy interoperability between common CNCF tools
• Add additional cloud-native practices (canary, feature-flag based self-healing)
• Enhance user interface and observability
• Extend the UI and implement tracing that conforms to W3C Trace Context
• Build support for uniforms and Keptn’s wardrobe (service registry)
• Integrate a number of tools and provide a public registry
• Improved self-healing and auto-remediation capabilities
• Handle common problems out-of-the-box?

MTTI
= Mean Time to Innovation
MTTR
Mean Time to Remediate
4.8 days
4 hours
~ 10min
12.5 days 2 days ~ 1 hour
We are building Keptn to re-shape this reality
GROW this number!

Where and How to Contribute?

Config ChatOps IT Auto.
Deploy Test Observe
Keptn core is fundamentally event driven:
0.5.0
• sh.keptn.internal.event.project.create
• sh.keptn.internal.event.service.create
• sh.keptn.event.configuration.change
• sh.keptn.event.monitoring.configure
• sh.keptn.events.deployment-finished
• sh.keptn.events.evaluation-done
• sh.keptn.events.tests-finished
• sh.keptn.events.problem
Service patterns:
• 1. Do Something, 2. Send Event
• 1. Consume Event, 2. Do Something
• 1. Consume Event, 2. Do Something, 3. Send Event
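The consume / do something / send pattern can be illustrated with a tiny in-memory publish-subscribe loop. This Python sketch is not Keptn's eventbroker or distributor: `Broker`, `run_tests` and `notify` are hypothetical names, and only the event type strings are taken from the list above.

```python
from collections import defaultdict

DEPLOYMENT_FINISHED = "sh.keptn.events.deployment-finished"
TESTS_FINISHED = "sh.keptn.events.tests-finished"


class Broker:
    """Toy in-memory pub/sub: handlers are called synchronously on publish."""

    def __init__(self):
        self._handlers = defaultdict(list)
        self.log = []  # every published event type, in order

    def subscribe(self, event_type, handler):
        self._handlers[event_type].append(handler)

    def publish(self, event_type, data):
        self.log.append(event_type)
        for handler in list(self._handlers[event_type]):
            handler(self, data)


def run_tests(broker, data):
    """Consume Event -> Do Something -> Send Event."""
    result = dict(data, tests="passed")  # placeholder for a real test run
    broker.publish(TESTS_FINISHED, result)


def notify(broker, data):
    """Consume Event -> Do Something (no follow-up event)."""
    print(f"tests {data['tests']} for service {data['service']}")


if __name__ == "__main__":
    broker = Broker()
    broker.subscribe(DEPLOYMENT_FINISHED, run_tests)
    broker.subscribe(TESTS_FINISHED, notify)
    broker.publish(DEPLOYMENT_FINISHED, {"service": "carts"})
```

Neither handler knows about the other; they are coupled only through the event types they consume and send.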

Some example Keptn Contribution Use Cases
• Send new-artifact event when new build artifact is generated
• Consume all or specific events for notification purposes
• Consume deployment-finished events to begin tests
• Consume configuration-changed events to execute automated tasks (e.g. Jenkins)
• Consume problem events to begin executing a remediation action (e.g. ServiceNow, XMatters, Ansible)
• Disregard events entirely and provide an additional source of metrics to Pitometer! (e.g. Prometheus)

Example 1: JIRA Service
• Subscribes to:
• Creates JIRA ticket upon failed Pitometer deployment evaluation
• Ticket includes:
• Failed deployment stage (e.g. Dev, Staging, Prod)
• Failed service
• Total Pitometer Score
• Pitometer Score Pass Threshold
• Pitometer Score Warning Threshold
• Pitometer Indicator ID
• Pitometer Evaluation Result
http://github.com/keptn-contrib/jira-service
1. Consume Event
2. Do Something

Example 2: Neotys
• Subscribes to:
• Publishes to:
• sh.keptn.events.tests-finished
• Executes a NeoLoad test stored in source control alongside application source
• Consumes test-strategy from Shipyard file
• Pitometer NeoLoad source allows use of NeoLoad performance test results for build validation in Pitometer
https://github.com/keptn-contrib/neoload-service
https://github.com/neotyskeptn/pitometer-source-neoload
1. Consume Event
2. Do Something
3. Send Event

Example 3: UFO Service
• Subscribes to:
• sh.keptn.events.new-artifact
• sh.keptn.events.tests-finished
• new-artifact event = blue LEDs
• deployment-finished event = purple LEDs
• evaluation-done Pass = green LEDs
• evaluation-done Fail = red LEDs
https://github.com/keptn-contrib/ufo-service
1. Consume Event
2. Do Something
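The UFO service's mapping from events to LED colors is simple enough to sketch directly. The Python below is an illustration, not the code in the ufo-service repository: `led_color` is a hypothetical function, and the `passed` field used to tell a passing evaluation from a failing one is an assumed name, since the slide does not show the event payload.

```python
NEW_ARTIFACT = "sh.keptn.events.new-artifact"
DEPLOYMENT_FINISHED = "sh.keptn.events.deployment-finished"
EVALUATION_DONE = "sh.keptn.events.evaluation-done"

# Events whose color does not depend on the payload (colors from the slide).
STATIC_COLORS = {
    NEW_ARTIFACT: "blue",
    DEPLOYMENT_FINISHED: "purple",
}


def led_color(event_type, data=None):
    """Return the LED color for an event, or None if the event is ignored."""
    if event_type == EVALUATION_DONE:
        # Assumption: the payload carries a boolean "passed" field.
        passed = bool((data or {}).get("passed", False))
        return "green" if passed else "red"
    return STATIC_COLORS.get(event_type)


if __name__ == "__main__":
    print(led_color(NEW_ARTIFACT))
    print(led_color(EVALUATION_DONE, {"passed": False}))
```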

Relevant events by provider use-case:
• Testing Tools/Services:
• Subscribe to sh.keptn.events.deployment-finished
• Pitometer source for test results
• Consumes test-strategy from Shipyard file
• Monitoring Tools/Services:
• Pitometer source
• Continuous Integration or Build Tools/Services:
• Publish to sh.keptn.event.configuration.change
• ChatOps Tools/Services
• Subscribe to ALL events
• Automation Tools/Services
• Subscribe to sh.keptn.events.problem to remediate a problem
• Subscribe to sh.keptn.event.configuration.change to execute additional deployment tasks

What is Keptn?
A message-driven control plane
for application delivery
and automated operations.

Join the community
• https://keptn.sh
• https://github.com/keptn/community
• @keptnproject
• keptn@dynatrace.com
• keptn.slack.com
• Biweekly community meetings
• Zoom https://dynatrace.zoom.us/j/578828410
• Notes https://tinyurl.com/keptn-meeting-notes

Keptn architecture
Platform
keptn: cli, api (REST API), eventbroker, bridge, mongodb-datastore, git, configuration-service, internal keptn services, distributor
keptn services: Continuous Delivery, Container Registry, Operations Automation, AIOps, GitOps, Test Automation, ChatOps, Data Provider
Namespaces: dev, staging, production, keptn
Simple installation and usage
$ curl -sL https://get.keptn.sh | sudo -E bash
$ keptn install --platform=gke
<download examples>
$ git clone https://github.com/keptn/examples.git
$ cd examples/onboarding-carts
<create project and onboard service>
$ keptn create project sockshop shipyard.yaml
$ keptn onboard service carts --project=sockshop --chart=carts-0.1.0.tgz
<configure monitoring and add load test>
$ keptn configure monitoring prometheus
$ keptn add-resource --project=sockshop --service=carts --stage=staging \
    --resource=jmeter/load.jmx --resourceUri=jmeter/load.jmx
<ship a new artifact for the onboarded service>
$ keptn send event new-artifact --project=sockshop --service=carts \
    --image=docker.io/keptnexamples/carts --tag=0.9.1
Full use case description:
● https://keptn.sh/docs/0.5.0.beta/installation/setup-keptn/
● https://keptn.sh/docs/0.5.0.beta/usecases/onboard-carts-service/
