DevOps Transformation at Dynatrace and with Dynatrace

CMG Boston, April 20th 2017
Andreas Grabner: @grabnerandi, andreas.grabner@dynatrace.com
Podcast: https://www.spreaker.com/user/pureperformance
Dynatrace Trial: http://bit.ly/dtsaastrial

confidential
How I explain DevOps Transformation!
or
From Waterfall to Continuous Innovation
through DevOps Automation and Culture

24 “Features in a Box” Ship the whole box!
Photo-Bombed!
Very late feedback 
Frustration!
Quality Control!
Back to Customer

Continuous User Driven Innovation
1 “Feature at a Time”
Optimize Before Deploy
Immediate Customer Feedback

Use Case: DevOps
Transformation @ Dynatrace

2011: APM about to be disrupted!
• Migrate from On-Prem to VM, Cloud, Containers and PaaS
• Architectures include micro-services, on-demand scaling, self-healing
• “Cloud Natives” demand SaaS-based solutions
• Digital Transformers demand Analytics for Biz, Dev, Ops & Sec
• Many new players on the market

Challenges to master!
• Bridging the gap between “New Stack” and “Enterprise Stack”
• Deploying the same way our customers do: Continuously!
• Not disrupting current operations and slower-moving customers
• Aligning 300+ engineers across 3 different geos
• Solution: Innovation through Incubation!

% 20%
organization & culture technology
DevOps Transformation @ Dynatrace

2011:
• 2 major releases/year
• 6 months: major/minor release
• customers deploy & operate on-prem
2016:
• 26 major releases/year
• 170 prod deployments/day
• sprint releases (continuous delivery)
• 1h: Code -> Prod
• self-service online sales
• SaaS & Managed

Developer will never do that!
Operator’s job

Shift-Left Quality
Quality/Performance matters in Dev/Staging as well!
Make Dev/CSA/PM dependent on Quality in trunk!
DevOps = start thinking like Ops before you commit
Shift-Right Metrics
Enable Devs to define quality metrics
Make Devs the primary consumers of their metrics

How we increased Sprint Quality
Sprint Reviews Done on “dynaSprint”
• Daily builds get deployed on “dynaDay”, sprint builds on “dynaSprint”
• If you can only show it “on your dev machine”, it's NOT DONE!
Deploy Sprint Builds into our internal Production Environment
• We monitor Website, Support, Licensing, Community ... with Dynatrace
• If we break our own back office software, we ALL feel the pain right away

How we Prioritized Features
• Which Features to Optimize? Which Features to “Phase Out”?
• Allows Reducing Technical and Business Debt

Monitoring as Pipeline & Platform Feature
Dev Perf/Test Ops Biz
Faster Innovation with Quality Gates
Faster Acting on Feedback
Unit Perf
Cont. Perf
New Deploy
New Capability
CI CD Remove/Promote
Triage/Optimize
Update Tests
Innovate/Design
$$$
Lower Costs
Happy Users

Role of Dynatrace DevOps Team
acting as Engineers & Product Managers
Dynatrace Managed/SaaS
Orchestration Layer
Dynatrace
Pipeline Visualization
Deployment Timeline
Log Overview using Dynatrace Log API
JIRA Integrations

https://github.com/Dynatrace/ufo
Raising Awareness of Pipeline Quality

Learnings when scaling DevOps Pipelines
Service Team A
Service Team B
Service Team X
Improve “Efficiency”
Cloud Ops
Ensure “Operational Service”
PM/Biz
Improve “Business”

Be proud of your feature!
DevOps ≠ NoOps

Dynatrace Transformation by the numbers
26 Releases / Year
170 Deployments / Day
31000 Unit & Int Tests / hour
60h UI Tests per Build
More Quality
~200 Code commits / day
340 Stories per sprint
More Agile
93% Production bugs found by Dev
More Stability
450 Global EC2 Instances
99.998% Global Availability

Dynatrace Feedback Loop Use Cases

Dev: Shift-Left - Architectural Regression Decisions
= Functional Result (passed/failed)
+ Web Performance Metrics (# of Images, # of JavaScript, Page Load Time, ...)
+ App Performance Metrics (# of SQL, # of Logs, # of API Calls, # of Exceptions ...)
Fail the build early!
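
The gate described above can be sketched as a small check that a CI step runs after the tests. This is only an illustration under stated assumptions: the names `BuildResult` and `evaluate_gate`, the metric keys and the thresholds are hypothetical and are not taken from the talk or from any Dynatrace API.

```python
from dataclasses import dataclass, field


@dataclass
class BuildResult:
    """Hypothetical container for what a test run reports."""
    tests_passed: bool                           # functional result (passed/failed)
    metrics: dict = field(default_factory=dict)  # e.g. {"sql_calls": 12}


def evaluate_gate(result, thresholds):
    """Return a list of violations; an empty list means the build may proceed."""
    violations = []
    if not result.tests_passed:
        violations.append("functional tests failed")
    for name, limit in thresholds.items():
        value = result.metrics.get(name)
        if value is None:
            # A metric that was expected but not reported also fails the gate.
            violations.append(f"{name}: metric missing")
        elif value > limit:
            violations.append(f"{name}: {value} exceeds limit {limit}")
    return violations


if __name__ == "__main__":
    # Assumed example values, for illustration only.
    result = BuildResult(tests_passed=True, metrics={"sql_calls": 42, "images": 5})
    problems = evaluate_gate(result, {"sql_calls": 20, "images": 10})
    if problems:
        raise SystemExit("Build failed: " + "; ".join(problems))
```

A functionally green build can still be stopped here, which is the point of combining the passed/failed result with web and app performance metrics.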

Perf/Test Use Case: Scalability Decisions
Warm Up Phase: Low Load for a couple of mins
1x Regular Load: Validating scaling behavior. Understanding resource requirements
Peak Load: 2x Regular Load Simulation. Twice the load requires more than twice the resources. Services start failing
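
The observation that twice the load needs more than twice the resources can be expressed as a simple ratio. The sketch below is a hedged illustration: the function names, the 10% tolerance and the sample numbers are assumptions, not measurements from the talk.

```python
def scaling_factor(load_1x, resources_1x, load_2x, resources_2x):
    """Ratio of resource growth to load growth; 1.0 means linear scaling."""
    load_growth = load_2x / load_1x
    resource_growth = resources_2x / resources_1x
    return resource_growth / load_growth


def scales_linearly(factor, tolerance=0.10):
    """True if resources grow no faster than load, within the given tolerance."""
    return factor <= 1.0 + tolerance


if __name__ == "__main__":
    # Assumed numbers: load in requests/s, resources in CPU cores.
    factor = scaling_factor(load_1x=100, resources_1x=4, load_2x=200, resources_2x=12)
    print(f"scaling factor: {factor:.2f}, linear: {scales_linearly(factor)}")
```

A factor clearly above 1.0 at peak load is the signal to look at the architecture before the services start failing.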

Service Teams: Architecture Validation

Service Teams: Continuous Performance Validation
“Performance Signature”
for Build Nov 16
“Performance Signature”
for Build Nov 17

Service Teams: Fact-Based Actions to find Regressions
GOOD BUILD BAD BUILD
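
One way to read the “Performance Signature” comparison is as a metric-by-metric diff between two builds. The following is a minimal sketch, assuming a signature is just a dictionary of metric values; `find_regressions`, the metric names and the 10% threshold are hypothetical.

```python
def find_regressions(baseline, candidate, max_increase=0.10):
    """Return metrics whose value grew by more than max_increase (relative)."""
    regressions = {}
    for name, old in baseline.items():
        new = candidate.get(name)
        if new is None or old == 0:
            # No usable baseline or no new value: nothing to compare.
            continue
        change = (new - old) / old
        if change > max_increase:
            regressions[name] = change
    return regressions


if __name__ == "__main__":
    # Assumed signatures for two consecutive builds.
    good_build = {"sql_calls": 10, "response_ms": 200}
    bad_build = {"sql_calls": 30, "response_ms": 210}
    for metric, change in find_regressions(good_build, bad_build).items():
        print(f"{metric} regressed by {change:.0%}")
```

Comparing each build against the previous one keeps the decision fact-based: a build is “bad” because a named metric moved, not because someone noticed it felt slower.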

4x $$$ to IaaS
Ops: Resource / Cost Driven Decisions

Ops: Resource / Cost Driven Decisions
Deployment of new Release
New service using most of the CPU!

Ops: Deployment Rollback or Keep Decisions

Total Number of Users
per User Experience
Conversion Rate
Biz: User Feedback Driven Decisions

New Features + Day #1 of Mkt Push
Overall increase of Users!
Jump in Conversion Rate!

Users keep growing
Increase # of “tolerating” users!
Lower Conversion than Day #1
Day #2 of Marketing Campaign

Drop in Conversion Rate
Spikes in FRUSTRATED Users!
Hotfix Deployment was rolled out

User Experience Back to Normal
Jump in Conversion Rate!
Fix of the Hotfix was rolled out
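
The sequence above (conversion drops and frustrated users spike after a deployment, then recover after the fix) suggests a simple check that could run after each rollout. This is a sketch under assumptions: the function names, the thresholds and the sample numbers are hypothetical, and the satisfied/tolerating/frustrated split is taken as given input rather than computed.

```python
def conversion_rate(conversions, visits):
    """Share of visits that ended in a conversion."""
    return conversions / visits if visits else 0.0


def frustrated_share(satisfied, tolerating, frustrated):
    """Share of users in the 'frustrated' user experience bucket."""
    total = satisfied + tolerating + frustrated
    return frustrated / total if total else 0.0


def flag_deployment(conv_before, conv_after, frustrated_after,
                    max_relative_drop=0.2, max_frustrated=0.1):
    """True if the deployment should be reviewed for rollback."""
    drop = (conv_before - conv_after) / conv_before if conv_before else 0.0
    return drop > max_relative_drop or frustrated_after > max_frustrated


if __name__ == "__main__":
    # Assumed numbers before and after a hotfix deployment.
    before = conversion_rate(conversions=50, visits=1000)
    after = conversion_rate(conversions=20, visits=1000)
    share = frustrated_share(satisfied=600, tolerating=200, frustrated=200)
    print("review deployment:", flag_deployment(before, after, share))
```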

Biz: User Behavior Driven Decisions

Scaling DevOps in a Cloud Native World with Dynatrace
Service Team A
Service Team B
Service Team X
Improve “Performance Signature”
Continuous Performance, Shift-Left, Failure, Usage Feedback
Cloud Ops
Ensure “Operational Service”
Monitoring as a Service, Capacity Planning, Risk/Cost Control
PM/Biz
Improve “Business Signature”
Usage, Behavior, Costs, Innovate, A/B Testing, …

www.dynatrace.com

Additional Lessons Learned

#1: Going from 6 Months to 1 Month On Premise Updates
• Challenge: Monolith download too big for our customers
• Impact: Update process was error-prone and “All or Nothing”
• Solution: Componentize, Automate Rollout/Rollback Capability, A/B Rollout Model
Increased velocity uncovered bottlenecks!

#2: Education on Frequent Updates
• Challenge: Release Education used to happen 60-90 days after the release
• Impact: Upgrade to latest version happened very late
• Solution: Education integrated into Continuous Delivery: Dev Blogs, YouTube Videos...

#3: Availability of Development / Test Environments
• Challenge: Supporting many different tech stacks makes the environments hard to maintain
• Impact: Long-running support tickets and long feature development
• Solution: Infrastructure as Code gives “On Demand” access to these environments
