SlideShare a Scribd company logo
1 of 41
Download to read offline
Florian Lautenschlager
florian.lautenschlager@qaware.de
@flolaut
Observability in action! the wild!
Hamburg, 20. Juni 2018
Josef Fuchshuber
josef.fuchshuber@qaware.de
@fuchshuber
Florian Lautenschlager
& &&
@&
Josef Fuchshuber
& &&
@&@
Observability in action the wild! 5
In our cloud backend we have a vital microservice ecosystem.
Our team is just as vital and heterogeneous as our software.
Observability in action the wild! 6
Platform Developer
App Developer
Skill Developer
Client Developer
Tester
Ops
Help Desk
Product Management
Data Scientist
UX Designer
Observability isn't just for operations.
What is the hardest step in the DevOps process?
Observability in action the wild! 8
DEV OPS
Much better: The 6 Cs of the DevOps Cycle.
Observability in action the wild! 9Source: https://dzone.com/articles/6-cs-of-devops-adoption
Observability in the wild!
A case study… and how we found
collaborative monitoring.
Monitoring Toolchain: Simply Cloud Native Standard.
Observability in action the wild! 11
Metrics Events Traces
Java (Spring Boot) or Python
on
Azure / Kubernetes / Openshift / Docker
Monitoring
Technical and Functional
Observability in action the wild! 12
Hardware
Hypervisor
Operating System
Kubernetes
Docker
Runtime
Application
Generic monitoring that
does not need knowledge
about the application.
Monitoring that does
need knowledge about
the application.
Health of platform and application Telemetry data
Infrastructure-Monitoring
Application-Monitoring
Monitoring
Technical and Functional
Observability in action the wild! 13
Questions:
Services are up and running
Services can accept traffic
Sources:
Kubestate-Exporter
Prometheus-Node-Exporter
JMX, top, iostat etc.
Questions:
Use-Cases runtimes
Service level agreements
Sources:
Specific instrumentation
(around use cases, etc.)
Health of platform and application Telemetry data
Hardware
Hypervisor
Operating System
Kubernetes
Docker
Runtime
Application
Infrastructure-Monitoring
Application-Monitoring
USE Dashboard
Observability in action the wild!
RED Dashboard
Observability in action the wild!
I know. Most of you do this already. But what about ..
Observability in action the wild! 16
Collaborative Monitoring!?!?
An example is the best explanation.
Observability in action the wild! 17
and a chatbot…
and a monitoring toolchain…
Once there was a little tiny application…
Observability in action the wild! 18
Observability in action the wild! 19
Snip Snap
Links request
with trace and
logs.
verbose
Observability in action the wild! 20
Or in case
of an error
Observability in action the wild! 21
Total duration
Involved services <click>
Standard Zipkin Features
Code-Slide: Standardize tracing and metrics.
Observability in action the wild! 22
Traces and metrics for every database call
with standardized names and trace tags.
database_call_duration{repository=yy, Call=zz}
Code-Slide: Standardize tracing logs and tags.
Observability in action the wild! 23
Span logs: We model database calls as well
as other expensive calls as logs using a
template to reduce the size of traces:
db:<Repo>.<Call> took: xx ms.
call:<Class>.<Method> took: xx ms.
Span tags: Used to model values that are
valid for a span. We use a template to
standardize tags.
span.tag. (to mark our tags)
Environment (staging, integration , etc.)
db (to mark spans with db calls.)
param.<name>=value (call parameters)
Observability in action the wild! 24
Logs for
a given
trace
Involved
Services
Standard EFK + Contextual Logging
Code-Slide: Contextual logging.
Observability in action the wild! 25
Context of a log event.
Everyone can easily see the logs for a specific context (trace etc.)
Observability in action the wild! 26
Or for checking
the health of
the services
Observability in action the wild! 27
Or for checking
the status of
e2e tests
end-2-end tests are also integrated in our observability stack.
Observability in action the wild! 28
See the logs
VIDEO =)
Run in their own docker containers
execute spock tests periodically
and export Prometheus metrics
Our current setup: A chatbot as generic interface.
Observability in action the wild! 29
Development Setup!
and even our help desk / first level support.
Observability in action the wild! 30
Production Setup!
Early prototype of the Customer Care Observability Tool.
Observability in action the wild! 31
Activate tracing
for a user
Health
Checks
+
e2e
Logs
Observability in action the wild! 32
Ease of communication
within bug tickets.
Observability in action the wild!
Happy end.
33
Summary
Collaborative Monitoring.
Observability in action the wild! 35
Monitoring that allows everyone to benefit of without
the need of expert knowledge.
Three steps to enable collaborative monitoring.
Observability in action the wild! 36
Standardize
metrics, logs
and traces
Link and
combine them
as far as
possible
Integrate them
into everyone's
tools
Start Here
Correlate Events and Trace by Context
Metrics with Events and Traces by Time
Structured Logging + Context, Metric names, etc.
Tools your team
Did we create an uncontrollable observability monster?
Observability in action the wild! 37
There’s No
Such Thing as
a Free Lunch
• The more complex a
microservice architecture is,
the more sophisticated the
observability solution must be.
• For Collaborative Observability
there is no out of the box
solution.
Observability in action the wild! 38
Collaborative Monitoring by everyone.
Observability in action the wild! 39
Ease of use.
Simple general interface to access various monitoring tools.
Integrated into everyone's daily tools (ChatBots, E-Mail, etc.)
Support all kinds of teams: Operations / Dev-Ops / Developers / QA-Team / My mum =)
Allow everyone to get superman insights.
Decrease Mean Time To Recovery (MTTR) with a fast analysis
Integrates different kinds of monitoring data (traces, metrics and logs) of different monitoring layers.
The right information. Provide relevant information for different teams, e.g. runtimes for perf. engineer.
Level of Detail: Abstract (use case level) for management vs. details (database calls) for developers
The behavior of system is not just a single metric.
Lessons Learned
Observability in action the wild! 40
Tool stack is awesome: Prometheus, Sleuth / Zipkin, Logging (fluentD, elastic) is stable with a good
documentation.
Maximum flexibility compared to commercial products.
But: Effort for concepts, implementation and quality checks. Conventions and rulesets are important!
Mindset: We found that we had to convince people first. But we have seen a high level of acceptance.
Example: Chatbot with trace-links is standard tool for discussing possible bugs between all project roles.
Development and system understanding: No need of “cloudy” conversations. Just provide the context, e.g. a
trace id.
Example: Issues typically contain the context (trace id) that points the developer to the logs and the trace.
Observability in action the wild! 41
Any Questions?
Come to our
booth
We’re hiring!
#CloudNativeNerd
#CloudKoffer
chatbot:
cvi scale
up team

More Related Content

More from QAware GmbH

Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
 Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See... Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
QAware GmbH
 

More from QAware GmbH (20)

Cloud Migration mit KI: der Turbo
Cloud Migration mit KI: der Turbo Cloud Migration mit KI: der Turbo
Cloud Migration mit KI: der Turbo
 
Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
 Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See... Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
Migration von stark regulierten Anwendungen in die Cloud: Dem Teufel die See...
 
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
 
Endlich gute API Tests. Boldly Testing APIs Where No One Has Tested Before.
Endlich gute API Tests. Boldly Testing APIs Where No One Has Tested Before.Endlich gute API Tests. Boldly Testing APIs Where No One Has Tested Before.
Endlich gute API Tests. Boldly Testing APIs Where No One Has Tested Before.
 
Kubernetes with Cilium in AWS - Experience Report!
Kubernetes with Cilium in AWS - Experience Report!Kubernetes with Cilium in AWS - Experience Report!
Kubernetes with Cilium in AWS - Experience Report!
 
50 Shades of K8s Autoscaling
50 Shades of K8s Autoscaling50 Shades of K8s Autoscaling
50 Shades of K8s Autoscaling
 
Kontinuierliche Sicherheitstests für APIs mit Testkube und OWASP ZAP
Kontinuierliche Sicherheitstests für APIs mit Testkube und OWASP ZAPKontinuierliche Sicherheitstests für APIs mit Testkube und OWASP ZAP
Kontinuierliche Sicherheitstests für APIs mit Testkube und OWASP ZAP
 
Service Mesh Pain & Gain. Experiences from a client project.
Service Mesh Pain & Gain. Experiences from a client project.Service Mesh Pain & Gain. Experiences from a client project.
Service Mesh Pain & Gain. Experiences from a client project.
 
50 Shades of K8s Autoscaling
50 Shades of K8s Autoscaling50 Shades of K8s Autoscaling
50 Shades of K8s Autoscaling
 
Blue turns green! Approaches and technologies for sustainable K8s clusters.
Blue turns green! Approaches and technologies for sustainable K8s clusters.Blue turns green! Approaches and technologies for sustainable K8s clusters.
Blue turns green! Approaches and technologies for sustainable K8s clusters.
 
Per Anhalter zu Cloud Nativen API Gateways
Per Anhalter zu Cloud Nativen API GatewaysPer Anhalter zu Cloud Nativen API Gateways
Per Anhalter zu Cloud Nativen API Gateways
 
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
 
How to speed up Spring Integration Tests
How to speed up Spring Integration TestsHow to speed up Spring Integration Tests
How to speed up Spring Integration Tests
 
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-ClusterAus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
Aus blau wird grün! Ansätze und Technologien für nachhaltige Kubernetes-Cluster
 
Cloud Migration – Eine Strategie die funktioniert
Cloud Migration – Eine Strategie die funktioniertCloud Migration – Eine Strategie die funktioniert
Cloud Migration – Eine Strategie die funktioniert
 
Policy Driven Microservices mit Open Policy Agent
Policy Driven Microservices mit Open Policy AgentPolicy Driven Microservices mit Open Policy Agent
Policy Driven Microservices mit Open Policy Agent
 
Make Developers Fly: Principles for Platform Engineering
Make Developers Fly: Principles for Platform EngineeringMake Developers Fly: Principles for Platform Engineering
Make Developers Fly: Principles for Platform Engineering
 
Security Lab: OIDC in der Praxis
Security Lab: OIDC in der PraxisSecurity Lab: OIDC in der Praxis
Security Lab: OIDC in der Praxis
 
Die nächsten 100 Microservices
Die nächsten 100 MicroservicesDie nächsten 100 Microservices
Die nächsten 100 Microservices
 
Enterprise-level Kubernetes Security mit Open Source Tools - geht das?
Enterprise-level Kubernetes Security mit Open Source Tools - geht das?Enterprise-level Kubernetes Security mit Open Source Tools - geht das?
Enterprise-level Kubernetes Security mit Open Source Tools - geht das?
 

Recently uploaded

Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 

Recently uploaded (20)

Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 

Observability in a̶c̶t̶i̶o̶n̶! the wild!

  • 1. Florian Lautenschlager florian.lautenschlager@qaware.de @flolaut Observability in action! the wild! Hamburg, 20. Juni 2018 Josef Fuchshuber josef.fuchshuber@qaware.de @fuchshuber
  • 4.
  • 5. Observability in action the wild! 5 In our cloud backend we have a vital microservice ecosystem.
  • 6. Our team is just as vital and heterogeneous as our software. Observability in action the wild! 6 Platform Developer App Developer Skill Developer Client Developer Tester Ops Help Desk Product Management Data Scientist UX Designer
  • 7. Observability isn't just for operations.
  • 8. What is the hardest step in the DevOps process? Observability in action the wild! 8 DEV OPS
  • 9. Much better: The 6 Cs of the DevOps Cycle. Observability in action the wild! 9Source: https://dzone.com/articles/6-cs-of-devops-adoption
  • 10. Observability in the wild! A case study… and how we found collaborative monitoring.
  • 11. Monitoring Toolchain: Simply Cloud Native Standard. Observability in action the wild! 11 Metrics Events Traces Java (Spring Boot) or Python on Azure / Kubernetes / Openshift / Docker
  • 12. Monitoring Technical and Functional Observability in action the wild! 12 Hardware Hypervisor Operating System Kubernetes Docker Runtime Application Generic monitoring that does not need knowledge about the application. Monitoring that does need knowledge about the application. Health of platform and application Telemetry data Infrastructure-Monitoring Application-Monitoring
  • 13. Monitoring Technical and Functional Observability in action the wild! 13 Questions: Services are up and running Services can accept traffic Sources: Kubestate-Exporter Prometheus-Node-Exporter JMX, top, iostat etc. Questions: Use-Cases runtimes Service level agreements Sources: Specific instrumentation (around use cases, etc.) Health of platform and application Telemetry data Hardware Hypervisor Operating System Kubernetes Docker Runtime Application Infrastructure-Monitoring Application-Monitoring
  • 14. USE Dashboard Observability in action the wild!
  • 15. RED Dashboard Observability in action the wild!
  • 16. I know. Most of you do this already. But what about .. Observability in action the wild! 16 Collaborative Monitoring!?!?
  • 17. An example is the best explanation. Observability in action the wild! 17 and a chatbot… and a monitoring toolchain… Once there was a little tiny application…
  • 18. Observability in action the wild! 18
  • 19. Observability in action the wild! 19 Snip Snap Links request with trace and logs. verbose
  • 20. Observability in action the wild! 20 Or in case of an error
  • 21. Observability in action the wild! 21 Total duration Involved services <click> Standard Zipkin Features
  • 22. Code-Slide: Standardize tracing and metrics. Observability in action the wild! 22 Traces and metrics for every database call with standardized names and trace tags. database_call_duration{repository=yy, Call=zz}
  • 23. Code-Slide: Standardize tracing logs and tags. Observability in action the wild! 23 Span logs: We model database calls as well as other expensive calls as logs using a template to reduce the size of traces: db:<Repo>.<Call> took: xx ms. call:<Class>.<Method> took: xx ms. Span tags: Used to model values that are valid for a span. We use a template to standardize tags. span.tag. (to mark our tags) Environment (staging, integration , etc.) db (to mark spans with db calls.) param.<name>=value (call parameters)
  • 24. Observability in action the wild! 24 Logs for a given trace Involved Services Standard EFK + Contextual Logging
  • 25. Code-Slide: Contextual logging. Observability in action the wild! 25 Context of a log event. Everyone can easily see the logs for a specific context (trace etc.)
  • 26. Observability in action the wild! 26 Or for checking the health of the services
  • 27. Observability in action the wild! 27 Or for checking the status of e2e tests
  • 28. end-2-end tests are also integrated in our observability stack. Observability in action the wild! 28 See the logs VIDEO =) Run in their own docker containers execute spock tests periodically and export Prometheus metrics
  • 29. Our current setup: A chatbot as generic interface. Observability in action the wild! 29 Development Setup!
  • 30. and even our help desk / first level support. Observability in action the wild! 30 Production Setup!
  • 31. Early prototype of the Customer Care Observability Tool. Observability in action the wild! 31 Activate tracing for a user Health Checks + e2e Logs
  • 32. Observability in action the wild! 32 Ease of communication within bug tickets.
  • 33. Observability in action the wild! Happy end. 33
  • 35. Collaborative Monitoring. Observability in action the wild! 35 Monitoring that allows everyone to benefit of without the need of expert knowledge.
  • 36. Three steps to enable collaborative monitoring. Observability in action the wild! 36 Standardize metrics, logs and traces Link and combine them as far as possible Integrate them into everyone's tools Start Here Correlate Events and Trace by Context Metrics with Events and Traces by Time Structured Logging + Context, Metric names, etc. Tools your team
  • 37. Did we create an uncontrollable observability monster? Observability in action the wild! 37
  • 38. There’s No Such Thing as a Free Lunch • The more complex a microservice architecture is, the more sophisticated the observability solution must be. • For Collaborative Observability there is no out of the box solution. Observability in action the wild! 38
  • 39. Collaborative Monitoring by everyone. Observability in action the wild! 39 Ease of use. Simple general interface to access various monitoring tools. Integrated into everyone's daily tools (ChatBots, E-Mail, etc.) Support all kinds of teams: Operations / Dev-Ops / Developers / QA-Team / My mum =) Allow everyone to get superman insights. Decrease Mean Time To Recovery (MTTR) with a fast analysis Integrates different kinds of monitoring data (traces, metrics and logs) of different monitoring layers. The right information. Provide relevant information for different teams, e.g. runtimes for perf. engineer. Level of Detail: Abstract (use case level) for management vs. details (database calls) for developers The behavior of system is not just a single metric.
  • 40. Lessons Learned Observability in action the wild! 40 Tool stack is awesome: Prometheus, Sleuth / Zipkin, Logging (fluentD, elastic) is stable with a good documentation. Maximum flexibility compared to commercial products. But: Effort for concepts, implementation and quality checks. Conventions and rulesets are important! Mindset: We found that we had to convince people first. But we have seen a high level of acceptance. Example: Chatbot with trace-links is standard tool for discussing possible bugs between all project roles. Development and system understanding: No need of “cloudy” conversations. Just provide the context, e.g. a trace id. Example: Issues typically contain the context (trace id) that points the developer to the logs and the trace.
  • 41. Observability in action the wild! 41 Any Questions? Come to our booth We’re hiring! #CloudNativeNerd #CloudKoffer chatbot: cvi scale up team