Monitoring and observability

•Télécharger en tant que PPTX, PDF•

0 j'aime•195 vues

Monitoring involves collecting logs, metrics and alerts to detect issues, while observability provides insight into internal system states. The presenter faced problems determining causes of performance drops. They will discuss starting with monitoring basics like logging, tracing and metrics. They will then explain how to transition to domain-oriented observability through techniques like aspect-oriented programming to better understand the system. Observability aims to answer any questions about internal states using monitoring tools.

Logiciels

From Monitoring to
Domain-Oriented
Observability
What’s the difference between monitoring and
observability and why does it matter?

2. About me
● Took part in developing of microservice architecture based on
Event Sourcing and CQRS
● I have obtained a position of Tech Lead. I implemented Canary
release and Feature Toggles (aka Feature Flags), migrated
microservice from REST to Event-Driven
● Made a webinar about microservices testing.
● I am trying to apply best engineering practices to Safeguard Cyber project
● I want to build such a process in a company (team) in which it will be pleasant to work and develop
professionally. Where ideas will be heard, where a person will not need to sacrifice his family or
health for professional growth.

3. Parts of presentation
● Why you should start thinking about monitoring and our case
● Theoretical minimum about monitoring
● How to switch from monitoring to Domain-oriented Observability

4. Part 1
● Why you should start thinking about monitoring and our case

14. Problems we faced with
● What is the cause of performance drop?
● What can lead to poor system performance?
● How can certain changes influence the system?
● Our product is difficult to fit in SLA

15. Part 2
● Theoretical minimum about monitoring

16 Monitoring
1) Logging
2) Tracing
3) Metrics
4) Alerts

“A log is an immutable,
timestamped record of event
describing what happened over
time “
17 Logging

“A trace is a representation of series
of causally related distributed events
that encode the end-to-end request
flow through a distributed system”
21. Tracing

“Metrics are a numeric
representation of data
measure over intervals of
time”
23. Metrics

● throughput
● success
● error
● performance
27. Metrics subtypes

● Rate - the number of requests, per second, you services are serving.
● Errors - the number of failed requests per second.
● Duration - distributions of the amount of time each request takes.
28. Three key metrics by RED methodology

Automated alerts are essential to
monitoring. They allow you to spot
problems anywhere in your
infrastructure, so that you can
rapidly identify their causes and
minimize service degradation and
disruption. Alerts draw human
attention to the particular systems
that require observation,
inspection, and intervention.
31. Alerts

● There should be people’s reaction
● Alert should have priority
● There should be possibility to disable notifications
● Alert should provide further instructions
32. Alert rules

34. Part 3
● How to switch from monitoring to Domain-oriented Observability

Definition:
“In control theory, observability is a measure of how well internal states of a
system can be inferred from knowledge of its external outputs. The observability
and controllability of a system are mathematical duals.”
- Wikipedia
In English:
Can you understand what’s happening inside your code and system, simply by
asking questions using your tools? Can you answer any new question you think
of, or only the ones you prepared for?
35. Observability

42. Domain Probe: DiscountInstrumentation

● AOP
● DECORATOR
● λάμβδα
43. Other opportunities

Start being proactive
Don’t be firefighters

● The RED Method
● Monitoring Distributed Systems
● Domain-Oriented Observability
● Distributed Systems Observability by Cindy Sridharan
● Testing in Production, the safe way
● Deploy != Release part1 and part2
● SRE: Observability: Metric Namespaces and Structures
● Observability: Metric, Logging, and Tracing
● Decorator
● Monitoring in the time of Cloud Native
● https://www.elastic.co/learn
46. Resources:

Recommandé

How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...Splunk

ObservabilityMaganathin Veeraragaloo

Cloud-Native ObservabilityTyler Treat

Observability – the good, the bad, and the uglyTimetrix

Observability vs APM vs Monitoring Comparisonjeetendra mandal

Observability, Distributed Tracing, and Open Source: The Missing PrimerVMware Tanzu

Observability & DatadogJamesAnderson599331

THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioDevOpsDays Tel Aviv

Recommandé

How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...Splunk

ObservabilityMaganathin Veeraragaloo

Cloud-Native ObservabilityTyler Treat

Observability – the good, the bad, and the uglyTimetrix

Observability vs APM vs Monitoring Comparisonjeetendra mandal

Observability, Distributed Tracing, and Open Source: The Missing PrimerVMware Tanzu

Observability & DatadogJamesAnderson599331

THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioDevOpsDays Tel Aviv

Observability at Scale Knoldus Inc.

ObservabilityDiego Pacheco

Observability Enes Altınok

Observability-101Piyush Baderia

More Than Monitoring: How Observability Takes You From Firefighting to Fire P...DevOps.com

MeasureWorks - Performance Labs - Why Observability Matters!MeasureWorks

Observability For Modern ApplicationsAmazon Web Services

Monitoring & ObservabilityLumban Sopian

Observability, what, why and howNeeraj Bagga

Observability for modern applications MoovingON

Combining Logs, Metrics, and Traces for Unified ObservabilityElasticsearch

Principles of System Observability Janis Orlovs

SRE 101Diego Pacheco

.conf Go 2022 - Observability SessionSplunk

ObservabilityEbru Cucen Çüçen

Opentelemetry - From frontend to backendSebastian Poxhofer

Logging and observabilityAnton Drukh

Monitor every app, in every stage, with free and open Elastic APMElasticsearch

OpenTelemetry For ArchitectsKevin Brockhoff

OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdfNETWAYS

Observability in highly distributed systemsDevOps Indonesia

5 Clear Signs You Need Security Policy AutomationTufin

Contenu connexe

Tendances

Observability at Scale Knoldus Inc.

ObservabilityDiego Pacheco

Observability Enes Altınok

Observability-101Piyush Baderia

More Than Monitoring: How Observability Takes You From Firefighting to Fire P...DevOps.com

MeasureWorks - Performance Labs - Why Observability Matters!MeasureWorks

Observability For Modern ApplicationsAmazon Web Services

Monitoring & ObservabilityLumban Sopian

Observability, what, why and howNeeraj Bagga

Observability for modern applications MoovingON

Combining Logs, Metrics, and Traces for Unified ObservabilityElasticsearch

Principles of System Observability Janis Orlovs

SRE 101Diego Pacheco

.conf Go 2022 - Observability SessionSplunk

ObservabilityEbru Cucen Çüçen

Opentelemetry - From frontend to backendSebastian Poxhofer

Logging and observabilityAnton Drukh

Monitor every app, in every stage, with free and open Elastic APMElasticsearch

OpenTelemetry For ArchitectsKevin Brockhoff

OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdfNETWAYS

Tendances (20)

Observability at Scale

Observability

Observability-101

More Than Monitoring: How Observability Takes You From Firefighting to Fire P...

MeasureWorks - Performance Labs - Why Observability Matters!

Observability For Modern Applications

Monitoring & Observability

Observability, what, why and how

Observability for modern applications

Combining Logs, Metrics, and Traces for Unified Observability

Principles of System Observability

SRE 101

.conf Go 2022 - Observability Session

Observability

Opentelemetry - From frontend to backend

Logging and observability

Monitor every app, in every stage, with free and open Elastic APM

OpenTelemetry For Architects

OSMC 2022 | OpenTelemetry 101 by Dotan Horovit s.pdf

Similaire à Monitoring and observability

Observability in highly distributed systemsDevOps Indonesia

5 Clear Signs You Need Security Policy AutomationTufin

ThirdEye - LinkedIn's Business-wide monitoring platformAkshay Rai

Top 10 Practices of Highly Successful DevOps Incident Management TeamsMatthew Boeckman

Monitorama - Please, no more Minutes, Milliseconds, Monoliths or Monitoring T...Adrian Cockcroft

CeritaLakeisha Jones

Top 10 Practices of Highly Successful DevOps Incident Management TeamsDeborah Schalm

Top 10 Practices of Highly Successful DevOps Incident Management TeamsDevOps.com

BSIT3CD_Continuation of Cyber incident response (1).pdfStevenJoeBiago

Cloud Native DevOpsJim Bugwadia

Monitoring - deeper diveRobert Kubiś

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS

What is Platform Observability? An OverviewKumar Kolaganti

VTU 5TH SEM CSE SOFTWARE ENGINEERING SOLVED PAPERS - JUN13 DEC13 JUN14 DEC14 ...vtunotesbysree

From sensor readings to prediction: on the process of developing practical so...Manuel Martín

Clone of an organizationIRJET Journal

Agile Gurugram 2023 | Observability for Modern Applications. How does it help...AgileNetwork

Observability for Application Developers (1)-1.pptxOpsTree solutions

The differing ways to monitor and instrumentJonah Kowall

WTF is a Microservice - Rafael Schloming, DatawireAmbassador Labs

Similaire à Monitoring and observability (20)

Observability in highly distributed systems

5 Clear Signs You Need Security Policy Automation

ThirdEye - LinkedIn's Business-wide monitoring platform

Top 10 Practices of Highly Successful DevOps Incident Management Teams

Monitorama - Please, no more Minutes, Milliseconds, Monoliths or Monitoring T...

Cerita

Top 10 Practices of Highly Successful DevOps Incident Management Teams

BSIT3CD_Continuation of Cyber incident response (1).pdf

Cloud Native DevOps

Monitoring - deeper dive

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...

What is Platform Observability? An Overview

VTU 5TH SEM CSE SOFTWARE ENGINEERING SOLVED PAPERS - JUN13 DEC13 JUN14 DEC14 ...

From sensor readings to prediction: on the process of developing practical so...

Clone of an organization

Agile Gurugram 2023 | Observability for Modern Applications. How does it help...

Observability for Application Developers (1)-1.pptx

The differing ways to monitor and instrument

WTF is a Microservice - Rafael Schloming, Datawire

Plus de Danylenko Max

How to write clean testsDanylenko Max

Consumer Driven Contract.pdfDanylenko Max

Consumer driven contractDanylenko Max

Fail fast! approachDanylenko Max

How to successfullygrow a code review cultureDanylenko Max

Testing microservicesDanylenko Max

Plus de Danylenko Max (6)

How to write clean tests

Consumer Driven Contract.pdf

Consumer driven contract

Fail fast! approach

How to successfullygrow a code review culture

Testing microservices

Dernier

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave

%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfonteinmasabamasaba

%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburgmasabamasaba

%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba

%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba

AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek

Generic or specific? Making sensible software design decisionsBert Jan Schrijver

Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...Nitya salvi

Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions

Announcing Codolex 2.0 from GDK SoftwareJim McKeeth

%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba

%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...masabamasaba

The Top App Development Trends Shaping the Industry in 2024-25 .pdfayushiqss

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls

Direct Style Effect Systems -The Print[A] Example- A Comprehension AidPhilip Schwarz

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda

8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82

%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba

%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba

%in Durban+277-882-255-28 abortion pills for sale in Durbanmasabamasaba

Dernier (20)

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...

%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein

%in Lydenburg+277-882-255-28 abortion pills for sale in Lydenburg

%in tembisa+277-882-255-28 abortion pills for sale in tembisa

%in ivory park+277-882-255-28 abortion pills for sale in ivory park

AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques

Generic or specific? Making sensible software design decisions

Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...

Introducing Microsoft’s new Enterprise Work Management (EWM) Solution

Announcing Codolex 2.0 from GDK Software

%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...

%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...

The Top App Development Trends Shaping the Industry in 2024-25 .pdf

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️

Direct Style Effect Systems -The Print[A] Example- A Comprehension Aid

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...

8257 interfacing 2 in microprocessor for btech students

%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...

%in Midrand+277-882-255-28 abortion pills for sale in midrand

%in Durban+277-882-255-28 abortion pills for sale in Durban

Monitoring and observability

1. From Monitoring to Domain-Oriented Observability What’s the difference between monitoring and observability and why does it matter?

2. 2. About me ● Took part in developing of microservice architecture based on Event Sourcing and CQRS ● I have obtained a position of Tech Lead. I implemented Canary release and Feature Toggles (aka Feature Flags), migrated microservice from REST to Event-Driven ● Made a webinar about microservices testing. ● I am trying to apply best engineering practices to Safeguard Cyber project ● I want to build such a process in a company (team) in which it will be pleasant to work and develop professionally. Where ideas will be heard, where a person will not need to sacrifice his family or health for professional growth.

3. 3. Parts of presentation ● Why you should start thinking about monitoring and our case ● Theoretical minimum about monitoring ● How to switch from monitoring to Domain-oriented Observability

4. 4. Part 1 ● Why you should start thinking about monitoring and our case

5. 5. About our product

6. 6. Threat Detection

7. 7. Cyber Defense

8. 8. Machine Learning and AI

9. 9. Fire on Production

10. 10. Fire on production

11. 11. Fire on production

12. 12. Fire on production

13. 12. Fire on production

14. 14. Problems we faced with ● What is the cause of performance drop? ● What can lead to poor system performance? ● How can certain changes influence the system? ● Our product is difficult to fit in SLA

15. 15. Part 2 ● Theoretical minimum about monitoring

16. 16 Monitoring 1) Logging 2) Tracing 3) Metrics 4) Alerts

17. “A log is an immutable, timestamped record of event describing what happened over time “ 17 Logging

18. 18. Kibana filters

19. 19. Kibana filters result

20. 20. Errors frequency analysis

21. “A trace is a representation of series of causally related distributed events that encode the end-to-end request flow through a distributed system” 21. Tracing

22. 22. Tracing

23. “Metrics are a numeric representation of data measure over intervals of time” 23. Metrics

24. 24. Metrics

25. 25. Metrics

26. 26. Metrics

27. ● throughput ● success ● error ● performance 27. Metrics subtypes

28. ● Rate - the number of requests, per second, you services are serving. ● Errors - the number of failed requests per second. ● Duration - distributions of the amount of time each request takes. 28. Three key metrics by RED methodology

29. 31. Trends are very important.

30. 30. Trends are very important

31. Automated alerts are essential to monitoring. They allow you to spot problems anywhere in your infrastructure, so that you can rapidly identify their causes and minimize service degradation and disruption. Alerts draw human attention to the particular systems that require observation, inspection, and intervention. 31. Alerts

32. ● There should be people’s reaction ● Alert should have priority ● There should be possibility to disable notifications ● Alert should provide further instructions 32. Alert rules

33. 33. Alerts in Kibana

34. 34. Part 3 ● How to switch from monitoring to Domain-oriented Observability

35. Definition: “In control theory, observability is a measure of how well internal states of a system can be inferred from knowledge of its external outputs. The observability and controllability of a system are mathematical duals.” - Wikipedia In English: Can you understand what’s happening inside your code and system, simply by asking questions using your tools? Can you answer any new question you think of, or only the ones you prepared for? 35. Observability

36. 36. Black and White box Component View

37. 37. Observability code example

38. 38. Observability code example

39. 39. Cleanup the mess

40. 40. Cleanup the mess

41. 41. Moving code to class

42. 42. Domain Probe: DiscountInstrumentation

43. ● AOP ● DECORATOR ● λάμβδα 43. Other opportunities

44. 44. Testing + Monitoring

45. 45. Testing + Monitoring

46. Start being proactive Don’t be firefighters

47. ● The RED Method ● Monitoring Distributed Systems ● Domain-Oriented Observability ● Distributed Systems Observability by Cindy Sridharan ● Testing in Production, the safe way ● Deploy != Release part1 and part2 ● SRE: Observability: Metric Namespaces and Structures ● Observability: Metric, Logging, and Tracing ● Decorator ● Monitoring in the time of Cloud Native ● https://www.elastic.co/learn 46. Resources:

48. Questions