Monitoring HOWTOs

Five examples of unique monitoring solutions at Wix, including technical how-tos

  1. Precise information • Alert the right person • Automation
  2. Service is alive • Is my application alive on the minimum number of instances required by my SLA? • 2 out of 5 instances of my-app are not responding to isAlive • my-app requires a minimum of 3 instances to meet the SLA
  3. Diagram: Alert; Sensu (queries Nginx, alert & SLA); ZooKeeper (planned configuration, service owner); Nginx (service load balancer, is-alive)
  4. Diagram: Alert; Sensu (queries Nginx, alert & SLA); ZooKeeper (planned configuration, service owner); Nginx (service load balancer, is-alive). Callouts: alert the right person; precise information; automation
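The check on slides 2 to 4 can be sketched roughly as follows. This is a minimal illustration, not Wix's actual Sensu check: `ServicePlan`, `AliveReport`, and `check_service_alive` are hypothetical names, and the is-alive results and planned configuration are passed in as plain values rather than read from Nginx or ZooKeeper.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class ServicePlan:
    """Planned configuration for one service (hypothetical shape)."""
    name: str
    owner: str
    min_instances: int  # minimum alive instances required by the SLA


@dataclass(frozen=True)
class AliveReport:
    alive: int
    total: int
    breached: bool  # True when fewer instances are alive than the SLA requires
    message: str


def check_service_alive(plan: ServicePlan, is_alive: Mapping[str, bool]) -> AliveReport:
    """Compare per-instance is-alive results against the planned minimum."""
    total = len(is_alive)
    alive = sum(1 for ok in is_alive.values() if ok)
    down = total - alive
    message = (
        f"{down} out of {total} instances of {plan.name} are not responding "
        f"to isAlive; {plan.name} requires a minimum of {plan.min_instances} "
        f"instances to meet the SLA (notify: {plan.owner})"
    )
    return AliveReport(alive, total, alive < plan.min_instances, message)


# Assumed sample data mirroring the numbers on slide 2.
plan = ServicePlan(name="my-app", owner="my-app-team", min_instances=3)
results = {"i1": True, "i2": True, "i3": True, "i4": False, "i5": False}
report = check_service_alive(plan, results)
print(report.breached, report.message)
```

With the slide's numbers (3 of 5 alive, minimum 3) the service sits exactly at its SLA floor, so one more failed instance would breach it. The message carries the counts and the owner, in line with the deck's goals of precise information and alerting the right person.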
  5. Service anomalies: BACKEND • Identify unhealthy KPIs per endpoint • Abnormal increase in error rate for class.method.get • Abnormal increase in performance or decrease in RPM
  6. Diagram: Anomaly alert; Anodot (time series anomaly detection, alerts & graphs); statsd (stats aggregation, forwarding metrics); JVM servers (metrics library, metrics / 1m); Graphs
  7. Diagram: Anomaly alert; Anodot (time series anomaly detection, alerts & graphs); statsd (stats aggregation, forwarding metrics); JVM servers (metrics library, metrics / 1m); Graphs. Callouts: precise information; alert the right person; automation
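To make "abnormal increase in error rate" on slides 5 to 7 concrete, here is a small sketch that flags a per-minute error rate when it sits far above its recent history. It uses a plain z-score as a stand-in; the deck does not describe how Anodot detects anomalies, and the function names and the threshold of 3 are assumptions for illustration only.

```python
from statistics import mean, stdev
from typing import Sequence


def error_rate(errors: int, requests: int) -> float:
    """Error rate for one endpoint over one minute; 0.0 when there was no traffic."""
    return errors / requests if requests else 0.0


def is_abnormal_increase(history: Sequence[float], current: float,
                         threshold: float = 3.0) -> bool:
    """Flag `current` when it is more than `threshold` standard deviations
    above the mean of the recent per-minute values in `history`."""
    if len(history) < 2:
        return False  # not enough data to estimate spread
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return current > mu
    return (current - mu) / sigma > threshold


# Assumed sample: recent per-minute error rates for class.method.get.
recent = [0.010, 0.012, 0.009, 0.011, 0.010]
print(is_abnormal_increase(recent, error_rate(8, 100)))    # large jump
print(is_abnormal_increase(recent, error_rate(11, 1000)))  # within normal range
```

A one-sided test is used because the slide only talks about increases in error rate; a decrease in RPM would need the mirrored check.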
  8. Service anomalies: FRONTEND • Are users affected or not? How and where? • Success event count is low • Client errors increased in Canada from Chrome
  9. Diagram: Anomaly alert; Storm & Esper (realtime streaming processing, metrics / 1m); Client Flume events (two sources); Graphs; Anodot (time series anomaly detection, alerts & graphs)
  10. Diagram: Anomaly alert; Storm & Esper (realtime streaming processing, metrics / 1m); Client Flume events (two sources); Graphs; Anodot (time series anomaly detection, alerts & graphs). Callouts: precise information; alert the right person; automation
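The "metrics / 1m" step on slides 9 and 10 turns raw client events into per-minute counts that can be broken down by where the user is and what browser they use. The sketch below shows that aggregation over an in-memory list; it is only an approximation of the idea, since the deck does this with Storm and Esper over a stream, and `ClientEvent` with its fields is a hypothetical event shape.

```python
from collections import Counter
from typing import Iterable, NamedTuple, Tuple


class ClientEvent(NamedTuple):
    timestamp: float  # Unix time in seconds
    kind: str         # e.g. "success" or "error"
    country: str
    browser: str


def per_minute_counts(events: Iterable[ClientEvent]) -> Counter:
    """Count events per (minute start, kind, country, browser) bucket."""
    counts: Counter = Counter()
    for e in events:
        minute_start = int(e.timestamp // 60) * 60
        counts[(minute_start, e.kind, e.country, e.browser)] += 1
    return counts


# Assumed sample events.
events = [
    ClientEvent(0.0, "error", "CA", "Chrome"),
    ClientEvent(30.0, "error", "CA", "Chrome"),
    ClientEvent(45.0, "success", "US", "Firefox"),
    ClientEvent(61.0, "error", "CA", "Chrome"),
]
counts = per_minute_counts(events)
print(counts[(0, "error", "CA", "Chrome")])
```

Keeping country and browser in the key is what allows an alert as specific as "client errors increased in Canada from Chrome" rather than a single global error count.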
  11. Alerts management • Which active alerts do I have? • What changes could have caused the problem? • I have 2 active alerts on MySQL and 2 deployments in the last hour
  12. Diagram: Alert; BigPanda (central alerts & changes); Alerts & changes; Changes integrations (deployments, Chef uploads); Alerts integrations (New Relic, Sensu, Nagios, Pingdom); Web UI
  13. Diagram: Alert; BigPanda (central alerts & changes); Alerts & changes; Changes integrations (deployments, Chef uploads); Alerts integrations (New Relic, Sensu, Nagios, Pingdom); Web UI. Callouts: precise information; alert the right person; automation
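The question on slide 11, "what changes could have caused the problem?", amounts to looking at active alerts next to recent changes. The following sketch shows that pairing with plain data classes; it does not use BigPanda's API, and `Alert`, `Change`, and `summarize` are hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, List


@dataclass(frozen=True)
class Alert:
    source: str   # e.g. "Nagios"
    subject: str  # e.g. "MySQL"
    active: bool


@dataclass(frozen=True)
class Change:
    kind: str     # e.g. "deployment" or "chef upload"
    at: datetime


def recent_changes(changes: Iterable[Change], now: datetime,
                   window: timedelta = timedelta(hours=1)) -> List[Change]:
    """Changes that happened within `window` before `now`."""
    return [c for c in changes if now - window <= c.at <= now]


def summarize(alerts: Iterable[Alert], changes: Iterable[Change],
              subject: str, now: datetime) -> str:
    """One-line summary of active alerts on a subject and recent deployments."""
    n_alerts = sum(1 for a in alerts if a.active and a.subject == subject)
    n_deploys = sum(1 for c in recent_changes(changes, now) if c.kind == "deployment")
    return (f"{n_alerts} active alerts on {subject} "
            f"and {n_deploys} deployments in the last hour")


# Assumed sample data.
now = datetime(2015, 12, 23, 12, 0)
alerts = [Alert("Nagios", "MySQL", True), Alert("Sensu", "MySQL", True),
          Alert("Pingdom", "my-app", False)]
changes = [Change("deployment", now - timedelta(minutes=10)),
           Change("deployment", now - timedelta(minutes=50)),
           Change("deployment", now - timedelta(hours=3)),
           Change("chef upload", now - timedelta(minutes=5))]
summary = summarize(alerts, changes, "MySQL", now)
print(summary)
```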
  14. Questions?
  • dotxymox

    Dec. 23, 2015
