SlideShare une entreprise Scribd logo
1  sur  16
Télécharger pour lire hors ligne
HAWKULAR ALERTING
Flexible and scalable alerting services
Lucas Ponce
Hawkular Team
November 2016
Hawkular Alerting2
CLICK TO ADD TEXT
Hawkular Alerting3
Hawkular Alerting
In a nutshell
● Generic alerting services
Suitable for multiple business domains
● Flexible and scalable
Simple REST API
Cassandra backend
Cluster ready
● Pluggable actions architecture
Out of the box plugins
Simplified API to develop new plugins
Hawkular Alerting4
Action Plugins
Generic Alerting Services
Alerting philosophy
Trigger definitions
X > 10, Y < 20
N < 80% of M
K is DOWN
E.url=/x/y/z and E.time > 10s
Data
Events Events
Alerts
Hawkular Alerting5
Generic Alerting Services
Data, Events and Alerts
Data
Events
Alerts
Time-series data (gauges, counters, availability)
Stored on Hawkular Metrics
Simple condition expressions
Generic representation of a captured happening
Structured data
Internal: generated by the Alerting engine
External: provided as data
Events that required human attention
Lifecycle (Open, Acknowledged, Resolved)
Internal: generated by the Alerting engine
Hawkular Alerting6
Generic Alerting Services
Triggers, Conditions and Dampenings
Trigger definitions
X > 10, Y < 20
N < 80% of M
K is DOWN
E.url=/x/y/z and E.time > 10s
Trigger
IF Conditions are met
THEN Fires an Event or Alert
Execute Actions
Conditions
Triggers can have one or more conditions
Triggers can fire when ALL or ANY of conditions are met
Dampening
Triggers can fire every time Conditions are met, or
Define dampening rules like
responseTime > 10s [4 times in 5 minutes]
Hawkular Alerting7
Generic Alerting Services
Alert Lifecycle
Alerts
Open
Acknowledged
Resolved
Alert is fired by a Trigger
Alert is acknowledged by a User
Starting investigation on it
Alert can be resolved by a User, or
Alert can be resolved automatically
Lifecycle transitions can execute Actions
Action Plugins
Hawkular Alerting8
Generic Alerting Services
AUTORESOLVE
Alerts
Open
Resolved
Trigger can define
FIRING Conditions (when a bad situation happens)
Generates an Alert
AUTORESOLVE Conditions (when the situation is gone)
Automatically resolves the Alert
i.e.
FIRING responseTime > 10s
AUTORESOLVE responseTime <= 10s
Hawkular Alerting9
MemberTrigger definition
ResponseTime-A > 10s in last 5m
MemUsed-A > 500Mb
DiskUsed-A > 10 Gb
MemberTrigger definition
ResponseTime-A > 10s in last 5m
MemUsed-A > 500Mb
DiskUsed-A > 10 Gb
Generic Alerting Services
Managing Group of Triggers
GroupTrigger
It acts like a template
GroupTrigger definition
ResponseTime > 10s in last 5m
MemUsed > 500Mb
DiskUsed > 10 Gb
MemberTriggers
Generated from a GroupTrigger
Personalize data Ids
Managed from parent GroupTrigger
i.e. GroupTrigger applied to
machines [A, B, C]
MemberTrigger definition
ResponseTime-A > 10s in last 5m
MemUsed-A > 500Mb
DiskUsed-A > 10 Gb
Hawkular Alerting10
Generic Alerting Services
Conditions
Condition Type Examples
AVAILABILITY X is DOWN, Y is NOT_UP
COMPARE X > 80% of Y
EVENT
event.id starts 'IDXYZ', event.tag.category == 'Server',
event.tag.from ends '.com'
EXTERNAL Delegate the evaluation on external pluggable Alerters
MISSING Not X in last 5 minutes
RATE X > 10 per-minute
STRING X matches “A.*B”
RANGE X inside [10, 20), X outside [100,200]
THRESHOLD X > 10, Y <= 20
Hawkular Alerting11
Flexible and Scalable
Simple REST API
● Main REST API
● Additional Hawkular clients
Ruby
Python (in progress)
Java
Content-Type: application/json
/hawkular/alerts
GET /
POST /triggers
...
http://www.hawkular.org/docs/rest/rest-alerts.html
Content-Type: application/json
/hawkular/alerts
GET /
POST /triggers
...
http://www.hawkular.org/docs/rest/rest-alerts.html
Hawkular Alerting12
Flexible and Scalable
Complex Events Processing and Cassandra backend
Alerting Engine Optimized CEP algorithm
Stateful Rules Engine
Backed by JBoss Drools
Cassandra Scalable
Distributed
Structured data store
Hawkular Alerting13
Flexible and Scalable
Cluster Ready
Alerting Engine
Cassandra
Horizontal scalability
Full distribution of data and definitions
Cloud ready
Hawkular Alerting14
Pluggable Actions Architecture
Out of the box plugins
Action Plugins Email
Webhooks
Sms (Twilio SDK)
Aerogear
PagerDuty
Irc
File
DIY plugins
@Plugin(name = “my-diy-plugin)
Public class DiyPlugin implements ActionPluginListener {
public void process(ActionMessage msg) {
// Process the msg
}
}
@Plugin(name = “my-diy-plugin)
Public class DiyPlugin implements ActionPluginListener {
public void process(ActionMessage msg) {
// Process the msg
}
}
Hawkular Alerting15
Deployments
Choose what you need
● Standalone
Embed an Alerting engine into your system
● Hawkular Metrics + Alerting
Metrics Storage + Alerting engine
Available on Openshift deployments
● Hawkular Services
Metrics Storage + Alerting engine + Inventory and more
Available as Middleware provider on ManageIQ
THANK YOU
hawkular.org
#hawkular on Freenode.net
@hawkular_org
github.com/hawkular

Contenu connexe

Similaire à Hawkular Alerting

The hidden engineering behind machine learning products at Helixa
The hidden engineering behind machine learning products at HelixaThe hidden engineering behind machine learning products at Helixa
The hidden engineering behind machine learning products at HelixaAlluxio, Inc.
 
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...Amazon Web Services
 
The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations Nicola Kabar
 
Intelligent Monitoring
Intelligent MonitoringIntelligent Monitoring
Intelligent MonitoringIntelie
 
Monitor everything from physical hardware to application functionality
Monitor everything from physical hardware to application functionalityMonitor everything from physical hardware to application functionality
Monitor everything from physical hardware to application functionalityNicolas Seyvet
 
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...Amazon Web Services
 
How we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHow we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHenning Spjelkavik
 
Introducing the WSO2 Complex Event Processor
Introducing the WSO2 Complex Event ProcessorIntroducing the WSO2 Complex Event Processor
Introducing the WSO2 Complex Event ProcessorWSO2
 
Stream Processing with Ballerina
Stream Processing with BallerinaStream Processing with Ballerina
Stream Processing with BallerinaBallerina
 
Infrastructure as a service for Mobile Testing as a Service
Infrastructure as a service for Mobile Testing as a ServiceInfrastructure as a service for Mobile Testing as a Service
Infrastructure as a service for Mobile Testing as a ServiceSaurabh Jinturkar
 
Webinar - What's new in Axon 3
Webinar - What's new in Axon 3 Webinar - What's new in Axon 3
Webinar - What's new in Axon 3 Allard Buijze
 
Aws + Puppet = Dynamic Scale
Aws + Puppet = Dynamic ScaleAws + Puppet = Dynamic Scale
Aws + Puppet = Dynamic ScalePuppet
 
CQRS and Event Sourcing: A DevOps perspective
CQRS and Event Sourcing: A DevOps perspectiveCQRS and Event Sourcing: A DevOps perspective
CQRS and Event Sourcing: A DevOps perspectiveMaria Gomez
 
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...spChains: A Declarative Framework for Data Stream Processing in Pervasive App...
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...Fulvio Corno
 
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...Amazon Web Services
 
XcellHost - Performance Cloud Servers
XcellHost -  Performance Cloud Servers XcellHost -  Performance Cloud Servers
XcellHost - Performance Cloud Servers Samir Jhaveri
 
Google Cloud Platform monitoring with Zabbix
Google Cloud Platform monitoring with ZabbixGoogle Cloud Platform monitoring with Zabbix
Google Cloud Platform monitoring with ZabbixMax Kuzkin
 
善用分析與推播訊息增加及留住用戶
善用分析與推播訊息增加及留住用戶善用分析與推播訊息增加及留住用戶
善用分析與推播訊息增加及留住用戶Amazon Web Services
 

Similaire à Hawkular Alerting (20)

The hidden engineering behind machine learning products at Helixa
The hidden engineering behind machine learning products at HelixaThe hidden engineering behind machine learning products at Helixa
The hidden engineering behind machine learning products at Helixa
 
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
AWS Public Sector Symposium 2014 Canberra | Putting the "Crowd" to work in th...
 
The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations The Enterprise IT Checklist for Docker Operations
The Enterprise IT Checklist for Docker Operations
 
Intelligent Monitoring
Intelligent MonitoringIntelligent Monitoring
Intelligent Monitoring
 
Monitor everything from physical hardware to application functionality
Monitor everything from physical hardware to application functionalityMonitor everything from physical hardware to application functionality
Monitor everything from physical hardware to application functionality
 
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...
AWS re:Invent 2016: IoT Blueprints: Optimizing Supply for Smart Agriculture f...
 
Android workshop
Android workshopAndroid workshop
Android workshop
 
How we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHow we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.no
 
Introducing the WSO2 Complex Event Processor
Introducing the WSO2 Complex Event ProcessorIntroducing the WSO2 Complex Event Processor
Introducing the WSO2 Complex Event Processor
 
Stream Processing with Ballerina
Stream Processing with BallerinaStream Processing with Ballerina
Stream Processing with Ballerina
 
Stream Processing with Ballerina
Stream Processing with BallerinaStream Processing with Ballerina
Stream Processing with Ballerina
 
Infrastructure as a service for Mobile Testing as a Service
Infrastructure as a service for Mobile Testing as a ServiceInfrastructure as a service for Mobile Testing as a Service
Infrastructure as a service for Mobile Testing as a Service
 
Webinar - What's new in Axon 3
Webinar - What's new in Axon 3 Webinar - What's new in Axon 3
Webinar - What's new in Axon 3
 
Aws + Puppet = Dynamic Scale
Aws + Puppet = Dynamic ScaleAws + Puppet = Dynamic Scale
Aws + Puppet = Dynamic Scale
 
CQRS and Event Sourcing: A DevOps perspective
CQRS and Event Sourcing: A DevOps perspectiveCQRS and Event Sourcing: A DevOps perspective
CQRS and Event Sourcing: A DevOps perspective
 
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...spChains: A Declarative Framework for Data Stream Processing in Pervasive App...
spChains: A Declarative Framework for Data Stream Processing in Pervasive App...
 
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...
Best Practices for Managing Security Operations in AWS - AWS July 2016 Webina...
 
XcellHost - Performance Cloud Servers
XcellHost -  Performance Cloud Servers XcellHost -  Performance Cloud Servers
XcellHost - Performance Cloud Servers
 
Google Cloud Platform monitoring with Zabbix
Google Cloud Platform monitoring with ZabbixGoogle Cloud Platform monitoring with Zabbix
Google Cloud Platform monitoring with Zabbix
 
善用分析與推播訊息增加及留住用戶
善用分析與推播訊息增加及留住用戶善用分析與推播訊息增加及留住用戶
善用分析與推播訊息增加及留住用戶
 

Dernier

Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessUXDXConf
 
Strategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsStrategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsUXDXConf
 
Connecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKConnecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKUXDXConf
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101vincent683379
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIES VE
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfFIDO Alliance
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxJennifer Lim
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfChristopherTHyatt
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Patrick Viafore
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomCzechDreamin
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...FIDO Alliance
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...FIDO Alliance
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...CzechDreamin
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Julian Hyde
 
Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfEasyPrinterHelp
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreelreely ones
 

Dernier (20)

Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
Strategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsStrategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering Teams
 
Connecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAKConnecting the Dots in Product Design at KAYAK
Connecting the Dots in Product Design at KAYAK
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdf
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 

Hawkular Alerting

  • 1. HAWKULAR ALERTING Flexible and scalable alerting services Lucas Ponce Hawkular Team November 2016
  • 3. Hawkular Alerting3 Hawkular Alerting In a nutshell ● Generic alerting services Suitable for multiple business domains ● Flexible and scalable Simple REST API Cassandra backend Cluster ready ● Pluggable actions architecture Out of the box plugins Simplified API to develop new plugins
  • 4. Hawkular Alerting4 Action Plugins Generic Alerting Services Alerting philosophy Trigger definitions X > 10, Y < 20 N < 80% of M K is DOWN E.url=/x/y/z and E.time > 10s Data Events Events Alerts
  • 5. Hawkular Alerting5 Generic Alerting Services Data, Events and Alerts Data Events Alerts Time-series data (gauges, counters, availability) Stored on Hawkular Metrics Simple condition expressions Generic representation of a captured happening Structured data Internal: generated by the Alerting engine External: provided as data Events that required human attention Lifecycle (Open, Acknowledged, Resolved) Internal: generated by the Alerting engine
  • 6. Hawkular Alerting6 Generic Alerting Services Triggers, Conditions and Dampenings Trigger definitions X > 10, Y < 20 N < 80% of M K is DOWN E.url=/x/y/z and E.time > 10s Trigger IF Conditions are met THEN Fires an Event or Alert Execute Actions Conditions Triggers can have one or more conditions Triggers can fire when ALL or ANY of conditions are met Dampening Triggers can fire every time Conditions are met, or Define dampening rules like responseTime > 10s [4 times in 5 minutes]
  • 7. Hawkular Alerting7 Generic Alerting Services Alert Lifecycle Alerts Open Acknowledged Resolved Alert is fired by a Trigger Alert is acknowledged by a User Starting investigation on it Alert can be resolved by a User, or Alert can be resolved automatically Lifecycle transitions can execute Actions Action Plugins
  • 8. Hawkular Alerting8 Generic Alerting Services AUTORESOLVE Alerts Open Resolved Trigger can define FIRING Conditions (when a bad situation happens) Generates an Alert AUTORESOLVE Conditions (when the situation is gone) Automatically resolves the Alert i.e. FIRING responseTime > 10s AUTORESOLVE responseTime <= 10s
  • 9. Hawkular Alerting9 MemberTrigger definition ResponseTime-A > 10s in last 5m MemUsed-A > 500Mb DiskUsed-A > 10 Gb MemberTrigger definition ResponseTime-A > 10s in last 5m MemUsed-A > 500Mb DiskUsed-A > 10 Gb Generic Alerting Services Managing Group of Triggers GroupTrigger It acts like a template GroupTrigger definition ResponseTime > 10s in last 5m MemUsed > 500Mb DiskUsed > 10 Gb MemberTriggers Generated from a GroupTrigger Personalize data Ids Managed from parent GroupTrigger i.e. GroupTrigger applied to machines [A, B, C] MemberTrigger definition ResponseTime-A > 10s in last 5m MemUsed-A > 500Mb DiskUsed-A > 10 Gb
  • 10. Hawkular Alerting10 Generic Alerting Services Conditions Condition Type Examples AVAILABILITY X is DOWN, Y is NOT_UP COMPARE X > 80% of Y EVENT event.id starts 'IDXYZ', event.tag.category == 'Server', event.tag.from ends '.com' EXTERNAL Delegate the evaluation on external pluggable Alerters MISSING Not X in last 5 minutes RATE X > 10 per-minute STRING X matches “A.*B” RANGE X inside [10, 20), X outside [100,200] THRESHOLD X > 10, Y <= 20
  • 11. Hawkular Alerting11 Flexible and Scalable Simple REST API ● Main REST API ● Additional Hawkular clients Ruby Python (in progress) Java Content-Type: application/json /hawkular/alerts GET / POST /triggers ... http://www.hawkular.org/docs/rest/rest-alerts.html Content-Type: application/json /hawkular/alerts GET / POST /triggers ... http://www.hawkular.org/docs/rest/rest-alerts.html
  • 12. Hawkular Alerting12 Flexible and Scalable Complex Events Processing and Cassandra backend Alerting Engine Optimized CEP algorithm Stateful Rules Engine Backed by JBoss Drools Cassandra Scalable Distributed Structured data store
  • 13. Hawkular Alerting13 Flexible and Scalable Cluster Ready Alerting Engine Cassandra Horizontal scalability Full distribution of data and definitions Cloud ready
  • 14. Hawkular Alerting14 Pluggable Actions Architecture Out of the box plugins Action Plugins Email Webhooks Sms (Twilio SDK) Aerogear PagerDuty Irc File DIY plugins @Plugin(name = “my-diy-plugin) Public class DiyPlugin implements ActionPluginListener { public void process(ActionMessage msg) { // Process the msg } } @Plugin(name = “my-diy-plugin) Public class DiyPlugin implements ActionPluginListener { public void process(ActionMessage msg) { // Process the msg } }
  • 15. Hawkular Alerting15 Deployments Choose what you need ● Standalone Embed an Alerting engine into your system ● Hawkular Metrics + Alerting Metrics Storage + Alerting engine Available on Openshift deployments ● Hawkular Services Metrics Storage + Alerting engine + Inventory and more Available as Middleware provider on ManageIQ
  • 16. THANK YOU hawkular.org #hawkular on Freenode.net @hawkular_org github.com/hawkular