SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
S U M M I T
Lo n don
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Using AIOps to Reduce Incidents
Volume
Itai David Njanji
Senior Consultant, OI
AWS Professional Services
S e s s i o n I D : O P S 1
Paul Ferguson
Global Practice Manager
AWS Professional Services
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Poll
How many of you are using AWS CloudWatch?
How many of you are using third party monitoring tools?
How many of you are using ITSM tools for Incident Management?
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Agenda
• Introduction: Challenges of IT Ops
• Review ITIL Guidelines
• IT Ops to AIOps
• AIOps
• Summary
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Data overload
Gaming IoT sensorsDevices
External
systems
and
applications
Web content
Logs, logs, and
more logs …
Databases Servers NetworkingStorage
Internal
systems
and
applications
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
IT Ops
Drowning in
tickets and
noise
More
Automation
Time to detect
and fix issues is
too long
Need more
agile, roll out
changes faster
Challenges of IT Ops
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Sustain
“Traditional Ops”
Grow
“DevOps”
Optimize
“CloudOps”
OperationsEngineering
PlatformApplications
DevOpsDev Team /
COTS
Cloud Platform
Engineering
ITSM
OperationsEngineering
PlatformApplications
DevOpsDevOps
Teams
ITSM
Cloud Platform Engineering
OperationsEngineering
PlatformApplications
DevOpsDev Team /
COTS
Cloud
Platform
Engineering
Cloud
Operations
App
Ops
Platform
Ops
ITSM
Transitional Strategic Strategic
Ticketing System (ITSM) in Operating Models
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Scale
“AIOps”
AI OperationsEngineering
PlatformApplications
DevOps
DevOps
Teams
ITSM
ML
Cloud Platform Engineering
Strategic
AIOps Operating Model
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
PROCESS
Core
Operations
Functions
Platform Architecture &
Governance
Event & Incident
Management
Provisioning &
Configuration
Management
Availability &
Continuity
Management
Security &
Control
Functions
Change Management
Resource Inventory
Management
Identity & Access
Management
Security Management
Business
Management
Functions
Financial Management
Capacity Planning &
Forecasting
Organizational Change
Management
Vendor Management
Supporting
Functions
Reporting & Analytics
Continuous
Improvement
Application Lifecycle
Management
IT Processes
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Best practices for modern application development
• Enable experimentation by creating a culture of ownership
• Componentize applications using microservices
• Update applications and infrastructure quickly by automating the release
pipeline
• Model and provision application resources using infrastructure as code
• Simplify infrastructure management with serverless technologies
• Improve application performance by increasing observability
• Secure the entire application lifecycle by automating security
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
DNA of Modern Apps
• Large Data Volume
• Connected (APIs etc.)
• Evolves quickly (CI/CD)
Increased Incident Volume
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Javascript to process
notifications
ITSM Tool
Tickets/CMDB Update
Incidents: AWS-ITSM Integration
AWS Config Amazon CloudWatch
Publish to SNS topicAlarmDiscovery
Amazon Simple
Notification Service
HTTPS notification
Topic
AWS Cloud
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
More Monitoring Tools
ITSM Data Overload
AIOps can reduce noise and increase insights:
• Alert Clustering
• Anomaly Detection
• Neural Feedback
ITSM Hygiene
Incident and Event Management
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 1. Set up for
Observability
Platform Logs
Events
Application Logs
Code telemetry
API and Users
Infrastructure Logs
Amazon
CloudWatch
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 2. Set up delivery, Collection, and Storage
ingestion
Amazon CloudWatch Amazon Kinesis
Amazon CloudWatch Amazon EMR Amazon Elastic
Search
Amazon Redshift Amazon Simple Storage
Service (S3)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 3. Querying and patternsmining
Amazon CloudWatch Amazon Athena Amazon Kinesis Data
Analytics
Amazon SageMaker
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 4.
Alerting, Notification, andremediation
Amazon CloudWatch Amazon Simple
Notification Service
AWS Lambda
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 5.
Integrate with ITSM Tooling
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AIOps
Predictive &
Preventive Insights
Patterns,
tools & data
ingestion
Planned events
Budgets
Threat intel
Ticket Reductions
Decisive
Response
Timely
Recovery
New Pattern
discovery
Continuous Learning
Infrastructure Logs
Platform Logs
Application Logs
Code telemetry
Events
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS Marketplace Solutions
• Turn Key solutions - Less time to production
• Address people challenges: Expert guidance
• Proven solutions and more ..
Event Noise
Filtering
Incident Detection
Entropy
Time
Proximity
Logical
Topology
Linguistic Proximity
Behavior Feedback
Neural
Feedback
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Old Way vs. AIOps Way
Enterprise Event Correlation Old Way AIOps
Approach Rules Algorithms and automated workflow
Dependency Human Human & Machine
Configuration Definition Rule Logic + Inputs Outcomes
Configuration Amount 1000+ Rules < 10 Definitions
Configuration Time Days & Continuous Mins & One Time
Correlation Technique Hard Matching Fuzzy Matching
Machine Learning None Supervised & Unsupervised
Accuracy 20%? 80% percentile +
Tolerates App/Infra Changes No Yes
Maintenance Resources 2-3 people 1 part-time
Cost of Ownership High Low
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I TS U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Itai David Njanji
injanji@amazon.com

Contenu connexe

Tendances

Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Amazon Web Services
 

Tendances (20)

Modernizing Infrastructure Monitoring and Management with AIOps
Modernizing Infrastructure Monitoring and Management with AIOpsModernizing Infrastructure Monitoring and Management with AIOps
Modernizing Infrastructure Monitoring and Management with AIOps
 
AWS Well-Architected Framework
AWS Well-Architected FrameworkAWS Well-Architected Framework
AWS Well-Architected Framework
 
Practical FinOps in Practice
Practical FinOps in PracticePractical FinOps in Practice
Practical FinOps in Practice
 
Living the AWS Well Architected Framework
Living the AWS Well Architected FrameworkLiving the AWS Well Architected Framework
Living the AWS Well Architected Framework
 
Introduction to the Well-Architected Framework and Tool - SVC208 - Anaheim AW...
Introduction to the Well-Architected Framework and Tool - SVC208 - Anaheim AW...Introduction to the Well-Architected Framework and Tool - SVC208 - Anaheim AW...
Introduction to the Well-Architected Framework and Tool - SVC208 - Anaheim AW...
 
API Management in Digital Transformation
API Management in Digital TransformationAPI Management in Digital Transformation
API Management in Digital Transformation
 
How to Execute a Successful API Strategy
How to Execute a Successful API StrategyHow to Execute a Successful API Strategy
How to Execute a Successful API Strategy
 
Cloud Migration Workshop
Cloud Migration WorkshopCloud Migration Workshop
Cloud Migration Workshop
 
An Introduction to the AWS Well Architected Framework - Webinar
An Introduction to the AWS Well Architected Framework - WebinarAn Introduction to the AWS Well Architected Framework - Webinar
An Introduction to the AWS Well Architected Framework - Webinar
 
Observability for modern applications
Observability for modern applications  Observability for modern applications
Observability for modern applications
 
FinOps
FinOpsFinOps
FinOps
 
Being Well-Architected in the Cloud
Being Well-Architected in the CloudBeing Well-Architected in the Cloud
Being Well-Architected in the Cloud
 
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
An Overview of Best Practices for Large Scale Migrations - AWS Transformation...
 
Observability at Scale
Observability at Scale Observability at Scale
Observability at Scale
 
Perform a Cloud Readiness Assessment for Your Own Company
Perform a Cloud Readiness Assessment for Your Own CompanyPerform a Cloud Readiness Assessment for Your Own Company
Perform a Cloud Readiness Assessment for Your Own Company
 
.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session.conf Go 2022 - Observability Session
.conf Go 2022 - Observability Session
 
Observability – the good, the bad, and the ugly
Observability – the good, the bad, and the uglyObservability – the good, the bad, and the ugly
Observability – the good, the bad, and the ugly
 
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
 
Observability For Modern Applications
Observability For Modern ApplicationsObservability For Modern Applications
Observability For Modern Applications
 
AWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence PillarAWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence Pillar
 

Similaire à Using AIOps to reduce incidents volume

Similaire à Using AIOps to reduce incidents volume (20)

AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics ServicesAWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI services
 
AI/ML Week: Strengthen Cybersecurity
AI/ML Week: Strengthen CybersecurityAI/ML Week: Strengthen Cybersecurity
AI/ML Week: Strengthen Cybersecurity
 
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit SydneyCloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
 
Starting your Cloud Journey - AWSomeDay Israel
Starting your Cloud Journey - AWSomeDay IsraelStarting your Cloud Journey - AWSomeDay Israel
Starting your Cloud Journey - AWSomeDay Israel
 
Starting your cloud journey - AWSomeDay Israel
Starting your cloud journey - AWSomeDay IsraelStarting your cloud journey - AWSomeDay Israel
Starting your cloud journey - AWSomeDay Israel
 
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
 
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
 
DevOps at Amazon
DevOps at AmazonDevOps at Amazon
DevOps at Amazon
 
Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...
 
Automated Security Remediation
Automated Security RemediationAutomated Security Remediation
Automated Security Remediation
 
Modern Application Development in the Cloud
Modern Application Development in the CloudModern Application Development in the Cloud
Modern Application Development in the Cloud
 
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
 
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
Starting your Cloud Transformation Journey - Tel Aviv Summit 2018
 
Artificial intelligence in actions: delivering a new experience to Formula 1 ...
Artificial intelligence in actions: delivering a new experience to Formula 1 ...Artificial intelligence in actions: delivering a new experience to Formula 1 ...
Artificial intelligence in actions: delivering a new experience to Formula 1 ...
 
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS SummitThreat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
 
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
 
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
 
Industry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS SummitIndustry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS Summit
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWS
 

Plus de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Using AIOps to reduce incidents volume

  • 1. S U M M I T Lo n don
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Using AIOps to Reduce Incidents Volume Itai David Njanji Senior Consultant, OI AWS Professional Services S e s s i o n I D : O P S 1 Paul Ferguson Global Practice Manager AWS Professional Services
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Poll How many of you are using AWS CloudWatch? How many of you are using third party monitoring tools? How many of you are using ITSM tools for Incident Management?
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Agenda • Introduction: Challenges of IT Ops • Review ITIL Guidelines • IT Ops to AIOps • AIOps • Summary
  • 5. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Data overload Gaming IoT sensorsDevices External systems and applications Web content Logs, logs, and more logs … Databases Servers NetworkingStorage Internal systems and applications
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T IT Ops Drowning in tickets and noise More Automation Time to detect and fix issues is too long Need more agile, roll out changes faster Challenges of IT Ops
  • 8. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Sustain “Traditional Ops” Grow “DevOps” Optimize “CloudOps” OperationsEngineering PlatformApplications DevOpsDev Team / COTS Cloud Platform Engineering ITSM OperationsEngineering PlatformApplications DevOpsDevOps Teams ITSM Cloud Platform Engineering OperationsEngineering PlatformApplications DevOpsDev Team / COTS Cloud Platform Engineering Cloud Operations App Ops Platform Ops ITSM Transitional Strategic Strategic Ticketing System (ITSM) in Operating Models
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Scale “AIOps” AI OperationsEngineering PlatformApplications DevOps DevOps Teams ITSM ML Cloud Platform Engineering Strategic AIOps Operating Model
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T PROCESS Core Operations Functions Platform Architecture & Governance Event & Incident Management Provisioning & Configuration Management Availability & Continuity Management Security & Control Functions Change Management Resource Inventory Management Identity & Access Management Security Management Business Management Functions Financial Management Capacity Planning & Forecasting Organizational Change Management Vendor Management Supporting Functions Reporting & Analytics Continuous Improvement Application Lifecycle Management IT Processes
  • 12. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Best practices for modern application development • Enable experimentation by creating a culture of ownership • Componentize applications using microservices • Update applications and infrastructure quickly by automating the release pipeline • Model and provision application resources using infrastructure as code • Simplify infrastructure management with serverless technologies • Improve application performance by increasing observability • Secure the entire application lifecycle by automating security
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T DNA of Modern Apps • Large Data Volume • Connected (APIs etc.) • Evolves quickly (CI/CD) Increased Incident Volume
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Javascript to process notifications ITSM Tool Tickets/CMDB Update Incidents: AWS-ITSM Integration AWS Config Amazon CloudWatch Publish to SNS topicAlarmDiscovery Amazon Simple Notification Service HTTPS notification Topic AWS Cloud
  • 16. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T More Monitoring Tools ITSM Data Overload AIOps can reduce noise and increase insights: • Alert Clustering • Anomaly Detection • Neural Feedback ITSM Hygiene Incident and Event Management
  • 17. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 1. Set up for Observability Platform Logs Events Application Logs Code telemetry API and Users Infrastructure Logs Amazon CloudWatch
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 2. Set up delivery, Collection, and Storage ingestion Amazon CloudWatch Amazon Kinesis Amazon CloudWatch Amazon EMR Amazon Elastic Search Amazon Redshift Amazon Simple Storage Service (S3)
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 3. Querying and patternsmining Amazon CloudWatch Amazon Athena Amazon Kinesis Data Analytics Amazon SageMaker
  • 21. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 4. Alerting, Notification, andremediation Amazon CloudWatch Amazon Simple Notification Service AWS Lambda
  • 22. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 5. Integrate with ITSM Tooling
  • 23. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AIOps Predictive & Preventive Insights Patterns, tools & data ingestion Planned events Budgets Threat intel Ticket Reductions Decisive Response Timely Recovery New Pattern discovery Continuous Learning Infrastructure Logs Platform Logs Application Logs Code telemetry Events
  • 24. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS Marketplace Solutions • Turn Key solutions - Less time to production • Address people challenges: Expert guidance • Proven solutions and more .. Event Noise Filtering Incident Detection Entropy Time Proximity Logical Topology Linguistic Proximity Behavior Feedback Neural Feedback
  • 26. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 27. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Old Way vs. AIOps Way Enterprise Event Correlation Old Way AIOps Approach Rules Algorithms and automated workflow Dependency Human Human & Machine Configuration Definition Rule Logic + Inputs Outcomes Configuration Amount 1000+ Rules < 10 Definitions Configuration Time Days & Continuous Mins & One Time Correlation Technique Hard Matching Fuzzy Matching Machine Learning None Supervised & Unsupervised Accuracy 20%? 80% percentile + Tolerates App/Infra Changes No Yes Maintenance Resources 2-3 people 1 part-time Cost of Ownership High Low
  • 28. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 29. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I TS U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 30. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Itai David Njanji injanji@amazon.com