Cognitive Behavior Analysis
framework for Fault Prediction
     in Cloud Computing
   (NoF’12, Nov 21st-23rd, 2012, Tunis, Tunisia)

Reza FARRAHI MOGHADDAM, Fereydoun FARRAHI MOGHADDAM,
             Vahid ASGHARI, Mohamed CHERIET

     Synchromedia Lab, ETS, University of Quebec, Montreal, Quebec, Canada




                Laboratory for Multimedia
              Communication in Telepresence
Outline

        Motivation for Behavior Analysis (BA) and
         Failure Prediction
            Proposed BA framework
              Probabilistic Behavior Analysis
              Simulated Probabilistic Behavior Analysis
              Behavior-Time Profile Modeling and Analysis

        Scalability of the Proposed BA framework
        Conclusions and Future Prospects

11/23/2012                         NoF’12                    2
Why Behavior Analysis (BA)?
            Benefits of BA for Failure Prediction
              Preventing Service-Layer or System-Level failures
              Enabling operation in “unallowable” states to save
              energy and cost, and also to reduce footprint
            Profiling the Actors
              Profiling end users, service providers, and other
              actors in a computing business (for example, a
              telecom business)
              The ensemble of these actors resembles more an
              ecosystem than a system
              Profiling helps in:
               • Smart management of resources
               • Building reputations and trust for actors
               • Identifying and isolating wrong-acting actors and threats
Why Failure Prediction?
A new failure source: Cyclic ElastoPlastic Operation (CEPO)



     [Figure: failure sources: cyclic elastoplastic operation, hardware factor, software factor, human factor, middleware factor, other factors]



Cyclic elastoplastic operation (CEPO):
              in Civil and Mechanical Engineering

 Safe operation in plastic mode
 Repeatable transitions between elastic and plastic modes
 Cyclic operation is the key

 [Figure: elastic regime, plastic regime, and plastic collapse point]




Cyclic elastoplastic operation (CEPO):
                     its counterparts in Computing Systems

     Carbon Enabling Effect and Green Push: Doing more with less
     1. PUE of Data centers
             Increasing inlet air flow temperature (2-4% energy saving per 1°C increase)
                 For example: PUE = 1.5, 20% saving (5°C) → PUE = 1.2
             Reducing or eliminating fans
         Failure at component level (servers) increases with temperature (ASHRAE TC 9.9, 2011)
         Failure Prediction and Behavior Analysis can isolate component-level failures
           (even before their occurrence) in order to prevent system-level failures (which
           may violate SLO constraints)
         Again, cyclic operation is the key to success
     2. Can this be applied to bandwidth too?

     [Figure: stress on system versus inlet temperature. The elastic mode covers the allowable elastic range; beyond it lies the plastic mode, bounded by the bearable stress level. Uncertainty increases with the length of stay in the plastic mode.]
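
The inlet-temperature example in item 1 can be checked with a few lines of arithmetic. This is a minimal sketch that assumes, as the slide's example does, a linear saving per degree applied directly to the PUE; the function name and parameters are illustrative and not taken from the presentation.

```python
def pue_after_raising_inlet(pue, delta_t_celsius, saving_per_degree):
    """Linear model (an assumption): total saving = saving_per_degree * delta_t,
    applied directly to the PUE as in the slide's example."""
    saving = saving_per_degree * delta_t_celsius
    return pue * (1.0 - saving)


# Upper end of the quoted 2-4% range: 4% per degree over 5 degrees = 20% saving.
print(round(pue_after_raising_inlet(1.5, 5, 0.04), 3))
# Lower end of the range: 2% per degree over 5 degrees = 10% saving.
print(round(pue_after_raising_inlet(1.5, 5, 0.02), 3))
```

With the 4% figure the result matches the slide's PUE of about 1.2; with 2% the same 5°C increase gives roughly 1.35.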
The Proposed BA framework

            An Ensemble-of-Experts approach:
              The sub-paradigms
               • Probabilistic Behavior Analysis
               • Simulated Probabilistic Behavior Analysis
               • Behavior-Time Profile Modeling and Analysis
            Two different pictures:
              Systemic picture
              Ecosystemic picture




BA Framework:
             Systemic picture




BA Framework:
             Ecosystemic picture




Multiple layers in BA framework

  Layers vs (physical and non-physical) location: Toward Location Intelligence in Computing systems

  Various layers
    Hardware (Compute/Network)
    Hardware Drivers/Software
    Middleware/Protocols
    Virtualware
    Virtualware Drivers/Software
    Applications (Software)




Sub-paradigm 1:
                  Probabilistic Behavior Analysis
      Each layer of the system is considered as a graph
      Sub-graphs constitute super-components of higher levels (vertical scaling)
      The behavior is modeled as Probability of Availability (PoA)



            The PoA is related to CDF of failure:


            The Differential Density Function (DDF):
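
The slide's own formulas are not reproduced in this transcript. As a hedged illustration only, the sketch below assumes the common reliability convention PoA(t) = 1 - F(t), where F is the CDF of the time to failure, and assumes the DDF plays the role of a density, that is, the derivative of F. A Weibull CDF stands in for the component model because the Tanh distribution's form is not given here; all names and parameter values are hypothetical.

```python
import math


def weibull_cdf(t, scale, shape):
    """Time-to-failure CDF, F(t) = 1 - exp(-(t/scale)^shape), used as a stand-in."""
    if t <= 0:
        return 0.0
    return 1.0 - math.exp(-((t / scale) ** shape))


def poa(t, cdf):
    """Probability of availability at time t, assumed here to be 1 - F(t)."""
    return 1.0 - cdf(t)


def ddf(t, cdf, h=1e-3):
    """Central-difference estimate of dF/dt, assumed to be what the DDF denotes."""
    return (cdf(t + h) - cdf(t - h)) / (2.0 * h)


def component_cdf(t):
    """Hypothetical component: scale 1000 h, shape 1 (the exponential special case)."""
    return weibull_cdf(t, scale=1000.0, shape=1.0)


print(poa(1000.0, component_cdf))  # close to exp(-1)
print(ddf(1000.0, component_cdf))  # close to exp(-1) / 1000
```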


Sub-paradigm 1:
             Probabilistic Behavior Analysis
       An example of a 2-component system:
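
The figure for this example is not preserved, so the topology of the two components is unknown. As a sketch under that caveat, the two textbook cases for independent components are shown: a series arrangement (both must be up) and a parallel arrangement (at least one must be up). The PoA values 0.9 and 0.8 are made up for illustration.

```python
def poa_series(poas):
    """All components must be available: product of the individual PoAs
    (assumes independent components)."""
    result = 1.0
    for p in poas:
        result *= p
    return result


def poa_parallel(poas):
    """At least one component must be available: one minus the product of the
    individual unavailabilities (assumes independent components)."""
    all_down = 1.0
    for p in poas:
        all_down *= 1.0 - p
    return 1.0 - all_down


p1, p2 = 0.9, 0.8  # hypothetical component PoAs at some fixed time
print(round(poa_series([p1, p2]), 4))
print(round(poa_parallel([p1, p2]), 4))
```

Applying these two rules recursively to sub-graphs is one way to obtain the PoA of a higher-level super-component from its parts.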




Sub-paradigm 1:
                     Tanh distribution
             Tanh CDFs              Tanh DDFs




Sub-paradigm 1:
                 Probabilistic Behavior Analysis

             [Figure: Lanl05 database]

             Lanl05 database statistics
               Duration: 9 years
               Retrieved from FTA
               Availability statistics:
                 Records: 19874
                 Mean: 1777.99 hrs
                 Std: 3462.33
                 Skewness: 3.09
                 GoF p-value (Tanh): 0.500
                 GoF p-value (Weibull): 0.416
               Unavailability statistics:
                 Mean: 5.88 hrs
                 Std: 78.39
                 Skewness: 43.96
Sub-paradigm 2:
                 Simulated Probabilistic Behavior Analysis



      For highly complex system topologies, the CDFs of
      high-level sub-graphs and components are estimated
      using simulation based on the CDFs of basic components
      It can also be used to validate the calculations of the
      first sub-paradigm
      A Monte Carlo strategy is used
      In each run, the fault time of each basic component is
      drawn randomly according to its CDF
      The cumulative behavior of all runs of the high-level
      sub-graph is used to estimate its CDF
      1000-run simulations have been used
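
The Monte Carlo procedure described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the sub-graph topology ((A parallel B) in series with C), the Weibull component models, and their parameters are all hypothetical. Each run draws a fault time per basic component by inverse-transform sampling from its CDF; the sub-graph fails at the minimum fault time across series parts and the maximum across parallel parts.

```python
import bisect
import math
import random


def sample_fault_time(rng, scale, shape):
    """Inverse-transform sample from a Weibull time-to-failure CDF."""
    u = rng.random()  # uniform in [0, 1), so 1 - u is in (0, 1]
    return scale * (-math.log(1.0 - u)) ** (1.0 / shape)


def simulate_subgraph(rng, runs=1000):
    """Sorted fault times of a hypothetical sub-graph: (A parallel B) in series with C."""
    times = []
    for _ in range(runs):
        a = sample_fault_time(rng, 1000.0, 1.2)
        b = sample_fault_time(rng, 1000.0, 1.2)
        c = sample_fault_time(rng, 5000.0, 1.0)
        # The parallel pair fails when both have failed; the series chain
        # fails when either the pair or C fails.
        times.append(min(max(a, b), c))
    return sorted(times)


def empirical_cdf(sorted_times, t):
    """Fraction of runs in which the sub-graph had failed by time t."""
    return bisect.bisect_right(sorted_times, t) / len(sorted_times)


fault_times = simulate_subgraph(random.Random(0), runs=1000)
for hours in (500.0, 1000.0, 2000.0):
    print(hours, empirical_cdf(fault_times, hours))
```

The empirical CDF obtained this way can be compared against an analytically composed CDF for the same sub-graph, which is the validation role the slide mentions.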


Sub-paradigm 2:
              Simulated Probabilistic Behavior Analysis



  MC simulation: G_1,1            MC simulation: G_2,1




Sub-paradigm 2:
             Simulated Probabilistic Behavior Analysis



  MC simulation: CDFs            MC simulation: DDFs




Sub-paradigm 3:
                Behavior-Time Profile Modeling and Analysis


       Time profiles of component characteristics, collected
       by opportunistic agents across the system (or
       ecosystem)
       Time profiles of state transitions in components and
       in higher-level sub-graphs at various layers,
       collected or injected by BSU
       Machine learning methods are used to match the
       state transitions with the characteristics
         Support Vector Machine (SVM)
         Bayesian networks
         Agent-based data mining
         Fuzzy logic
         ···
Sub-paradigm 3:
                      Behavior-Time Profile Modeling and Analysis


         Four motivations for behavior-time profile analysis:
              Spontaneous faults have been reduced significantly compared to cause-and-effect faults
               • Fewer purely hardware-caused faults compared to interaction-caused faults
              Patterns and cycles in fault occurrence and, in general, in behavior
              Handling of faulty systems that do not have any faulty components
               • Context-sensitive diagnosis [Lamperti2011]
              Handling of gradual events

Sub-paradigm 3:
             Behavior-Time Profile Modeling and Analysis

     A simple example:




SLA and Service Grading
        Even without considering the elastoplastic use case, BA can help in
         upgrading a service (for example, to telco grade)
        Probability of Availability (PoA): Lease-based business models
              Predicting, isolating, and resolving failure events at the component or
               sub-system level before they reach the Service Layer
        Probability of Completion (PoC): Task-based business models
        Countermeasure options:
              Taking high-risk components out of service (maintenance tickets)
              Temporal redundancy
        But all of this depends on the ability to predict high risk or failure

        An example:
          No BA: Major fault mode with MTBF = 10 weeks, MTTR = 10
           minutes → 52:09 minutes of downtime a year < 52:33 → four nines (99.99%)
          With BA: 90% of faults are detected 15 minutes before system
           failure → 5:13 minutes of downtime a year < 5:15 → five nines (99.999%)
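
The downtime figures in this example can be reproduced with a short calculation. The sketch assumes a 365-day year and assumes that a fault detected ahead of time causes no service downtime; function names are illustrative only.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525600, assuming a 365-day year


def yearly_downtime_minutes(mtbf_weeks, mttr_minutes, predicted_fraction=0.0):
    """Expected downtime per year; predicted faults are assumed to be handled
    before they cause a system failure, so they add no downtime."""
    failures_per_year = 365 / (mtbf_weeks * 7)
    return failures_per_year * mttr_minutes * (1.0 - predicted_fraction)


def downtime_budget_minutes(nines):
    """Allowed downtime per year for an availability of N nines (4 means 99.99%)."""
    return MINUTES_PER_YEAR * 10 ** (-nines)


def as_min_sec(minutes):
    """Format a duration in minutes as m:ss, rounded to the nearest second."""
    total_seconds = round(minutes * 60)
    return f"{total_seconds // 60}:{total_seconds % 60:02d}"


no_ba = yearly_downtime_minutes(10, 10)
with_ba = yearly_downtime_minutes(10, 10, predicted_fraction=0.9)
print(as_min_sec(no_ba), no_ba < downtime_budget_minutes(4))
print(as_min_sec(with_ba), with_ba < downtime_budget_minutes(5))
```

Under these assumptions the four-nines budget is about 52.56 minutes a year and the five-nines budget about 5.26 minutes, so both cases fit their budgets only by a narrow margin.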

Countermeasures and cost savings

        An example: Full system
        Two alternative modes to save both energy (cost) and life expectancy of components




Scalability

    Horizontal and Vertical scaling            Federated scaling




Conclusions and Future Prospects
        A multi-paradigm, multi-layer, multi-level cognitive behavior analysis
        framework is introduced
        Three sub-paradigms (cross-cover):
          Statistical inference
          Statistical inference by means of simulation
          Time-profile modeling and analysis
       Multiple granularity analysis and scalability:
         Horizontal, vertical and hierarchical scaling
       Including other layers in the analysis: virtualware and middleware
       Estimation of PoA to improve system dependability and its service grade
       A new distribution is introduced: Tanh distribution
          validated on a real database: lanl05 database
       Future Prospects:
          Large-scale operation of each sub-paradigm
          Cognitive Response: Multi-Expert Decision Making, Cognitive Models
          Integration of the framework with real computing systems:
             • OpenStack, Open GSN
          Machine learning techniques for the time-profile modeling sub-paradigm
          Development of more sophisticated distributions


Thank you. Any questions?
                                                      BATG




Reza FARRAHI MOGHADDAM, Eng., Ph.D., MIEEE
    Research Associate
    imriss@ieee.org, rfarrahi@synchromedia.ca
Fereydoun FARRAHI MOGHADDAM, Eng., M.Sc., MIEEE
    PhD Student
    farrahi@ieee.org, ffarrahi@synchromedia.ca
Vahid ASGHARI, Eng., Ph.D., MIEEE
    Postdoctoral Fellow
    vahid@emt.inrs.ca
Mohamed CHERIET, Eng., Ph.D., SMIEEE
    Director of Synchromedia Lab
    mohamed.cheriet@etsmtl.ca

                             http://www.synchromedia.ca/
                                                  NSERC

 

Cognitive Behavior Analysis framework for Fault Prediction in Cloud Computing

  • 1. Cognitive Behavior Analysis framework for Fault Prediction in Cloud Computing (NoF’12, Nov 21st-23rd, 2012, Tunis, Tunisia)
      Reza FARRAHI MOGHADDAM, Fereydoun FARRAHI MOGHADDAM, Vahid ASGHARI, Mohamed CHERIET
      Synchromedia Lab (Laboratory for Multimedia Communication in Telepresence), ETS, University of Quebec, Montreal, Quebec, Canada
  • 2. Outline
      - Motivation for Behavior Analysis (BA) and Failure Prediction
      - Proposed BA framework
          - Probabilistic Behavior Analysis
          - Simulated Probabilistic Behavior Analysis
          - Behavior-Time Profile Modeling and Analysis
      - Scalability of the Proposed BA framework
      - Conclusions and Future Prospects
  • 3. Why Behavior Analysis (BA)?
      - Benefits of BA for Failure Prediction
          - Preventing Service-Layer or System-Level failures
          - Enabling operation in “unallowable” states to save energy and cost, and also to reduce footprint
      - Profiling the Actors
          - Profiling end users, service providers, and other actors in a computing business (for example, a telecom business)
          - The ensemble of these actors resembles an ecosystem more than a system
          - Profiling helps in:
              - Smart management of resources
              - Building reputations and trust for actors
              - Identifying and isolating wrong-acting actors and threats
  • 4. Why Failure Prediction?
      - A new failure source: Cyclic ElastoPlastic Operation (CEPO)
      - [Diagram of failure sources: cyclic elastoplastic operation, hardware factor, software factor, human factors, middleware factor, other factor]
  • 5. Cyclic elastoplastic operation (CEPO): in Civil and Mechanical Engineering
      - Safe operation in plastic mode
      - Repeatable transitions between elastic and plastic modes
      - Cyclic operation is the key
      - [Figure labels: elastic regime, plastic regime, plastic collapse point]
  • 6. Cyclic elastoplastic operation (CEPO): its counterparts in Computing Systems
      - Carbon Enabling Effect and Green Push: doing more with less
      - 1. PUE of data centers
          - Increasing inlet air flow temperature (2-4% energy saving per 1°C increase)
          - For example: PUE = 1.5, 20% saving (5°C) → PUE = 1.2
          - Reducing or eliminating fans
          - Failure at component level (servers) increases with temperature (ASHRAE TC 9.9, 2011)
          - Failure Prediction and Behavior Analysis can isolate component-level failures (even before their occurrence) in order to prevent system-level failures (which may violate SLO constraints)
          - Again, cyclic operation is the key to success
      - 2. Can it be applied to bandwidth too?
      - Uncertainty increases with the length of stay in the plastic mode
      - [Figure: stress on system vs. inlet temperature, showing the elastic mode (allowable elastic range), the plastic mode, and the bearable stress level]
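The PUE example on slide 6 can be checked with a few lines of arithmetic. This is only a sketch: it assumes the quoted saving applies to total facility energy while IT energy stays constant, so that PUE scales by the same factor. The function name and the choice of 4% per degree (the upper end of the 2-4% range on the slide) are illustrative and not taken from the slides.

```python
def pue_after_inlet_increase(pue, delta_c, saving_per_degree):
    """Return the PUE after raising the inlet temperature by delta_c degrees Celsius.

    Assumption: the saving is linear in temperature and applies to total
    facility energy, with IT energy unchanged.
    """
    total_saving = saving_per_degree * delta_c
    return pue * (1.0 - total_saving)


# Slide values: PUE = 1.5, 5 degrees C increase, 4% saving per degree (20% total).
new_pue = pue_after_inlet_increase(1.5, delta_c=5, saving_per_degree=0.04)
print(round(new_pue, 2))
```

Under these assumptions the result matches the PUE = 1.2 quoted on the slide.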
  • 7. The Proposed BA framework
      - An Ensemble-of-Experts approach:
          - The sub-paradigms
              - Probabilistic Behavior Analysis
              - Simulated Probabilistic Behavior Analysis
              - Behavior-Time Profile Modeling and Analysis
      - Two different pictures:
          - Systemic picture
          - Ecosystemic picture
  • 8. BA Framework: Systemic picture [figure]
  • 9. BA Framework: Ecosystemic picture [figure]
  • 10. Multiple layers in BA framework
      - Layers vs (physical and non-physical) location: Toward Location Intelligence in Computing systems
      - Various layers:
          - Hardware (Compute/Network)
          - Hardware Drivers/Software
          - Middleware/Protocols
          - Virtualware
          - Virtualware Drivers/Software
          - Applications (Software)
  • 11. Sub-paradigm 1: Probabilistic Behavior Analysis
      - Each layer of the system is considered as a graph
      - Sub-graphs constitute super-components of higher levels (vertical scaling)
      - The behavior is modeled as the Probability of Availability (PoA)
      - The PoA is related to the CDF of failure [equation on slide]
      - The Differential Density Function (DDF) [equation on slide]
  • 12. Sub-paradigm 1: Probabilistic Behavior Analysis
      - An example of a 2-component system [figure]
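One way to picture a 2-component example is the standard series/parallel reliability calculation. The sketch below is an assumption-laden illustration, not the example from the slide: it assumes independent failures, takes PoA(t) = 1 - F(t) with F the CDF of failure, and uses exponential CDFs with made-up mean lives purely as placeholders.

```python
import math


def exp_cdf(t, mean_life):
    """Placeholder failure CDF: exponential with the given mean life (hours)."""
    return 1.0 - math.exp(-t / mean_life)


def series_cdf(f1, f2):
    """A series pair fails as soon as either component fails."""
    return 1.0 - (1.0 - f1) * (1.0 - f2)


def parallel_cdf(f1, f2):
    """A parallel (redundant) pair fails only when both components have failed."""
    return f1 * f2


t = 500.0  # hours; hypothetical evaluation time
f1, f2 = exp_cdf(t, 1000.0), exp_cdf(t, 2000.0)

# Assumed relation: PoA(t) = 1 - F(t).
poa_series = 1.0 - series_cdf(f1, f2)
poa_parallel = 1.0 - parallel_cdf(f1, f2)
print(poa_series, poa_parallel)
```

The same two combination rules can be applied recursively, treating each sub-graph as a super-component of the next level up.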
  • 13. Sub-paradigm 1: Tanh distribution
      - [Figures: Tanh CDFs; Tanh DDFs]
  • 14. Sub-paradigm 1: Probabilistic Behavior Analysis
      - Lanl05 database statistics [figure: Lanl05 database]
          - Duration: 9 years
          - Retrieved from FTA
          - Availability statistics:
              - 19874 records
              - mean = 1777.99 (hrs)
              - std = 3462.33
              - Skewness = 3.09
              - GoF p-value (Tanh) = 0.500
              - GoF p-value (Weib.) = 0.416
          - Unavailability statistics:
              - mean = 5.88 (hrs)
              - std = 78.39
              - Skewness = 43.96
  • 15. Sub-paradigm 2: Simulated Probabilistic Behavior Analysis
      - For highly complex system topologies, the CDFs of high-level sub-graphs and components are estimated by simulation, based on the CDFs of the basic components
      - It can also be used to validate the calculations of the first sub-paradigm
      - A Monte Carlo strategy is used
          - In each run, the fault time of each basic component is drawn randomly from its CDF
          - The cumulative behavior of the high-level sub-graph over all runs is used to estimate its CDF
          - 1000-run simulations have been used
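The Monte Carlo procedure on slide 15 can be sketched as follows. This is a minimal illustration under stated assumptions: the two-component series and parallel topologies, the Weibull lifetimes, and their parameters are placeholders chosen here, not the graphs or the Tanh distribution used in the talk. Only the 1000-run count comes from the slide.

```python
import bisect
import random


def simulate_failure_times(runs=1000, seed=0):
    """Monte Carlo failure times for a series and a parallel two-component sub-graph.

    Assumption: component lifetimes follow placeholder Weibull distributions.
    """
    rng = random.Random(seed)
    series, parallel = [], []
    for _ in range(runs):
        # Draw one fault time per basic component from its distribution.
        t1 = rng.weibullvariate(1000.0, 1.5)  # (scale, shape), hypothetical
        t2 = rng.weibullvariate(2000.0, 0.8)
        series.append(min(t1, t2))    # fails at the first component fault
        parallel.append(max(t1, t2))  # fails when the last component faults
    return sorted(series), sorted(parallel)


def empirical_cdf(sorted_times, t):
    """Fraction of runs in which the sub-graph has failed by time t."""
    return bisect.bisect_right(sorted_times, t) / len(sorted_times)


series_times, parallel_times = simulate_failure_times()
for t in (250.0, 1000.0, 4000.0):
    print(t, empirical_cdf(series_times, t), empirical_cdf(parallel_times, t))
```

Because the series failure time is never later than the parallel one in any run, the estimated series CDF lies on or above the parallel CDF at every time point, which gives a quick sanity check on the simulation.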
  • 16. Sub-paradigm 2: Simulated Probabilistic Behavior Analysis
      - [Figures: MC simulation of G_1,1; MC simulation of G_2,1]
  • 17. Sub-paradigm 2: Simulated Probabilistic Behavior Analysis
      - [Figures: MC simulation CDFs; MC simulation DDFs]
  • 18. Sub-paradigm 3: Behavior-Time Profile Modeling and Analysis
      - Time-profile of component characteristics, collected by opportunistic agents across the system (or ecosystem)
      - Time-profile of state transitions in components and in higher-level sub-graphs at various layers, collected or injected by BSU
      - Machine learning methods are used to match the state transitions with the characteristics
          - Support Vector Machine (SVM)
          - Bayesian networks
          - Agent-based data mining
          - Fuzzy logic
          - ···
  • 19. Sub-paradigm 3: Behavior-Time Profile Modeling and Analysis
      - Four motivations for behavior-time profile analysis:
          - Spontaneous faults have been reduced significantly compared to cause-and-effect faults
              - Fewer pure hardware-caused faults compared to interaction-caused faults
          - Patterns and cycles in fault occurrence and, in general, in behavior
          - Handling of faulty systems that do not have any faulty components
              - Context-sensitive diagnosis [Lamperti2011]
          - Handling of gradual events
  • 20. Sub-paradigm 3: Behavior-Time Profile Modeling and Analysis
      - A simple example [figure]
  • 21. SLA and Service Grading
      - Even without considering the elastoplastic use case, BA can help in upgrading a service (for example, to the telco grade)
      - Probability of Availability (PoA): lease-based business models
          - Predicting, isolating and resolving failure events at component or sub-system levels before they reach the Service Layer
      - Probability of Completion (PoC): task-based business models
      - Countermeasure options:
          - Take high-risk components out of service (maintenance tickets)
          - Temporal redundancy
      - But all this depends on the ability to predict high risk or failure
      - An example:
          - No BA: major fault mode with MTBF = 10 weeks, MTTR = 10 minutes → 52:09 minutes downtime a year < 52:33 → 4 nines
          - With BA: 90% of faults are detected 15 minutes before system failure → 5:13 minutes downtime a year < 5:15 → 5 nines
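The downtime figures in the slide 21 example can be reproduced with a short calculation. The sketch below assumes a 365-day year, assumes that a fault detected in advance causes no downtime, and ignores the repair time when counting faults per year (MTBF is far larger than MTTR). The function names are illustrative.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600; a 365-day year is assumed


def yearly_downtime_minutes(mtbf_weeks, mttr_minutes, undetected_fraction=1.0):
    """Expected downtime per year, counting only faults that BA fails to catch."""
    mtbf_minutes = mtbf_weeks * 7 * 24 * 60
    faults_per_year = MINUTES_PER_YEAR / mtbf_minutes
    return faults_per_year * undetected_fraction * mttr_minutes


def downtime_budget_minutes(nines):
    """Allowed yearly downtime for an availability with the given number of nines."""
    return MINUTES_PER_YEAR * 10.0 ** (-nines)


no_ba = yearly_downtime_minutes(10, 10)
with_ba = yearly_downtime_minutes(10, 10, undetected_fraction=0.10)
print(no_ba, downtime_budget_minutes(4))    # compare with 52:09 < 52:33
print(with_ba, downtime_budget_minutes(5))  # compare with 5:13 < 5:15
```

Without BA the system sees about 5.2 faults a year at 10 minutes each, which just fits the four-nines budget. Catching 90% of them ahead of time leaves one tenth of that downtime, which just fits the five-nines budget.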
  • 22. Countermeasures and cost savings
      - Two alternative modes to save both energy (cost) and life expectancy of components
      - An example: full system [figure]
  • 23. Scalability
      - [Figures: Horizontal and Vertical scaling; Federated scaling]
  • 24. Conclusions and Future Prospects
      - A multi-paradigm, multi-layer, multi-level cognitive behavior analysis framework is introduced
      - Three sub-paradigms (cross-cover):
          - Statistical inference
          - Statistical inference by means of simulation
          - Time-profile modeling and analysis
      - Multiple granularity analysis and scalability:
          - Horizontal, vertical and hierarchical scaling
      - Including other layers in the analysis: virtualware and middleware
      - Estimation of PoA to improve system dependability and its service grade
      - A new distribution is introduced: Tanh distribution
          - Validated on a real database: lanl05 database
      - Future Prospects:
          - Large-scale operation of each sub-paradigm
          - Cognitive Response: Multi-Expert Decision Making, Cognitive Models
          - Integration of the framework with real computing systems: OpenStack, Open GSN
          - Machine learning techniques for the time-profile modeling sub-paradigm
          - Development of more sophisticated distributions
  • 25. Thank you. Any questions?
      - BATG
      - Reza FARRAHI MOGHADDAM, Eng., Ph.D., MIEEE, Research Associate: imriss@ieee.org, rfarrahi@synchromedia.ca
      - Fereydoun FARRAHI MOGHADDAM, Eng., M.Sc., MIEEE, PhD Student: farrahi@ieee.org, ffarrahi@synchromedia.ca
      - Vahid ASGHARI, Eng., Ph.D., MIEEE, Postdoctoral Fellow: vahid@emt.inrs.ca
      - Mohamed CHERIET, Eng., Ph.D., SMIEEE, Director of Synchromedia Lab: mohamed.cheriet@etsmtl.ca
      - http://www.synchromedia.ca/
      - NSERC