SlideShare une entreprise Scribd logo
1  sur  32
Télécharger pour lire hors ligne
Performance Analysis of Grid workflows
      in K-WfGrid and ASKALON
         Hong-Linh Truong and Thomas Fahringer
                    Distributed and Parallel Systems Group
            Institute of Computer Science, University of Innsbruck
                            {truong,tf}@dps.uibk.ac.at
                        http://dps.uibk.ac.at/projects/pma

    With contributions from:
    Bartosz Balis, Marian Bubak, Peter Brunner, Schahram Dustdar, Jakub
    Dziwisz, Andreas Hoheisel, Tilman Linden, Radu Prodan, Francesco
    Nerieri, Kuba Rozkwitalski, Robert Samborski




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.
Outline
       Motivation
       Objectives and approach
       Performance metrics and ontologies
       Architecture of Grid monitoring and analysis
       Conclusions




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   2
Performance Instrumentation, Monitoring, and Analysis
                     for the Grid
       Challenging task because of dynamic nature of
    the Grid and its applications
        Combination of fine-grained and coarse-grained
        models
        Multilingual applications
        Heterogeneous and dynamic systems
       Not just performance but also dependability
      The focus of the talk
        Our concepts, architecture, interfaces and
        integration

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   3
ASKALON Toolkit
                  http://dps.uibk.ac.at/projects/askalon/




       Programming and execution environment for Grid workflows
         Integrated various services for scheduling, executing,
         monitoring and analyzing Grid workflows
      Target applications
        Workflows of scientific applications (C/Fortran)
         • Material science, flooding, astrophysics, movie
           rendering, etc.
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   4
The K-WfGrid Project
     An FP6 EU project
          www.kwfgrid.net
                                                                    Execute workflow
     Focuses
          Automatic construction and
          reuse of workflows based on
                                                        Construct                         Monitor
          knowledge gathered through
          execution                                     workflow                      environment
     Target applications                                              K-WfGrid
          Workflows of Grid/Web services               Reuse                                 Analyze
          Grid/Web services may invoke                 knowledge                          information
          legacy applications
          Business (Coordinated Traffic
          Management, EPR) and
                                                                   Capture knowledge
          scientific (e.g., Food simulation)
          applications




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.          5
The K-WfGrid Project
     An FP6 EU project
          www.kwfgrid.net
                                                                    Execute workflow
     Focuses
          Automatic construction and
          reuse of workflows based on
                                                        Construct                         Monitor
          knowledge gathered through
          execution                                     workflow                      environment
                                        Many Grid users and
     Target applications                                 K-WfGrid
                                        developers we know
          Workflows of Grid/Web services       Reuse               Analyze
                                emphasize the interoperability, information
          Grid/Web services may invoke         knowledge
          legacy applications    integration and reliability, not
          Business (Coordinated Traffic
          Management, EPR) and
                                          just performance
                                                                   Capture knowledge
          scientific (e.g., Food simulation)
          applications




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.      6
Objectives, Requirements, Approaches
       Performance monitoring and analysis for Grid workflows of
       Web/Grid services and multilingual components

       Performance and dependability metrics
       Monitoring and performance analysis
         Service-oriented distributed architecture and peer-to-
         peer model
         Unified monitoring and performance analysis system
         covering infrastructure and applications
         Standardized data representations for monitoring data
         and events
         Adaptive and generic sensors, distributed analysis,
         performance bottleneck search

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   7
Hierarchical View of Grid Workflows
                                       <parallel>
             Workflow
                                        <activity name="mProject1">
                                        <executable name="mProject1"/>
                                        </activity>
      Workflow Region n
                                        <activity name="mProject2">
                                        <executable name="mProject2"/>
                                         </activity>
             Activity m


                                       </parallel>
                                                          mProject1Service.java
      Invoked Application m
                                                                 public void mProject1() {
                                                                         A();

                                                                         while () {
  Code            Code           Code                                       ...
 Region 1       Region …        Region q                                     }

                                                                 }
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.      8
Hierarchical View of Grid Workflows
                                       <parallel>
             Workflow
                                        <activity name="mProject1">
                                        <executable name="mProject1"/>
                                        </activity>
      Workflow Region n
                                        <activity name="mProject2">
                                        <executable name="mProject2"/>
                                         </activity>
             Activity m


                                       </parallel>
                                         Monitoring and
                                                     mProject1Service.java
      Invoked Application m            analysis at multiple
                                                          public void mProject1() {
                                      levels of abstraction
                                                                         A();

                                                                         while () {
  Code            Code           Code                                       ...
 Region 1       Region …        Region q                                     }

                                                                 }
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   9
Workflow Execution Model (Simplified)




                                                                  Local scheduler

       Workflow execution
        Spanning multiple Grid sites
        Highly inter-organizational, inter-related and dynamic
       Multiple levels of job scheduling
        At workflow execution engine (part of WfMS)
        At Grid sites
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   10
Performance Metrics of Grid Workflows
      Interesting performance metrics associated with multiple
    levels of abstraction
         Metrics can be used in workflow composition, for
         comparing different invoked applications of a single activity,
         adaptive scheduling, etc.
      Five levels of abstraction
         Code region, Invoked application, Activity ,Workflow
         Region, Workflow
         Topdown PMA: from a higher level to a lower one




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   11
Monitoring and Measuring Performance
                    Metrics
    Performance monitoring and analysis tools
       Operate at multiple levels
       Correlate performance metrics from multiple levels
   Middleware and application instrumentation
      Instrument execution engine
       • Execution engine can be distributed or centralized
      Instrument applications
       • Distributed, spanning multiple Grid sites
    Challenging problems: the complexity of performance tools and data
      Integrate multiple performance monitoring tools executed on multiple
      Grid sites
      Integrate performance data produced by various tools

      We need common concepts for performance data associated with
                          Grid workflows
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   12
Utilizing Ontologies for Describing Performance
               Data of Grid workflows
      Metrics ontology
        Specifies which performance metrics a tool can provide
        Simplifies the access to performance metrics provided by various tools
     Performance data integration
        Performance data integration based on common concepts.
        High-level search and retrieval of performance data
     Knowledge base performance data of Grid workflows
        Utilized by high-level tools such as schedulers, workflow composition
        tools, etc.
        Used to re(discover) workflow patterns, interactions in workflows, to
        check correct execution, etc.
     Distributed performance analysis
         Performance analysis requests can be built based on ontologies

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   13
Workflow Performance Ontology

     WfPerfOnto (Ontology describing Performance data of Grid
    Workflows)
       Specifies performance metrics
       Basic concepts
        • Concepts reflect the hierarchical structure of a workflow
        • Static and dynamic workflow performance data
       Relationships
        • Static and dynamic relationships among concepts




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   14
WfPerfOnto




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   15
Monitoring and Analysis Scenario




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   16
Components for Grid PMA
      Three main parts of a unified performance monitoring,
    instrumentation and analysis system
          Monitoring and instrumentation services
          Performance analysis services
          Performance service interfaces and data representations
           • We must reuse existing tools and techniques as much as
             possible

      Integration model
          Loosely coupled: among Grid sites/organizations
           • Utilizing SOA for performance tools
          Tightly coupled: within services deployed in a single Grid
          site/organization
           • Interfacing to existing (parallel) performance tools


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   17
K-WfGrid Monitoring and Analysis Architecture




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   18
Self-Managing Sensor-Based Middleware
       Integrating diverse types of sensors into a single system
          Event-driven and demand-driven sensors for system and
          applications monitoring, rule-based monitoring
       Self-managing services
         Service-based operations and TCP-based stream data
         delivery
         Peer-to-peer Grid services for the monitoring middleware
       Query and subscription of monitoring data
        Data query and subscription
        Group-based data query and subscription, and notification




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   19
Self-Managing Sensor Based Monitoring
                      Middleware




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   20
Workflow Instrumentation
      Issues to be addressed
          Multiple levels of instrumentation
          Instrumentation of multilingual applications (C/Java/Fortran)
          Must be dynamic (enabled) instrumentation

      ASKALON and K-WfGrid approach
          Utilizing existing instrumentation techniques
           • OCM-G (Roland Wismueller and Marian Bubak)
                  dynamic enabled instrumentation C/Fortran
           • Dyninst (Barton Miller, Jeff Hollingsworth)
                  for binary code generated from C/Fortran
           • Source/dynamic instrumentation of Java (from Java 1.5)
          Using APART standardized intermediate representation (SIR)
          XML-based request for controlling instrumentation


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   21
DIPAS: Distributed Performance Analysis
           DIPAS Portal                     DIPAS Portal


                  DIPAS                       DIPAS
                                                                               Knowledge
                 GateWay                     GateWay                         Builder Agent
                                                                                 GOM:
            Grid analysis                                                   Ontology Memory
                                            Grid analysis
               agent                           agent
                                                                           Monitoring Service
                            Grid analysis
                               agent                                            GWES:
                               DIPAS                                        Execution Service
  Grid analysis agent accepts WARL and returns performance metrics
  described in XML under a tree of metrics
     WARL (workflow analysis request langauge): based on concepts and
     properties in WfPerfOnto
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   22
temporal overheads


Workflow Overhead                                                     unidentified overhead

                                                                      middleware
                                                                                           resource brokerage


Classification                                                                             performance prediction

                                                                                          scheduling

                                                                                                           optimization algorithm

                                                                                                           rescheduling


      Middleware                                                                          execution management

                                                                                                             control of parallelism


           Scheduler                                                                                                             fork construct

                                                                                                                                 barrier



           Resource Manager
                                                                                                             job preparation

                                                                                                                                  archive/compression of data

                                                                                                                                 extractdecompression/

           Execution management                                                                              job submission

                                                                                                                                 submission decision policy

            • Control of parallelism                                                                                             GRAM submission

                                                                                                                                 GRAM polling


            • Loss of parallelism                                                                                                queue

                                                                                                                                                  queue waiting

                                                                                                                                                  queue residency



      Loss of parallelism                                                                 restart

                                                                                                             job failure

                                                                                                           job cancel


           Load imbalance                                                                  service latency

                                                                                                               resource broker


           Serialization                                                                                     performance prediction

                                                                                                                  scheduler



           Replicated job                                                                 security
                                                                                                             execution management



                                                                          loss of parallelism



      Data transfer                                                                       load imbalance

                                                                                           serialisation

                                                                                          replicated job




      Activity
                                                                          data transfer

                                                                                          file staging

                                                                                                             stage in


           Parallel processing overheads                                                   access to database
                                                                                                             stage out




           External load                                                                   third party data transfer

                                                                                           input from user

                                                                                           data transfer imbalance

                                                                          activity


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   parallel overheads

                                                                                           external load
                                                                                                                                                  23
Current status of the implementation
      SOA-based
          Monitoring, Instrumentation and Analysis services are GT4 based
          XML-based for performance data representations and requests
      Monitoring and Instrumentation Services
          gSOAP, GSI-based dynamic instrumentation service
          Java-based dynamic instrumentation
          OCM-G
          But we need to integrate them into a single framework for workflows
          and it is a non trivial task
      Analysis services
          GT4-based with distributed components
          Simple language for workflow analysis request (WARL), designed
          based on WfPerfOnto
          Metrics are described in XML


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   24
Example: Dynamic Instrumentation




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   25
Snapshot: Online System Monitoring




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   26
Rule-based Monitoring
       Sensors use rules to analyze monitoring data
       Rules are based on IBM ABLE toolkit

       Example:
          A network path in the
          Austrian Grid, bandwidth <
          5MB/s (based on Iperf)

          Define a fuzzy variable with
          VERY LOW, LOW, MEDUM,
          HIGH, VERY HIGH

          Fuzzy rules: Events send
          when bandwidth VERY LOW
          or VERY HIGH
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   27
Online Infrastructure Monitoring




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   28
DAG-based workflow monitoring and
                       analysis
       Online application
            profiling




  Monitoring data of
  workflow activity
     execution




                                                               Monitoring data of
                                                              computational node




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   29
K-WfGrid: Online Workflow Monitoring and
                       Analysis




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   30
Online Workflow Analysis




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   31
Conclusions
    The architecture of monitoring and analysis service must tackle the
   dynamics and the diversity of the Grid
          Service-oriented, peer-to-peer model, adaptive sensors
     Integration and reuse are important issues
          Loosely coupled and tightly coupled
          Do not neglect data representations and service interfaces
     Performance metrics and ontology for Grid workflows
          What performance metrics are important and how to measure them
          Common and generic concepts and relationships monitored and alayzed
    Given well-defined service interfaces, data representation, performance
   metrics and ontology
          Simplify the integration among components
          Towards automatic, intelligent and distributed performance performance
          analysis

                        UIBK perf. work: http://dps.uibk.ac.at/projects/pma/
                       Papers: http://dps.uibk.ac.at/index.pl/publications


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   32

Contenu connexe

Similaire à Performance Analysis of Grid Workflows in K-WfGrid and ASKALON

K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...Hong-Linh Truong
 
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...Hong-Linh Truong
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution ISSGC Summer School
 
Webinar september 2013
Webinar september 2013Webinar september 2013
Webinar september 2013Marc Gille
 
LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2Alexander Zagvozdin
 
Presentation for use
Presentation   for usePresentation   for use
Presentation for usebolu804
 
Zou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 ConciseZou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 Conciseyongqiangzou
 
Building resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with SpringBuilding resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with SpringMarek Jeszka
 
Automation Failover in Openstack
Automation Failover in OpenstackAutomation Failover in Openstack
Automation Failover in Openstackjannahyusoff1
 
20100512 Workflow Ramage
20100512 Workflow Ramage20100512 Workflow Ramage
20100512 Workflow RamageSteven Ramage
 
Presentation for slideshare
Presentation   for slidesharePresentation   for slideshare
Presentation for slidesharebolu804
 
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL ScriptNetwork Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL ScriptIRJET Journal
 
Live Coding 12 Factor App
Live Coding 12 Factor AppLive Coding 12 Factor App
Live Coding 12 Factor AppEmily Jiang
 
LUXproject functionality overview R11.8
LUXproject functionality overview R11.8LUXproject functionality overview R11.8
LUXproject functionality overview R11.8Alexander Zagvozdin
 
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu FengHC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu FengAMD Developer Central
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Rejunevating software reengineering processes
Rejunevating software reengineering processesRejunevating software reengineering processes
Rejunevating software reengineering processesmanishthaper
 
Microservices and modularity with java
Microservices and modularity with javaMicroservices and modularity with java
Microservices and modularity with javaDPC Consulting Ltd
 
IPv4 to IPv6 network transformation
IPv4 to IPv6 network transformationIPv4 to IPv6 network transformation
IPv4 to IPv6 network transformationNikolay Milovanov
 

Similaire à Performance Analysis of Grid Workflows in K-WfGrid and ASKALON (20)

K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
 
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution
 
Webinar september 2013
Webinar september 2013Webinar september 2013
Webinar september 2013
 
LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2
 
Presentation for use
Presentation   for usePresentation   for use
Presentation for use
 
Zou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 ConciseZou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 Concise
 
Building resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with SpringBuilding resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with Spring
 
Automation Failover in Openstack
Automation Failover in OpenstackAutomation Failover in Openstack
Automation Failover in Openstack
 
20100512 Workflow Ramage
20100512 Workflow Ramage20100512 Workflow Ramage
20100512 Workflow Ramage
 
Presentation for slideshare
Presentation   for slidesharePresentation   for slideshare
Presentation for slideshare
 
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL ScriptNetwork Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
 
Live Coding 12 Factor App
Live Coding 12 Factor AppLive Coding 12 Factor App
Live Coding 12 Factor App
 
LUXproject functionality overview R11.8
LUXproject functionality overview R11.8LUXproject functionality overview R11.8
LUXproject functionality overview R11.8
 
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu FengHC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Rejunevating software reengineering processes
Rejunevating software reengineering processesRejunevating software reengineering processes
Rejunevating software reengineering processes
 
Automation chapt 3
Automation chapt 3Automation chapt 3
Automation chapt 3
 
Microservices and modularity with java
Microservices and modularity with javaMicroservices and modularity with java
Microservices and modularity with java
 
IPv4 to IPv6 network transformation
IPv4 to IPv6 network transformationIPv4 to IPv6 network transformation
IPv4 to IPv6 network transformation
 

Plus de Hong-Linh Truong

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesHong-Linh Truong
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentHong-Linh Truong
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffHong-Linh Truong
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsHong-Linh Truong
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Hong-Linh Truong
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Hong-Linh Truong
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesHong-Linh Truong
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsHong-Linh Truong
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANHong-Linh Truong
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsHong-Linh Truong
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Hong-Linh Truong
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Hong-Linh Truong
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsHong-Linh Truong
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTHong-Linh Truong
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesHong-Linh Truong
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Hong-Linh Truong
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsHong-Linh Truong
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...Hong-Linh Truong
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...Hong-Linh Truong
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesHong-Linh Truong
 

Plus de Hong-Linh Truong (20)

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service Development
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data Analytics
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace Services
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud Systems
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under Uncertainties
 

Dernier

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 

Dernier (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

Performance Analysis of Grid Workflows in K-WfGrid and ASKALON

  • 1. Performance Analysis of Grid workflows in K-WfGrid and ASKALON Hong-Linh Truong and Thomas Fahringer Distributed and Parallel Systems Group Institute of Computer Science, University of Innsbruck {truong,tf}@dps.uibk.ac.at http://dps.uibk.ac.at/projects/pma With contributions from: Bartosz Balis, Marian Bubak, Peter Brunner, Schahram Dustdar, Jakub Dziwisz, Andreas Hoheisel, Tilman Linden, Radu Prodan, Francesco Nerieri, Kuba Rozkwitalski, Robert Samborski Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.
  • 2. Outline Motivation Objectives and approach Performance metrics and ontologies Architecture of Grid monitoring and analysis Conclusions Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 2
  • 3. Performance Instrumentation, Monitoring, and Analysis for the Grid Challenging task because of dynamic nature of the Grid and its applications Combination of fine-grained and coarse-grained models Multilingual applications Heterogeneous and dynamic systems Not just performance but also dependability The focus of the talk Our concepts, architecture, interfaces and integration Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 3
  • 4. ASKALON Toolkit http://dps.uibk.ac.at/projects/askalon/ Programming and execution environment for Grid workflows Integrated various services for scheduling, executing, monitoring and analyzing Grid workflows Target applications Workflows of scientific applications (C/Fortran) • Material science, flooding, astrophysics, movie rendering, etc. Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 4
  • 5. The K-WfGrid Project An FP6 EU project www.kwfgrid.net Execute workflow Focuses Automatic construction and reuse of workflows based on Construct Monitor knowledge gathered through execution workflow environment Target applications K-WfGrid Workflows of Grid/Web services Reuse Analyze Grid/Web services may invoke knowledge information legacy applications Business (Coordinated Traffic Management, EPR) and Capture knowledge scientific (e.g., Food simulation) applications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 5
  • 6. The K-WfGrid Project An FP6 EU project www.kwfgrid.net Execute workflow Focuses Automatic construction and reuse of workflows based on Construct Monitor knowledge gathered through execution workflow environment Many Grid users and Target applications K-WfGrid developers we know Workflows of Grid/Web services Reuse Analyze emphasize the interoperability, information Grid/Web services may invoke knowledge legacy applications integration and reliability, not Business (Coordinated Traffic Management, EPR) and just performance Capture knowledge scientific (e.g., Food simulation) applications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 6
  • 7. Objectives, Requirements, Approaches Performance monitoring and analysis for Grid workflows of Web/Grid services and multilingual components Performance and dependability metrics Monitoring and performance analysis Service-oriented distributed architecture and peer-to- peer model Unified monitoring and performance analysis system covering infrastructure and applications Standardized data representations for monitoring data and events Adaptive and generic sensors, distributed analysis, performance bottleneck search Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 7
  • 8. Hierarchical View of Grid Workflows <parallel> Workflow <activity name="mProject1"> <executable name="mProject1"/> </activity> Workflow Region n <activity name="mProject2"> <executable name="mProject2"/> </activity> Activity m </parallel> mProject1Service.java Invoked Application m public void mProject1() { A(); while () { Code Code Code ... Region 1 Region … Region q } } Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 8
  • 9. Hierarchical View of Grid Workflows <parallel> Workflow <activity name="mProject1"> <executable name="mProject1"/> </activity> Workflow Region n <activity name="mProject2"> <executable name="mProject2"/> </activity> Activity m </parallel> Monitoring and mProject1Service.java Invoked Application m analysis at multiple public void mProject1() { levels of abstraction A(); while () { Code Code Code ... Region 1 Region … Region q } } Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 9
  • 10. Workflow Execution Model (Simplified) Local scheduler Workflow execution Spanning multiple Grid sites Highly inter-organizational, inter-related and dynamic Multiple levels of job scheduling At workflow execution engine (part of WfMS) At Grid sites Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 10
  • 11. Performance Metrics of Grid Workflows Interesting performance metrics associated with multiple levels of abstraction Metrics can be used in workflow composition, for comparing different invoked applications of a single activity, adaptive scheduling, etc. Five levels of abstraction Code region, Invoked application, Activity ,Workflow Region, Workflow Topdown PMA: from a higher level to a lower one Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 11
  • 12. Monitoring and Measuring Performance Metrics Performance monitoring and analysis tools Operate at multiple levels Correlate performance metrics from multiple levels Middleware and application instrumentation Instrument execution engine • Execution engine can be distributed or centralized Instrument applications • Distributed, spanning multiple Grid sites Challenging problems: the complexity of performance tools and data Integrate multiple performance monitoring tools executed on multiple Grid sites Integrate performance data produced by various tools We need common concepts for performance data associated with Grid workflows Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 12
  • 13. Utilizing Ontologies for Describing Performance Data of Grid workflows Metrics ontology Specifies which performance metrics a tool can provide Simplifies the access to performance metrics provided by various tools Performance data integration Performance data integration based on common concepts. High-level search and retrieval of performance data Knowledge base performance data of Grid workflows Utilized by high-level tools such as schedulers, workflow composition tools, etc. Used to re(discover) workflow patterns, interactions in workflows, to check correct execution, etc. Distributed performance analysis Performance analysis requests can be built based on ontologies Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 13
  • 14. Workflow Performance Ontology WfPerfOnto (Ontology describing Performance data of Grid Workflows) Specifies performance metrics Basic concepts • Concepts reflect the hierarchical structure of a workflow • Static and dynamic workflow performance data Relationships • Static and dynamic relationships among concepts Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 14
  • 15. WfPerfOnto Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 15
  • 16. Monitoring and Analysis Scenario Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 16
  • 17. Components for Grid PMA Three main parts of a unified performance monitoring, instrumentation and analysis system Monitoring and instrumentation services Performance analysis services Performance service interfaces and data representations • We must reuse existing tools and techniques as much as possible Integration model Loosely coupled: among Grid sites/organizations • Utilizing SOA for performance tools Tightly coupled: within services deployed in a single Grid site/organization • Interfacing to existing (parallel) performance tools Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 17
  • 18. K-WfGrid Monitoring and Analysis Architecture Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 18
  • 19. Self-Managing Sensor-Based Middleware Integrating diverse types of sensors into a single system Event-driven and demand-driven sensors for system and applications monitoring, rule-based monitoring Self-managing services Service-based operations and TCP-based stream data delivery Peer-to-peer Grid services for the monitoring middleware Query and subscription of monitoring data Data query and subscription Group-based data query and subscription, and notification Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 19
  • 20. Self-Managing Sensor Based Monitoring Middleware Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 20
  • 21. Workflow Instrumentation Issues to be addressed Multiple levels of instrumentation Instrumentation of multilingual applications (C/Java/Fortran) Must be dynamic (enabled) instrumentation ASKALON and K-WfGrid approach Utilizing existing instrumentation techniques • OCM-G (Roland Wismueller and Marian Bubak) dynamic enabled instrumentation C/Fortran • Dyninst (Barton Miller, Jeff Hollingsworth) for binary code generated from C/Fortran • Source/dynamic instrumentation of Java (from Java 1.5) Using APART standardized intermediate representation (SIR) XML-based request for controlling instrumentation Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 21
  • 22. DIPAS: Distributed Performance Analysis DIPAS Portal DIPAS Portal DIPAS DIPAS Knowledge GateWay GateWay Builder Agent GOM: Grid analysis Ontology Memory Grid analysis agent agent Monitoring Service Grid analysis agent GWES: DIPAS Execution Service Grid analysis agent accepts WARL and returns performance metrics described in XML under a tree of metrics WARL (workflow analysis request langauge): based on concepts and properties in WfPerfOnto Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 22
  • 23. temporal overheads Workflow Overhead unidentified overhead middleware resource brokerage Classification performance prediction scheduling optimization algorithm rescheduling Middleware execution management control of parallelism Scheduler fork construct barrier Resource Manager job preparation archive/compression of data extractdecompression/ Execution management job submission submission decision policy • Control of parallelism GRAM submission GRAM polling • Loss of parallelism queue queue waiting queue residency Loss of parallelism restart job failure job cancel Load imbalance service latency resource broker Serialization performance prediction scheduler Replicated job security execution management loss of parallelism Data transfer load imbalance serialisation replicated job Activity data transfer file staging stage in Parallel processing overheads access to database stage out External load third party data transfer input from user data transfer imbalance activity Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. parallel overheads external load 23
  • 24. Current status of the implementation SOA-based Monitoring, Instrumentation and Analysis services are GT4 based XML-based for performance data representations and requests Monitoring and Instrumentation Services gSOAP, GSI-based dynamic instrumentation service Java-based dynamic instrumentation OCM-G But we need to integrate them into a single framework for workflows and it is a non trivial task Analysis services GT4-based with distributed components Simple language for workflow analysis request (WARL), designed based on WfPerfOnto Metrics are described in XML Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 24
  • 25. Example: Dynamic Instrumentation Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 25
  • 26. Snapshot: Online System Monitoring Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 26
  • 27. Rule-based Monitoring Sensors use rules to analyze monitoring data Rules are based on IBM ABLE toolkit Example: A network path in the Austrian Grid, bandwidth < 5MB/s (based on Iperf) Define a fuzzy variable with VERY LOW, LOW, MEDUM, HIGH, VERY HIGH Fuzzy rules: Events send when bandwidth VERY LOW or VERY HIGH Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 27
  • 28. Online Infrastructure Monitoring Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 28
  • 29. DAG-based workflow monitoring and analysis Online application profiling Monitoring data of workflow activity execution Monitoring data of computational node Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 29
  • 30. K-WfGrid: Online Workflow Monitoring and Analysis Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 30
  • 31. Online Workflow Analysis Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 31
  • 32. Conclusions The architecture of monitoring and analysis service must tackle the dynamics and the diversity of the Grid Service-oriented, peer-to-peer model, adaptive sensors Integration and reuse are important issues Loosely coupled and tightly coupled Do not neglect data representations and service interfaces Performance metrics and ontology for Grid workflows What performance metrics are important and how to measure them Common and generic concepts and relationships monitored and alayzed Given well-defined service interfaces, data representation, performance metrics and ontology Simplify the integration among components Towards automatic, intelligent and distributed performance performance analysis UIBK perf. work: http://dps.uibk.ac.at/projects/pma/ Papers: http://dps.uibk.ac.at/index.pl/publications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 32