SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
Open-Source Software in the
                  Scientific World
                  Case study: Triana Software


IAN J.TAYLOR                                       ADINA RIPOSAN
Cardiff University, UK                           Military Technical Academy,
Center for Computation & Technology,
                          Technology,                        Contact Net Ltd,
Louisiana University, USA                                Bucharest, Romania


                              eLiberatica 2007
May 18-19, 2007               Brasov, Romania
TRIANA

                  Workflows Environment




                         eLiberatica 2007
May 18-19, 2007          Brasov, Romania
Open-Source scientific environment
       for workflow composition:

       => more than 500 applications are developed based on
       Triana in support to scientific groups around the world

         Can integrate within a number of different distributed
          environments
           =>for allowing true heterogeneous computing across
           different Grids and distributed paradigms

         Can specify distributed course-grained service workflows

                      http://www.trianacode.org/




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania
Some Examples of Domains

     Gravitational wave data analysis (GridOneD)
     Radio astronomy (with Manchester)
     Astrophysical simulations (Cactus)
     Data mining (DIPSO, Data mining Grid)
     Biodiversity Problems (Bdworld)
     Galaxy visualization
     Audio processing and distributed music information retrieval
      (MIR)
     Distributed peer-to-peer simulations (NRL and AgentJ)
     Grid-enabled medical simulations GEMSS)
     Environmental science (INFERNO)
     E-Health (Contact-Net)




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania
Projects
     GEO 600 project - http://www.geo600.uni-hannover.de/
                         http://www.geo600.uni-hannover.de/
      (gravitational wave data analysis)

     GridOneD - www.gridoned.org
                   www.gridoned.org
      (distributed computing - P2P and Grid)

     GridLab - http://www.gridlab.org/
                http://www.gridlab.org/

     DataMiningGrid - http://www.datamininggrid.org/
                       http://www.datamininggrid.org/

     BiodiversityWorld - http://www.bdworld.org/
                          http://www.bdworld.org/

     GEMSS - http://www.ccrl-nece.de/gemss/index.html
               http://www.ccrl-nece.de/gemss/index.html
      (Grid-enabled medical simulations)

     ReSC - http://www.resc.rdg.ac.uk/projects.php
             http://www.resc.rdg.ac.uk/projects.

     DART - http://www.mrsdart.com
               http://www.mrsdart.com
      (Digital Audio Retrieval using Triana)
                                     Triana)

                                 eLiberatica 2007
May 18-19, 2007                  Brasov, Romania
Triana environment is used for problem solving and
        orchestrating flows of operations/services

        => fine-grained dataflow applications
        => course-grained distributed workflow system



     Workflow is implicit in scientific algorithms that specify:
           a series of inter-dependent operations to be executed,
           connecting such algorithms in a series of derivations,

          => when aggregated perform some higher-level task




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania
Workflows can be:
           • simple => contain a few components
           • complex => logic-based support can be integrated to
             make intelligent decisions about the dynamic evolution of
             the particular workflow


     Tasks with inter-dependencies expressed and handled by a
        computation flow
         ⇒ the chain of elementary tasks are not necessarily linear
         ⇒ request a graph of interconnected tasks


     Workflow management is data driven:
        the scientific experiments need to process large datasets
        the scheduler responsible for distributing the
          computational load should take into account the
          input dataset as well as the workflow graph topology


                               eLiberatica 2007
May 18-19, 2007                Brasov, Romania
Triana Workflows
                  - multimodal, multimedia -




                          eLiberatica 2007
May 18-19, 2007           Brasov, Romania
Based on Triana => The Alchemist Infrastructure

           A new paradigm in search and discovery of
            distributed resources, based on:

                     multimodal workflows

                     the coupling of metadata fusion & social tagging with
                      the more traditional index-based search techniques




                                     eLiberatica 2007
May 18-19, 2007                      Brasov, Romania
Information sources:
  •      http://www.trianacode.org/
  •      http://www.wspeer.org/
  •      http://www.trianacode.org/p2ps/




                         eLiberatica 2007
May 18-19, 2007          Brasov, Romania
   WSPeer has been Triana’s Web Services toolkit for the past
        three years and many projects have used this combination to
        specify their distributed course-grained service workflows
                        http://www.wspeer.org/

        WSPeer - existing middleware which provides:
         • a SOAP messaging layer (using Web Services / WS-RF)
         • within a P2P network that supports a super-peer topology of
           rendezvous or advert caching peers
            =>to support the scalability of the discovery & access to information
            =>to cache application-specific data, scientific data & metadata
              (not just discovery information)


       P2PS has been used as the underlying P2P environment
        during this time
                        http://www.trianacode.org/p2ps/


                                     eLiberatica 2007
May 18-19, 2007                      Brasov, Romania
   Triana provides a number of different bindings to underlying
    middleware and therefore a number of possible modes of
    operation.

     • Triana on the one hand has a full binding to Java GAT
       interface, capable of invoking tools and services such as
       Condor, GridFTP, GRAM, etc.

     • On the other has integrated with service-based middleware,
       such as Web Services, WS-RF, Jxta and P2PS.

     • It also has the capability to dynamically wrap applications
       remotely behind Web Services interfaces so that existing
       software can be easily integrated.




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania
Triana, the GAT and the GAP

                                                                                      Service Based
Grid Computing:                                                                       Computing:
 Job Submission,                                                                           Deployment,
     File services                                                                         discovery and
 A Graphical Grid                                                                          communication
      Computing                                                                            with distributed
  Environment or                                                                           services e.g. P2P
            Portal                                                                         and (GSI) Web
                                                                                           services


                                                                               GAP Interface
         GAT Interface
                                                                                                    Web
Condor   Unicore    GridFTP    GRMS          WSRF                  P2PS             JXTA
                                                                                                   Services
 Globus RLS    PBS       GridLab       .NET                     P2PS             JXTA              UDDI
                                                              Discovery
                                                                          P2PS Discovery                       SOAP
         SSH       SGE   LDR       Other..                                                 JXTA
                                                    eLiberatica 2007      Pipes
                                                                                           Pipes
    May 18-19, 2007                                 Brasov, Romania
Triana Focus


   Two core underlying focuses:

     • Interactive graphical programming of the
       distributed tasks - complex editing
             Intuitive drag/drop flexible editing - copy/paste services,
              wizards for creating tools/toolboxes, user interfaces,
              adding nodes and multi-level grouping.
             Has been used as a “graphical editor” for other languages,
              e.g. DAG, VDLx (DAX in progress).

     • Heterogeneous workflows - Bridge the gap between
       different distributed environments
             Use cross-environment interfaces
             led to integration with GAT (pre SAGA), GAP


                                 eLiberatica 2007
May 18-19, 2007                  Brasov, Romania
Triana allows to spontaneously create and run data analysis
        algorithms on the data at its source.

        Such a component-based object-oriented approach allows
         scientists to easily create new algorithms that conform to an
         agreed and defined set of data types and can adapt to
         different internal parameters

        Makes it easy to create individual user interfaces for each
         component to allow the modification of its internal parameters

        Graphical approach

         => REUSABILITY of existing units

         => Allows simple type-safe orchestration of data analysis
         pipelines on-the-fly without the need for code-level
         reconfiguration.

     Triana enables project scientists to design and create systems
        than connect together a number of software components.


                              eLiberatica 2007
May 18-19, 2007               Brasov, Romania
ALCHEMIST

        Workflows for Data Discovery




                   eLiberatica 2007
May 18-19, 2007    Brasov, Romania
The Alchemist framework:

                       Project at Cardiff University

     Domain-independent workflows & search mechanism, built on
       a generic P2P (Peer-to-Peer) architecture, supporting:

        distributed database queries
        complex search algorithms based on workflows
          • composed as a collection of Peer-to-Peer overlays,
                                                     overlays,
            Grid-based services and distributed workflows


     ⇒   Uses industry standards such as Web services and SOAP for
         messaging
          ⇒ Alchemist framework & tools are extensible & interoperable




                                eLiberatica 2007
May 18-19, 2007                 Brasov, Romania
Alchemist framework is an alternate approach to the classic
          Internet search engines

                  built on top of decentralised technologies

                allowing users to proactively push information
          into a decentralised “search database”
                                         database”


          using standardised Web Services interfaces,
                                          interfaces,
                   developed within the business & Grid computing communities,
                   and hosted on a Peer-to-Peer (P2P) infrastructure


          The system already interfaces with existing Grid middleware
          (e.g. Globus) through Web Services interfaces
                Globus)




                                     eLiberatica 2007
May 18-19, 2007                      Brasov, Romania
Distributed P2P database framework
          For decentralizing metadata & data


        Based on Web Services technologies

        Unstructured P2P i.e. in the style of super-peers but
          • Allows different overlays to be created (data caching) - dynamic
            grouping
          • Allows different caching policies (replication, forwarding overlays
            etc) for groups
          • Allows sophisticated Grid-style security sign-on, delegation

        Based on existing technologies
          • Triana, P2PS and WSPeer




                                  eLiberatica 2007
May 18-19, 2007                   Brasov, Romania
A framework providing a P2P layer for supporting:
         • pluggable network discovery & caching overlays
         • the ability to execute distributed workflows


     Dynamic overlays can be created on-the-fly for the particular
       application
       => deployed onto the peers through the use of P2P groups:

                   Security services for participating overlay
                   Membership services, Group services

     Workflows packages are dynamically propagated onto the
       network using an overlay of package repositories

        => decentralised layer of package repository cachers
        (created by Alchemist’s dynamic overlay mechanism)


                                   eLiberatica 2007
May 18-19, 2007                    Brasov, Romania
eLiberatica 2007
May 18-19, 2007   Brasov, Romania
Built on existing well-tested technologies:

       Triana Workflow Environment http://www.trianacode.org/
                                    http://www.trianacode.org/
         • Can specify distributed course-grained service workflows
         • Used in radio astronomy, astrophysical simulations, gravitational
           wave analysis, data mining, biodiversity problems, grid-enabled
           medical simulations, environmental science, audio processing etc.

       WSPeer - existing middleware http://www.wspeer.org/
                                        http://www.wspeer.org/
        (Triana’s Web Services toolkit) which provides
         Triana’
         • a SOAP messaging layer (using Web Services / WS-RF)
         • within a P2P network that supports a super-peer topology of
           rendezvous or advert caching peers
            =>to support the scalability of the discovery & access to information
            =>to cache application-specific data, scientific data & metadata
              (not just discovery information)

       P2PS - the underlying P2P environment http://www.trianacode.org/p2ps/
                                              http://www.trianacode.org/p2ps/


                                     eLiberatica 2007
May 18-19, 2007                      Brasov, Romania
Alchemist framework can be customised for
                     particular fields / user-environments

                        =>   Biomedical research
                        =>   Astrophysics research
                        =>   Audio research




                                  eLiberatica 2007
May 18-19, 2007                   Brasov, Romania
   Alchemist toolkit integrates applications, data providers, digital
        content, and algorithms
            => enables the simple composition of mixed-media queries for
        combinational searches
              => to interpret heterogeneous datasets in a logically defined order
              => to multiplex search results
              => produce rich metadata


       Graphical workflow builder => Alchemist provides a
        framework for specifying complex search algorithms,
        using a series of logical search steps
              => application developers do not need to write custom software
        algorithms from scratch
              => are able to create complex queries and data fusion techniques
        in a modular and pluggable fashion




                                  eLiberatica 2007
May 18-19, 2007                   Brasov, Romania
Alchemist Workflows
                   and Data Discovery

                              for

                  Biomedical Research




                       eLiberatica 2007
May 18-19, 2007        Brasov, Romania
   Alchemist infrastructure provides an innovative
           mechanism for biomedical image data search and
           spectral data discovery
            • metadata generated from user and automatic
              annotations is fused
            • semantics can be included as part of the query


          Triana - as a scientific workflow environment -
           has been used in Bioinformatics projects as the
           Grid-based problem-solving environment for collaborative
           exploration and analysis of global biodiversity patterns




                              eLiberatica 2007
May 18-19, 2007               Brasov, Romania
Within the context of Biomedical sciences, the Alchemist
        has a significant potential to support:

      (i) distributed biomedical communities focused on a specific
          disease process

      (ii) disease-oriented collaborative studies which share large
          datasets

      (iii) integrative biology projects that need to analyse inter-
           related information at different levels
          (e.g. clinical, cellular, molecular and genomic)

      (iv) population-based studies (e.g. clinical trials in diabetes or
         cancer)




                               eLiberatica 2007
May 18-19, 2007                Brasov, Romania
For biomedical images, current practice usually involves searching
       databases containing:

    (i) patient data repositories

    (ii) case-oriented reference atlases (i.e. dynamic information across
        spatial and temporal scales of abstraction)

    (iii) training collections (documented biomedical images, either
         anonymised individual or averaged data and training datasets)

    Biomedical audio-visual content and associated metadata can be
       discovered and retrieved, allowing also the visualization of
       particular regions of interest within the images, or anomalies in
       the patterns of spectral data.
    While content-based retrieval is an active field of research, multiple
      modality search (using data from multiple sources and media,
      and data from different levels of biological organisation) can give
      deeper insights into the nature of biological entities and the
      processes they are involved in.

                               eLiberatica 2007
May 18-19, 2007                Brasov, Romania
By using the Alchemist for disease-oriented studies,
        P2P caching could support the search, selection and aggregation
        of similar biomedical and spectral datasets.


        => Caching, using rendezvous peers (as in WSPeer), can be
        adapted to cache similar requests for resources


              => store hits on nodes which are within closer proximity to
              the particular group of researchers interested in the file




                               eLiberatica 2007
May 18-19, 2007                Brasov, Romania
Mobile support is also provided for the Alchemist
      infrastructure

          Clinicians can benefit greatly from the possibility of using a
           mobile device to initiate search operations, retrieve medical
           audio/visual data and associated records, or operate data
           transfer between different static repositories, in a controlled
           and secure way.

          We also address the possibility of using mobile devices for
           workflow management by the remote control of the
           enactment engine from a mobile user interface, for the data
           fusion of different kinds of data and metadata and the
           integration of algorithms designed under a common problem-
           solving environment




                               eLiberatica 2007
May 18-19, 2007                Brasov, Romania
eLiberatica 2007
May 18-19, 2007   Brasov, Romania
Alchemist Workflows

                   for Data Discovery
                                in

                  Diabetic Retinopathy




                       eLiberatica 2007
May 18-19, 2007        Brasov, Romania
Diabetic Retinopathy (DR):

     => All patients with diabetes are at risk of
         developing DR, and its progression to a
         sight-threatening stage is often not detected

     => Disease that accounts for c.80-90% of cases
      of blindness due to diabetes in the UK




                         eLiberatica 2007
May 18-19, 2007          Brasov, Romania
SCOPE:

  Early detection of pathologic mechanisms underlying
    diabetic retinopathy in research and clinical trial
    scenarios:

       • Mechanisms for imaging and spectral data discovery
       • Vertical and horizontal biomedical data integration



  2 SCENARIOS:

       • Early Detection and Prevention of Retinal Disease

       • Investigational Drug Discovery




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania
The study of DR (diabetic retinopathy):

   Customised utilisation of Alchemist multimodal search/workflow
      system, spanning:
      system, spanning:

        • fundamental Biomedical research
        • routine clinical (screening) practices

   Guided by:
       • DRSSW (Diabetic Retinopathy Screening Service for Wales)
       • DRU (Diabetes Research Unit, Llandough Hospital)

      High quality research data collected => factors associated with graded
       outcomes of DR
      Additional data currently being obtained from primary care settings
        -   high-resolution retinal images
        -   associated quantitative physiological, demographic and other variables

       Alchemist system:
         pattern searches at multiple levels
         transformations of signal data
                                     eLiberatica 2007
May 18-19, 2007                      Brasov, Romania
Thank you for attention !

         For more information…



                  Adina.Riposan@contactnet.ro

                  Ian.J.Taylor@cs.cardiff.ac.uk




                             eLiberatica 2007
May 18-19, 2007              Brasov, Romania

Contenu connexe

Tendances

A Framework for transforming archaeological databases to ontological datasets
A Framework for transforming archaeological databases to ontological datasetsA Framework for transforming archaeological databases to ontological datasets
A Framework for transforming archaeological databases to ontological datasets
Monika Solanki
 
100615 htap network_brussels
100615 htap network_brussels100615 htap network_brussels
100615 htap network_brussels
Rudolf Husar
 
Real time data management on wsn
Real time data management on wsnReal time data management on wsn
Real time data management on wsn
TAIWAN
 
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
ijgca
 
Semantic Search on Heterogeneous Wiki Systems - poster
Semantic Search on Heterogeneous Wiki Systems - posterSemantic Search on Heterogeneous Wiki Systems - poster
Semantic Search on Heterogeneous Wiki Systems - poster
Fabrizio Orlandi
 

Tendances (16)

Evaluating content-centric communication over power line communication infras...
Evaluating content-centric communication over power line communication infras...Evaluating content-centric communication over power line communication infras...
Evaluating content-centric communication over power line communication infras...
 
Ontology-Based Routing for Large-Scale Unstructured P2P Publish/Subscribe System
Ontology-Based Routing for Large-Scale Unstructured P2P Publish/Subscribe SystemOntology-Based Routing for Large-Scale Unstructured P2P Publish/Subscribe System
Ontology-Based Routing for Large-Scale Unstructured P2P Publish/Subscribe System
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
A NEW ALGORITHM FOR CONSTRUCTION OF A P2P MULTICAST HYBRID OVERLAY TREE BASED...
A NEW ALGORITHM FOR CONSTRUCTION OF A P2P MULTICAST HYBRID OVERLAY TREE BASED...A NEW ALGORITHM FOR CONSTRUCTION OF A P2P MULTICAST HYBRID OVERLAY TREE BASED...
A NEW ALGORITHM FOR CONSTRUCTION OF A P2P MULTICAST HYBRID OVERLAY TREE BASED...
 
Effects of power conservation, wireless coverage and cooperation on data diss...
Effects of power conservation, wireless coverage and cooperation on data diss...Effects of power conservation, wireless coverage and cooperation on data diss...
Effects of power conservation, wireless coverage and cooperation on data diss...
 
A Framework for transforming archaeological databases to ontological datasets
A Framework for transforming archaeological databases to ontological datasetsA Framework for transforming archaeological databases to ontological datasets
A Framework for transforming archaeological databases to ontological datasets
 
100615 htap network_brussels
100615 htap network_brussels100615 htap network_brussels
100615 htap network_brussels
 
L347176
L347176L347176
L347176
 
New approaches with chord in efficient p2p grid resource discovery
New approaches with chord in efficient p2p grid resource discoveryNew approaches with chord in efficient p2p grid resource discovery
New approaches with chord in efficient p2p grid resource discovery
 
Real time data management on wsn
Real time data management on wsnReal time data management on wsn
Real time data management on wsn
 
CONTENT BASED DATA TRANSFER MECHANISM FOR EFFICIENT BULK DATA TRANSFER IN GRI...
CONTENT BASED DATA TRANSFER MECHANISM FOR EFFICIENT BULK DATA TRANSFER IN GRI...CONTENT BASED DATA TRANSFER MECHANISM FOR EFFICIENT BULK DATA TRANSFER IN GRI...
CONTENT BASED DATA TRANSFER MECHANISM FOR EFFICIENT BULK DATA TRANSFER IN GRI...
 
Proposal of a Transparent Relay System with vNIC for Encrypted Overlay Networks
Proposal of a Transparent Relay System with vNIC for Encrypted Overlay NetworksProposal of a Transparent Relay System with vNIC for Encrypted Overlay Networks
Proposal of a Transparent Relay System with vNIC for Encrypted Overlay Networks
 
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
Analysis of Link State Resource Reservation Protocol for Congestion Managemen...
 
Transfer reliability and congestion control strategies in opportunistic netwo...
Transfer reliability and congestion control strategies in opportunistic netwo...Transfer reliability and congestion control strategies in opportunistic netwo...
Transfer reliability and congestion control strategies in opportunistic netwo...
 
Survey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable MechanismSurvey on Synchronizing File Operations Along with Storage Scalable Mechanism
Survey on Synchronizing File Operations Along with Storage Scalable Mechanism
 
Semantic Search on Heterogeneous Wiki Systems - poster
Semantic Search on Heterogeneous Wiki Systems - posterSemantic Search on Heterogeneous Wiki Systems - poster
Semantic Search on Heterogeneous Wiki Systems - poster
 

En vedette

A Startup Case Study of Open Source Software
A Startup Case Study of Open Source SoftwareA Startup Case Study of Open Source Software
A Startup Case Study of Open Source Software
Leon Ho
 
Software Testing Fundamentals
Software Testing FundamentalsSoftware Testing Fundamentals
Software Testing Fundamentals
Chankey Pathak
 

En vedette (8)

"Open Source for Public Libraries Case Study IBLA Soft Library Automation Sof...
"Open Source for Public Libraries Case Study IBLA Soft Library Automation Sof..."Open Source for Public Libraries Case Study IBLA Soft Library Automation Sof...
"Open Source for Public Libraries Case Study IBLA Soft Library Automation Sof...
 
A Startup Case Study of Open Source Software
A Startup Case Study of Open Source SoftwareA Startup Case Study of Open Source Software
A Startup Case Study of Open Source Software
 
Real Time Marketing & Social Media
Real Time Marketing & Social MediaReal Time Marketing & Social Media
Real Time Marketing & Social Media
 
Online Dating - Examining the drawbacks of online dating
Online Dating - Examining the drawbacks of online datingOnline Dating - Examining the drawbacks of online dating
Online Dating - Examining the drawbacks of online dating
 
research methodology scope of online food services in irctc by BIT mesra NOID...
research methodology scope of online food services in irctc by BIT mesra NOID...research methodology scope of online food services in irctc by BIT mesra NOID...
research methodology scope of online food services in irctc by BIT mesra NOID...
 
Simple Web Design Case Study (Website Design Process Walkthrough)
Simple Web Design Case Study (Website Design Process Walkthrough)Simple Web Design Case Study (Website Design Process Walkthrough)
Simple Web Design Case Study (Website Design Process Walkthrough)
 
Software Testing Fundamentals
Software Testing FundamentalsSoftware Testing Fundamentals
Software Testing Fundamentals
 
Software testing ppt
Software testing pptSoftware testing ppt
Software testing ppt
 

Similaire à "Open Source Software in the Scientific World Case Study Triana Software" by Adina Riposan @ eLiberatica 2007

CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom Italia
Gabriele Bozzi
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
abhiritva
 
Gridcomputingppt
GridcomputingpptGridcomputingppt
Gridcomputingppt
navjasser
 

Similaire à "Open Source Software in the Scientific World Case Study Triana Software" by Adina Riposan @ eLiberatica 2007 (20)

grid mining
grid mininggrid mining
grid mining
 
GridComputing-an introduction.ppt
GridComputing-an introduction.pptGridComputing-an introduction.ppt
GridComputing-an introduction.ppt
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22
 
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
 
CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom Italia
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Inroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar vermaInroduction to grid computing by gargi shankar verma
Inroduction to grid computing by gargi shankar verma
 
IRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on CloudsIRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on Clouds
 
The Role of Machine Learning in Fluid Network Control and Data Planes.pdf
The Role of Machine Learning in Fluid Network Control and Data Planes.pdfThe Role of Machine Learning in Fluid Network Control and Data Planes.pdf
The Role of Machine Learning in Fluid Network Control and Data Planes.pdf
 
Grid computing
Grid computingGrid computing
Grid computing
 
Parking
ParkingParking
Parking
 
Real Time, Web 2.0, and Grid Systems
Real Time, Web 2.0, and Grid Systems Real Time, Web 2.0, and Grid Systems
Real Time, Web 2.0, and Grid Systems
 
Computing Outside The Box
Computing Outside The BoxComputing Outside The Box
Computing Outside The Box
 
grid computing
grid computinggrid computing
grid computing
 
Gridcomputingppt
GridcomputingpptGridcomputingppt
Gridcomputingppt
 
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCENETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
NETWORK TRAFFIC ANALYSIS: HADOOP PIG VS TYPICAL MAPREDUCE
 
CTS Conference Web 2.0 Tutorial Part 1
CTS Conference Web 2.0 Tutorial Part 1CTS Conference Web 2.0 Tutorial Part 1
CTS Conference Web 2.0 Tutorial Part 1
 
Grid Presentation
Grid PresentationGrid Presentation
Grid Presentation
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 

Plus de eLiberatica

Plus de eLiberatica (20)

"Understanding Free Software and Open Source Licensing" by Zak Greant @ eLibe...
"Understanding Free Software and Open Source Licensing" by Zak Greant @ eLibe..."Understanding Free Software and Open Source Licensing" by Zak Greant @ eLibe...
"Understanding Free Software and Open Source Licensing" by Zak Greant @ eLibe...
 
"Sun Open Source Universe" by Vassilis Boulogiorgos @ eLiberatica 2008
"Sun Open Source Universe" by Vassilis Boulogiorgos @ eLiberatica 2008"Sun Open Source Universe" by Vassilis Boulogiorgos @ eLiberatica 2008
"Sun Open Source Universe" by Vassilis Boulogiorgos @ eLiberatica 2008
 
"Komodo - Why we chose to make our product open source" by Shane Caraveo @ eL...
"Komodo - Why we chose to make our product open source" by Shane Caraveo @ eL..."Komodo - Why we chose to make our product open source" by Shane Caraveo @ eL...
"Komodo - Why we chose to make our product open source" by Shane Caraveo @ eL...
 
"Dell and Open Source" by Serban Zirnovan @ eLiberatica 2008
"Dell and Open Source" by Serban Zirnovan @ eLiberatica 2008"Dell and Open Source" by Serban Zirnovan @ eLiberatica 2008
"Dell and Open Source" by Serban Zirnovan @ eLiberatica 2008
 
"SocrateOpen after two years" by Remus Cazacu @ eLiberatica 2008
"SocrateOpen after two years" by Remus Cazacu @ eLiberatica 2008"SocrateOpen after two years" by Remus Cazacu @ eLiberatica 2008
"SocrateOpen after two years" by Remus Cazacu @ eLiberatica 2008
 
"Introducing Red Hat Training Center" by Radu Radulescu @ eLiberatica 2008
"Introducing Red Hat Training Center" by Radu Radulescu @ eLiberatica 2008"Introducing Red Hat Training Center" by Radu Radulescu @ eLiberatica 2008
"Introducing Red Hat Training Center" by Radu Radulescu @ eLiberatica 2008
 
"HP vision Governing the use of open source" by Martin Michlmayr @ eLiberatic...
"HP vision Governing the use of open source" by Martin Michlmayr @ eLiberatic..."HP vision Governing the use of open source" by Martin Michlmayr @ eLiberatic...
"HP vision Governing the use of open source" by Martin Michlmayr @ eLiberatic...
 
"Write the Future Open Standards Open Source OpenOffice" by Louis Suarez-Pott...
"Write the Future Open Standards Open Source OpenOffice" by Louis Suarez-Pott..."Write the Future Open Standards Open Source OpenOffice" by Louis Suarez-Pott...
"Write the Future Open Standards Open Source OpenOffice" by Louis Suarez-Pott...
 
"Open Source Software Middleware for The Internet of Things - Project ASPIRE"...
"Open Source Software Middleware for The Internet of Things - Project ASPIRE"..."Open Source Software Middleware for The Internet of Things - Project ASPIRE"...
"Open Source Software Middleware for The Internet of Things - Project ASPIRE"...
 
"Introducing eConference" by Eugen Rotariu @ eLiberatica 2008
"Introducing eConference" by Eugen Rotariu @ eLiberatica 2008"Introducing eConference" by Eugen Rotariu @ eLiberatica 2008
"Introducing eConference" by Eugen Rotariu @ eLiberatica 2008
 
"Mozilla Messaging and Thunderbird - why and how" by David Ascher @ eLiberati...
"Mozilla Messaging and Thunderbird - why and how" by David Ascher @ eLiberati..."Mozilla Messaging and Thunderbird - why and how" by David Ascher @ eLiberati...
"Mozilla Messaging and Thunderbird - why and how" by David Ascher @ eLiberati...
 
"For the first time in Europe Digital ID providers and OpenID service for Rom...
"For the first time in Europe Digital ID providers and OpenID service for Rom..."For the first time in Europe Digital ID providers and OpenID service for Rom...
"For the first time in Europe Digital ID providers and OpenID service for Rom...
 
"Standing on the Shoulders of Giants" by Brian King @ eLiberatica 2008
"Standing on the Shoulders of Giants" by Brian King @ eLiberatica 2008"Standing on the Shoulders of Giants" by Brian King @ eLiberatica 2008
"Standing on the Shoulders of Giants" by Brian King @ eLiberatica 2008
 
"Legal aspects related to a FLOSS based model business" by Bogdan Manolea @ e...
"Legal aspects related to a FLOSS based model business" by Bogdan Manolea @ e..."Legal aspects related to a FLOSS based model business" by Bogdan Manolea @ e...
"Legal aspects related to a FLOSS based model business" by Bogdan Manolea @ e...
 
"OSS in Public Administrations - A short Report from the European Level" by B...
"OSS in Public Administrations - A short Report from the European Level" by B..."OSS in Public Administrations - A short Report from the European Level" by B...
"OSS in Public Administrations - A short Report from the European Level" by B...
 
"BitDefender - What's Next" by Alexandru Balan @ eLiberatica 2008
"BitDefender - What's Next" by Alexandru Balan @ eLiberatica 2008"BitDefender - What's Next" by Alexandru Balan @ eLiberatica 2008
"BitDefender - What's Next" by Alexandru Balan @ eLiberatica 2008
 
"The Future of Enterprise Content Management" by Aleksander Farstad @ eLibera...
"The Future of Enterprise Content Management" by Aleksander Farstad @ eLibera..."The Future of Enterprise Content Management" by Aleksander Farstad @ eLibera...
"The Future of Enterprise Content Management" by Aleksander Farstad @ eLibera...
 
"Integrating Open Source into Your Business" by Adam Jollans @ eLiberatica 2008
"Integrating Open Source into Your Business" by Adam Jollans @ eLiberatica 2008"Integrating Open Source into Your Business" by Adam Jollans @ eLiberatica 2008
"Integrating Open Source into Your Business" by Adam Jollans @ eLiberatica 2008
 
"Open Source at Microsoft" by Zoli Herczeg @ eLiberatica 2008
"Open Source at Microsoft" by Zoli Herczeg @ eLiberatica 2008"Open Source at Microsoft" by Zoli Herczeg @ eLiberatica 2008
"Open Source at Microsoft" by Zoli Herczeg @ eLiberatica 2008
 
"The Past Present and Future of the Mozilla Foundation" by Zak Greant @ eLibe...
"The Past Present and Future of the Mozilla Foundation" by Zak Greant @ eLibe..."The Past Present and Future of the Mozilla Foundation" by Zak Greant @ eLibe...
"The Past Present and Future of the Mozilla Foundation" by Zak Greant @ eLibe...
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

"Open Source Software in the Scientific World Case Study Triana Software" by Adina Riposan @ eLiberatica 2007

  • 1. Open-Source Software in the Scientific World Case study: Triana Software IAN J.TAYLOR ADINA RIPOSAN Cardiff University, UK Military Technical Academy, Center for Computation & Technology, Technology, Contact Net Ltd, Louisiana University, USA Bucharest, Romania eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 2. TRIANA Workflows Environment eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 3. Open-Source scientific environment for workflow composition: => more than 500 applications are developed based on Triana in support to scientific groups around the world  Can integrate within a number of different distributed environments =>for allowing true heterogeneous computing across different Grids and distributed paradigms  Can specify distributed course-grained service workflows http://www.trianacode.org/ eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 4. Some Examples of Domains  Gravitational wave data analysis (GridOneD)  Radio astronomy (with Manchester)  Astrophysical simulations (Cactus)  Data mining (DIPSO, Data mining Grid)  Biodiversity Problems (Bdworld)  Galaxy visualization  Audio processing and distributed music information retrieval (MIR)  Distributed peer-to-peer simulations (NRL and AgentJ)  Grid-enabled medical simulations GEMSS)  Environmental science (INFERNO)  E-Health (Contact-Net) eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 5. Projects  GEO 600 project - http://www.geo600.uni-hannover.de/ http://www.geo600.uni-hannover.de/ (gravitational wave data analysis)  GridOneD - www.gridoned.org www.gridoned.org (distributed computing - P2P and Grid)  GridLab - http://www.gridlab.org/ http://www.gridlab.org/  DataMiningGrid - http://www.datamininggrid.org/ http://www.datamininggrid.org/  BiodiversityWorld - http://www.bdworld.org/ http://www.bdworld.org/  GEMSS - http://www.ccrl-nece.de/gemss/index.html http://www.ccrl-nece.de/gemss/index.html (Grid-enabled medical simulations)  ReSC - http://www.resc.rdg.ac.uk/projects.php http://www.resc.rdg.ac.uk/projects.  DART - http://www.mrsdart.com http://www.mrsdart.com (Digital Audio Retrieval using Triana) Triana) eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 6. Triana environment is used for problem solving and orchestrating flows of operations/services => fine-grained dataflow applications => course-grained distributed workflow system Workflow is implicit in scientific algorithms that specify:  a series of inter-dependent operations to be executed,  connecting such algorithms in a series of derivations, => when aggregated perform some higher-level task eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 7. Workflows can be: • simple => contain a few components • complex => logic-based support can be integrated to make intelligent decisions about the dynamic evolution of the particular workflow Tasks with inter-dependencies expressed and handled by a computation flow ⇒ the chain of elementary tasks are not necessarily linear ⇒ request a graph of interconnected tasks Workflow management is data driven:  the scientific experiments need to process large datasets  the scheduler responsible for distributing the computational load should take into account the input dataset as well as the workflow graph topology eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 8. Triana Workflows - multimodal, multimedia - eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 9. Based on Triana => The Alchemist Infrastructure  A new paradigm in search and discovery of distributed resources, based on:  multimodal workflows  the coupling of metadata fusion & social tagging with the more traditional index-based search techniques eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 10. Information sources: • http://www.trianacode.org/ • http://www.wspeer.org/ • http://www.trianacode.org/p2ps/ eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 11. WSPeer has been Triana’s Web Services toolkit for the past three years and many projects have used this combination to specify their distributed course-grained service workflows http://www.wspeer.org/ WSPeer - existing middleware which provides: • a SOAP messaging layer (using Web Services / WS-RF) • within a P2P network that supports a super-peer topology of rendezvous or advert caching peers =>to support the scalability of the discovery & access to information =>to cache application-specific data, scientific data & metadata (not just discovery information)  P2PS has been used as the underlying P2P environment during this time http://www.trianacode.org/p2ps/ eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 12. Triana provides a number of different bindings to underlying middleware and therefore a number of possible modes of operation. • Triana on the one hand has a full binding to Java GAT interface, capable of invoking tools and services such as Condor, GridFTP, GRAM, etc. • On the other has integrated with service-based middleware, such as Web Services, WS-RF, Jxta and P2PS. • It also has the capability to dynamically wrap applications remotely behind Web Services interfaces so that existing software can be easily integrated. eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 13. Triana, the GAT and the GAP Service Based Grid Computing: Computing: Job Submission, Deployment, File services discovery and A Graphical Grid communication Computing with distributed Environment or services e.g. P2P Portal and (GSI) Web services GAP Interface GAT Interface Web Condor Unicore GridFTP GRMS WSRF P2PS JXTA Services Globus RLS PBS GridLab .NET P2PS JXTA UDDI Discovery P2PS Discovery SOAP SSH SGE LDR Other.. JXTA eLiberatica 2007 Pipes Pipes May 18-19, 2007 Brasov, Romania
  • 14. Triana Focus  Two core underlying focuses: • Interactive graphical programming of the distributed tasks - complex editing  Intuitive drag/drop flexible editing - copy/paste services, wizards for creating tools/toolboxes, user interfaces, adding nodes and multi-level grouping.  Has been used as a “graphical editor” for other languages, e.g. DAG, VDLx (DAX in progress). • Heterogeneous workflows - Bridge the gap between different distributed environments  Use cross-environment interfaces  led to integration with GAT (pre SAGA), GAP eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 15. Triana allows to spontaneously create and run data analysis algorithms on the data at its source.  Such a component-based object-oriented approach allows scientists to easily create new algorithms that conform to an agreed and defined set of data types and can adapt to different internal parameters  Makes it easy to create individual user interfaces for each component to allow the modification of its internal parameters  Graphical approach => REUSABILITY of existing units => Allows simple type-safe orchestration of data analysis pipelines on-the-fly without the need for code-level reconfiguration. Triana enables project scientists to design and create systems than connect together a number of software components. eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 16. ALCHEMIST Workflows for Data Discovery eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 17. The Alchemist framework: Project at Cardiff University Domain-independent workflows & search mechanism, built on a generic P2P (Peer-to-Peer) architecture, supporting:  distributed database queries  complex search algorithms based on workflows • composed as a collection of Peer-to-Peer overlays, overlays, Grid-based services and distributed workflows ⇒ Uses industry standards such as Web services and SOAP for messaging ⇒ Alchemist framework & tools are extensible & interoperable eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 18. Alchemist framework is an alternate approach to the classic Internet search engines built on top of decentralised technologies allowing users to proactively push information into a decentralised “search database” database” using standardised Web Services interfaces, interfaces,  developed within the business & Grid computing communities,  and hosted on a Peer-to-Peer (P2P) infrastructure The system already interfaces with existing Grid middleware (e.g. Globus) through Web Services interfaces Globus) eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 19. Distributed P2P database framework For decentralizing metadata & data  Based on Web Services technologies  Unstructured P2P i.e. in the style of super-peers but • Allows different overlays to be created (data caching) - dynamic grouping • Allows different caching policies (replication, forwarding overlays etc) for groups • Allows sophisticated Grid-style security sign-on, delegation  Based on existing technologies • Triana, P2PS and WSPeer eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 20. A framework providing a P2P layer for supporting: • pluggable network discovery & caching overlays • the ability to execute distributed workflows Dynamic overlays can be created on-the-fly for the particular application => deployed onto the peers through the use of P2P groups:  Security services for participating overlay  Membership services, Group services Workflows packages are dynamically propagated onto the network using an overlay of package repositories => decentralised layer of package repository cachers (created by Alchemist’s dynamic overlay mechanism) eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 21. eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 22. Built on existing well-tested technologies:  Triana Workflow Environment http://www.trianacode.org/ http://www.trianacode.org/ • Can specify distributed course-grained service workflows • Used in radio astronomy, astrophysical simulations, gravitational wave analysis, data mining, biodiversity problems, grid-enabled medical simulations, environmental science, audio processing etc.  WSPeer - existing middleware http://www.wspeer.org/ http://www.wspeer.org/ (Triana’s Web Services toolkit) which provides Triana’ • a SOAP messaging layer (using Web Services / WS-RF) • within a P2P network that supports a super-peer topology of rendezvous or advert caching peers =>to support the scalability of the discovery & access to information =>to cache application-specific data, scientific data & metadata (not just discovery information)  P2PS - the underlying P2P environment http://www.trianacode.org/p2ps/ http://www.trianacode.org/p2ps/ eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 23. Alchemist framework can be customised for particular fields / user-environments => Biomedical research => Astrophysics research => Audio research eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 24. Alchemist toolkit integrates applications, data providers, digital content, and algorithms => enables the simple composition of mixed-media queries for combinational searches => to interpret heterogeneous datasets in a logically defined order => to multiplex search results => produce rich metadata  Graphical workflow builder => Alchemist provides a framework for specifying complex search algorithms, using a series of logical search steps => application developers do not need to write custom software algorithms from scratch => are able to create complex queries and data fusion techniques in a modular and pluggable fashion eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 25. Alchemist Workflows and Data Discovery for Biomedical Research eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 26. Alchemist infrastructure provides an innovative mechanism for biomedical image data search and spectral data discovery • metadata generated from user and automatic annotations is fused • semantics can be included as part of the query  Triana - as a scientific workflow environment - has been used in Bioinformatics projects as the Grid-based problem-solving environment for collaborative exploration and analysis of global biodiversity patterns eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 27. Within the context of Biomedical sciences, the Alchemist has a significant potential to support: (i) distributed biomedical communities focused on a specific disease process (ii) disease-oriented collaborative studies which share large datasets (iii) integrative biology projects that need to analyse inter- related information at different levels (e.g. clinical, cellular, molecular and genomic) (iv) population-based studies (e.g. clinical trials in diabetes or cancer) eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 28. For biomedical images, current practice usually involves searching databases containing: (i) patient data repositories (ii) case-oriented reference atlases (i.e. dynamic information across spatial and temporal scales of abstraction) (iii) training collections (documented biomedical images, either anonymised individual or averaged data and training datasets) Biomedical audio-visual content and associated metadata can be discovered and retrieved, allowing also the visualization of particular regions of interest within the images, or anomalies in the patterns of spectral data. While content-based retrieval is an active field of research, multiple modality search (using data from multiple sources and media, and data from different levels of biological organisation) can give deeper insights into the nature of biological entities and the processes they are involved in. eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 29. By using the Alchemist for disease-oriented studies, P2P caching could support the search, selection and aggregation of similar biomedical and spectral datasets. => Caching, using rendezvous peers (as in WSPeer), can be adapted to cache similar requests for resources => store hits on nodes which are within closer proximity to the particular group of researchers interested in the file eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 30. Mobile support is also provided for the Alchemist infrastructure  Clinicians can benefit greatly from the possibility of using a mobile device to initiate search operations, retrieve medical audio/visual data and associated records, or operate data transfer between different static repositories, in a controlled and secure way.  We also address the possibility of using mobile devices for workflow management by the remote control of the enactment engine from a mobile user interface, for the data fusion of different kinds of data and metadata and the integration of algorithms designed under a common problem- solving environment eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 31. eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 32. Alchemist Workflows for Data Discovery in Diabetic Retinopathy eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 33. Diabetic Retinopathy (DR): => All patients with diabetes are at risk of developing DR, and its progression to a sight-threatening stage is often not detected => Disease that accounts for c.80-90% of cases of blindness due to diabetes in the UK eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 34. SCOPE: Early detection of pathologic mechanisms underlying diabetic retinopathy in research and clinical trial scenarios: • Mechanisms for imaging and spectral data discovery • Vertical and horizontal biomedical data integration 2 SCENARIOS: • Early Detection and Prevention of Retinal Disease • Investigational Drug Discovery eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 35. The study of DR (diabetic retinopathy): Customised utilisation of Alchemist multimodal search/workflow system, spanning: system, spanning: • fundamental Biomedical research • routine clinical (screening) practices Guided by: • DRSSW (Diabetic Retinopathy Screening Service for Wales) • DRU (Diabetes Research Unit, Llandough Hospital)  High quality research data collected => factors associated with graded outcomes of DR  Additional data currently being obtained from primary care settings - high-resolution retinal images - associated quantitative physiological, demographic and other variables Alchemist system:  pattern searches at multiple levels  transformations of signal data eLiberatica 2007 May 18-19, 2007 Brasov, Romania
  • 36. Thank you for attention ! For more information… Adina.Riposan@contactnet.ro Ian.J.Taylor@cs.cardiff.ac.uk eLiberatica 2007 May 18-19, 2007 Brasov, Romania