SlideShare une entreprise Scribd logo
1  sur  15
Biodiversity Virtual e-Laboratory
An e-Infrastructure and e-Science environment supporting research
on biodiversity



WEB SERVICES INFRASTRUCTURES
FOR BIODIVERSITY SCIENCE

Alex Hardisty
Coordinator, Cardiff University


EUDAT User Forum, 11-12th March 2013, London
Products are “services” and “workflows”
• Workflows allow to process vast
  amounts of data, repeatedly
   – Build your own workflow: select and
     apply successive “services” (data
     analysis and processing steps)
   – Import data from one’s own research
     and/or from existing libraries (i.e. GBIF,
     Catalogue of Life)
• Access a library of workflows and
  re-use existing workflows.
   – Improves efficiency by reducing              Part of a workflow to study the
                                                  ecological niche of the horseshoe crab
     research time and overhead expenses
Creates powerful data processing tools Ecological Niche Modelling
                                        Biogeochemical modelling
 for biodiversity research              Metagenomics
• Carbon Sequestration                  Phylogenetics
                                        Population Modelling
• Ecosystem Functioning and Valuation Taxonomy
                                        Geospatial Visualization
• Invasive Species Management
An international virtual network of experts connecting
2 scientific communities: biodiversity and ICT
• Aims to foster cooperation in the community by:
   –   Discussing scientific use cases
   –   Identifying and deploying important Web Services
   –   Designing and offering workflows
   –   Training scientists
Supported by
       many friends


Fits into a portfolio
of initiatives
  • NoE: ALTER-Net, EDIT/PESI, LTER-Europe, EuroMarine, etc.
  • Projects: 4D4Life, agINFRA, Aquamaps, ArtDataBanken,
    BioFresh, Envri, EU BON, EUBrazilOpenBio, Fauna Iberica,
    i4Life, iMarine, Micro B3, OpenPlantBio, ViBRANT
  • Global: CAMERA, Catalogue of Life, COOPEUS, CReATIVE-B,
    EoL, GBIF, GSC Biodiversity WG, TreeBase, and many more
Important contribution
to infrastructure
BioVeL Tool Spectrum
Workflow design, compute     Concept Knowledge           Domain science

               Technical       Science           Domain
                 PAL            PAL              Scientist




     Taverna         Component           Taverna       Domain-Specific
    Workbench          Builder        Lite / Server        Website
                                                       (Taverna Player)
        High               Workflow Visibility               Low
Biodiversity
Catalogues &                                                                      Catalogue
                                         Workflows
Repositories                                   Components                           Services
                                                                                          BioCatalogue

                                                                    Curators
                                                Pro                              In the
Interfaces
                                                Makers                           Field
Design & Launch
                                                                                 Users                           Third Party
tools                      Taverna                                                             Lite
                                                                                                                 Channels
                          Workbench


                                                           Local        Public      BioVeL




                                                                                                      Services
                                                Data Mgt
                Servers




                                                                                                                   COTS          Shim
                                                           Local
                               Taverna Server              File
                                                                                   Data Mgt
Run time                                                   Stores
                                                                                   Workspace
Execution                                                  Local
                                                                                   Authentication
                                                           Data
                                                                                   Management
                                                           Sets                                                                Domain
                  Server Interaction Server                                        System




Deployment
Infrastructure
                                      Cloud
hosting, compute, storage
We’re at the halfway point
• Several workflows maturing nicely
   – Public Shared: Data refinement, Population modelling, Ecol. niche modelling
   – Beta: Phylogenetic inferencing
   – In the pipe: Biogeochemical process modelling, metagenomics, …

• Using Web services from GBIF, CoL, CRIA, Fraunhofer, INFN, ….
   – Developing new services: viz and data selection, phylo, metagenomics,
     Biome-BGC modelling, pop modelling
• A curated public catalogue of Web services
   – www.biodiversitycatalogue.org
• AWS cloud infrastructure, new user interfaces (tavlite1.biovel.eu)
• Growing profile in community
   – Steady enquiries from potential users and public training workshops
4 questions to address
1. How to use distributed centres to efficiently run
   distributed processing chains?
2. Is there a problem of data exchange?
   (And how to solve this)
3. Deploying codes close to data
4. Access and security issues around managing
   protected services
How to use distributed centres to efficiently
    run distributed processing chains?

Users’ workflows and
applications




Service and Data Providers
(INFN, BioVeL, GBIF, CoL,
EBI, BGBM, etc.)




Resource Providers
(EUDAT, EGI.eu, PRACE,
commercial cloud, etc.)
Is there a problem of data exchange?
            (And how to solve this)
• At simplest level, we need for the user:
   – A "starting place", where a workflow can find the data it needs
   – An "ending place", where a workflow can put its results
   – A "transient place" where temporary data / intermediate results can be
     put and retrieved
• For services we need:
   – Temporary spaces associated with specific services, supporting data
     movements between services
   – Separation of users and separation of workflow runs
• Summarise as :
   – A replicated distributed storage space, accessible to BioVeL services,
     (hence workflows) for both reading and writing; which presents to the
     user as a filespace, native to the user’s local environment.
       • = Dropbox for services, with fast replication between known service
         locations. Today, typically GB not TB
Deploying codes close to data
• BioVeL Appliance
  – A service packaged for DCI, deployed on-demand
  – Working with EGI Fedcloud on this
  – Could be deployed close to data but this only makes sense
    if this would be quicker than moving the data
     • So where is the break-even point?


• Taverna Server deployments
  – In connection with Web Services hosting       Taverna Server
Access and security issues around
      managing protected services
• We need a lightweight and standard solution for
   – User management & single sign-on to our Service Network
   – Permissions system for authorizing access to services
      • Same for Workspace Access Service (user workspace)


                                          User
                               Contract

                                          SP
                               Contract

                                          RP
Access and security issues around
      managing protected services
• We need a lightweight and standard solution for
   – User management & single sign-on to our Service Network
   – Permissions system for authorizing access to services
      • Same for Workspace Access Service (user workspace)
• 3-legged OAuth, extended
   – resource / service is
     independent of BioVeL
     OAuth provider
• Adopt from megx.net
   – marine ecological
     genomics
Questions?

BioVeL is funded by the
European Commission
7th Framework Programme (FP7).
It is part of its e-Infrastructures activity.

BioVeL contributes to LifeWatch and GEO BON.

BioVeL products are free to access.




Under FP7, the e-Infrastructures activity is part of the Research Infrastructures programme,
funded under the FP7 'Capacities' Specific Programme. It focuses on the further development
and evolution of the high-capacity and high-performance communication network (GÉANT),
distributed computing infrastructures (grids and clouds), supercomputer infrastructures,
simulation software, scientific data infrastructures, e-Science services as well as on the adoption
of e-Infrastructures by user communities.

Contenu connexe

Similaire à Eudat user forum-london-11march2013-biovel-v3

Choosing Your Windows Azure Platform Strategy
Choosing Your Windows Azure Platform StrategyChoosing Your Windows Azure Platform Strategy
Choosing Your Windows Azure Platform Strategy
drmarcustillett
 
Jonas On Windows Azure OW2con11, Nov 24-25, Paris
Jonas On Windows Azure OW2con11, Nov 24-25, ParisJonas On Windows Azure OW2con11, Nov 24-25, Paris
Jonas On Windows Azure OW2con11, Nov 24-25, Paris
OW2
 
6.Live Framework 和Mesh Services
6.Live Framework 和Mesh Services6.Live Framework 和Mesh Services
6.Live Framework 和Mesh Services
GaryYoung
 
Viestinnän seminaari 8.11.2012 / Exchange
Viestinnän seminaari 8.11.2012 / ExchangeViestinnän seminaari 8.11.2012 / Exchange
Viestinnän seminaari 8.11.2012 / Exchange
Salcom Group
 
Mach Technology
Mach Technology Mach Technology
Mach Technology
Open Stack
 
IBM Pulse 2013 session - DevOps for Mobile Apps
IBM Pulse 2013 session - DevOps for Mobile AppsIBM Pulse 2013 session - DevOps for Mobile Apps
IBM Pulse 2013 session - DevOps for Mobile Apps
Sanjeev Sharma
 

Similaire à Eudat user forum-london-11march2013-biovel-v3 (20)

Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
 
Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012
 
Leadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in HealthcareLeadership Symposium on Digital Media in Healthcare
Leadership Symposium on Digital Media in Healthcare
 
Linked services for the Web of Data
Linked services for the Web of DataLinked services for the Web of Data
Linked services for the Web of Data
 
Windows Azure Uzerinden Alinabilen Hizmetler
Windows Azure Uzerinden Alinabilen HizmetlerWindows Azure Uzerinden Alinabilen Hizmetler
Windows Azure Uzerinden Alinabilen Hizmetler
 
Windows Azure Üzerinden Alınabilecek Hizmetler
Windows Azure Üzerinden Alınabilecek HizmetlerWindows Azure Üzerinden Alınabilecek Hizmetler
Windows Azure Üzerinden Alınabilecek Hizmetler
 
Knowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge BaseKnowledge Base+: a Cloud-Based Community Knowledge Base
Knowledge Base+: a Cloud-Based Community Knowledge Base
 
Complex Er[jl]ang Processing with StreamBase
Complex Er[jl]ang Processing with StreamBaseComplex Er[jl]ang Processing with StreamBase
Complex Er[jl]ang Processing with StreamBase
 
IT Governance Portals
IT Governance   PortalsIT Governance   Portals
IT Governance Portals
 
ACES QuakeSim 2011
ACES QuakeSim 2011ACES QuakeSim 2011
ACES QuakeSim 2011
 
21st Century Service Oriented Architecture
21st Century Service Oriented Architecture21st Century Service Oriented Architecture
21st Century Service Oriented Architecture
 
Choosing Your Windows Azure Platform Strategy
Choosing Your Windows Azure Platform StrategyChoosing Your Windows Azure Platform Strategy
Choosing Your Windows Azure Platform Strategy
 
Jonas On Windows Azure OW2con11, Nov 24-25, Paris
Jonas On Windows Azure OW2con11, Nov 24-25, ParisJonas On Windows Azure OW2con11, Nov 24-25, Paris
Jonas On Windows Azure OW2con11, Nov 24-25, Paris
 
Saadallah vtls
Saadallah vtlsSaadallah vtls
Saadallah vtls
 
6.Live Framework 和Mesh Services
6.Live Framework 和Mesh Services6.Live Framework 和Mesh Services
6.Live Framework 和Mesh Services
 
RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...RUresearch: Supporting the Management and Preservation of Research Data - Ale...
RUresearch: Supporting the Management and Preservation of Research Data - Ale...
 
02 Ms Online Identity Session 1
02 Ms Online Identity   Session 102 Ms Online Identity   Session 1
02 Ms Online Identity Session 1
 
Viestinnän seminaari 8.11.2012 / Exchange
Viestinnän seminaari 8.11.2012 / ExchangeViestinnän seminaari 8.11.2012 / Exchange
Viestinnän seminaari 8.11.2012 / Exchange
 
Mach Technology
Mach Technology Mach Technology
Mach Technology
 
IBM Pulse 2013 session - DevOps for Mobile Apps
IBM Pulse 2013 session - DevOps for Mobile AppsIBM Pulse 2013 session - DevOps for Mobile Apps
IBM Pulse 2013 session - DevOps for Mobile Apps
 

Plus de Alex Hardisty

Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
Alex Hardisty
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
Alex Hardisty
 
TextofKeynote-EGIforum-15-Sep2010
TextofKeynote-EGIforum-15-Sep2010TextofKeynote-EGIforum-15-Sep2010
TextofKeynote-EGIforum-15-Sep2010
Alex Hardisty
 

Plus de Alex Hardisty (16)

openDS - A new standard for digital specimens
openDS - A new standard for digital specimensopenDS - A new standard for digital specimens
openDS - A new standard for digital specimens
 
Global Research Infrastructures for Biodiversity and Ecosystems Research
Global Research Infrastructures for Biodiversity and Ecosystems ResearchGlobal Research Infrastructures for Biodiversity and Ecosystems Research
Global Research Infrastructures for Biodiversity and Ecosystems Research
 
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) project
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) projectApproach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) project
Approach and outcome of the Biodiversity Virtual e-Laboratory (BioVeL) project
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
 
Constructing bottomup
Constructing bottomupConstructing bottomup
Constructing bottomup
 
Mapping Research Infrastructures with the ENVRI Reference Model
Mapping Research Infrastructures with the ENVRI Reference ModelMapping Research Infrastructures with the ENVRI Reference Model
Mapping Research Infrastructures with the ENVRI Reference Model
 
BioVeL at IBERGRID e-Infrastructures and biodiversity workshop, 19th Septembe...
BioVeL at IBERGRID e-Infrastructures and biodiversity workshop, 19th Septembe...BioVeL at IBERGRID e-Infrastructures and biodiversity workshop, 19th Septembe...
BioVeL at IBERGRID e-Infrastructures and biodiversity workshop, 19th Septembe...
 
Biodiversity Informatics Horizons 2013 - Introduction and Scope
Biodiversity Informatics Horizons 2013 - Introduction and ScopeBiodiversity Informatics Horizons 2013 - Introduction and Scope
Biodiversity Informatics Horizons 2013 - Introduction and Scope
 
Hardistyroberts190313opt 130319072407-phpapp02
Hardistyroberts190313opt 130319072407-phpapp02Hardistyroberts190313opt 130319072407-phpapp02
Hardistyroberts190313opt 130319072407-phpapp02
 
10th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v210th e concertation-brussels-06march2013-v2
10th e concertation-brussels-06march2013-v2
 
Biodiversity Virtual e-Laboratory (BioVeL)
Biodiversity Virtual e-Laboratory (BioVeL)Biodiversity Virtual e-Laboratory (BioVeL)
Biodiversity Virtual e-Laboratory (BioVeL)
 
E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3E cconcertation lyon-22-sep2011-v3
E cconcertation lyon-22-sep2011-v3
 
AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011AH-XLDBEurope-position-09 jun2011
AH-XLDBEurope-position-09 jun2011
 
XldbEuropeEdinburgh-09-jun2011
XldbEuropeEdinburgh-09-jun2011XldbEuropeEdinburgh-09-jun2011
XldbEuropeEdinburgh-09-jun2011
 
TextofKeynote-EGIforum-15-Sep2010
TextofKeynote-EGIforum-15-Sep2010TextofKeynote-EGIforum-15-Sep2010
TextofKeynote-EGIforum-15-Sep2010
 
EGIforum-Amsterdam-15-Sep2010
EGIforum-Amsterdam-15-Sep2010EGIforum-Amsterdam-15-Sep2010
EGIforum-Amsterdam-15-Sep2010
 

Eudat user forum-london-11march2013-biovel-v3

  • 1. Biodiversity Virtual e-Laboratory An e-Infrastructure and e-Science environment supporting research on biodiversity WEB SERVICES INFRASTRUCTURES FOR BIODIVERSITY SCIENCE Alex Hardisty Coordinator, Cardiff University EUDAT User Forum, 11-12th March 2013, London
  • 2.
  • 3. Products are “services” and “workflows” • Workflows allow to process vast amounts of data, repeatedly – Build your own workflow: select and apply successive “services” (data analysis and processing steps) – Import data from one’s own research and/or from existing libraries (i.e. GBIF, Catalogue of Life) • Access a library of workflows and re-use existing workflows. – Improves efficiency by reducing Part of a workflow to study the ecological niche of the horseshoe crab research time and overhead expenses
  • 4. Creates powerful data processing tools Ecological Niche Modelling Biogeochemical modelling for biodiversity research Metagenomics • Carbon Sequestration Phylogenetics Population Modelling • Ecosystem Functioning and Valuation Taxonomy Geospatial Visualization • Invasive Species Management An international virtual network of experts connecting 2 scientific communities: biodiversity and ICT • Aims to foster cooperation in the community by: – Discussing scientific use cases – Identifying and deploying important Web Services – Designing and offering workflows – Training scientists
  • 5. Supported by many friends Fits into a portfolio of initiatives • NoE: ALTER-Net, EDIT/PESI, LTER-Europe, EuroMarine, etc. • Projects: 4D4Life, agINFRA, Aquamaps, ArtDataBanken, BioFresh, Envri, EU BON, EUBrazilOpenBio, Fauna Iberica, i4Life, iMarine, Micro B3, OpenPlantBio, ViBRANT • Global: CAMERA, Catalogue of Life, COOPEUS, CReATIVE-B, EoL, GBIF, GSC Biodiversity WG, TreeBase, and many more Important contribution to infrastructure
  • 6. BioVeL Tool Spectrum Workflow design, compute Concept Knowledge Domain science Technical Science Domain PAL PAL Scientist Taverna Component Taverna Domain-Specific Workbench Builder Lite / Server Website (Taverna Player) High Workflow Visibility Low
  • 7. Biodiversity Catalogues & Catalogue Workflows Repositories Components Services BioCatalogue Curators Pro In the Interfaces Makers Field Design & Launch Users Third Party tools Taverna Lite Channels Workbench Local Public BioVeL Services Data Mgt Servers COTS Shim Local Taverna Server File Data Mgt Run time Stores Workspace Execution Local Authentication Data Management Sets Domain Server Interaction Server System Deployment Infrastructure Cloud hosting, compute, storage
  • 8. We’re at the halfway point • Several workflows maturing nicely – Public Shared: Data refinement, Population modelling, Ecol. niche modelling – Beta: Phylogenetic inferencing – In the pipe: Biogeochemical process modelling, metagenomics, … • Using Web services from GBIF, CoL, CRIA, Fraunhofer, INFN, …. – Developing new services: viz and data selection, phylo, metagenomics, Biome-BGC modelling, pop modelling • A curated public catalogue of Web services – www.biodiversitycatalogue.org • AWS cloud infrastructure, new user interfaces (tavlite1.biovel.eu) • Growing profile in community – Steady enquiries from potential users and public training workshops
  • 9. 4 questions to address 1. How to use distributed centres to efficiently run distributed processing chains? 2. Is there a problem of data exchange? (And how to solve this) 3. Deploying codes close to data 4. Access and security issues around managing protected services
  • 10. How to use distributed centres to efficiently run distributed processing chains? Users’ workflows and applications Service and Data Providers (INFN, BioVeL, GBIF, CoL, EBI, BGBM, etc.) Resource Providers (EUDAT, EGI.eu, PRACE, commercial cloud, etc.)
  • 11. Is there a problem of data exchange? (And how to solve this) • At simplest level, we need for the user: – A "starting place", where a workflow can find the data it needs – An "ending place", where a workflow can put its results – A "transient place" where temporary data / intermediate results can be put and retrieved • For services we need: – Temporary spaces associated with specific services, supporting data movements between services – Separation of users and separation of workflow runs • Summarise as : – A replicated distributed storage space, accessible to BioVeL services, (hence workflows) for both reading and writing; which presents to the user as a filespace, native to the user’s local environment. • = Dropbox for services, with fast replication between known service locations. Today, typically GB not TB
  • 12. Deploying codes close to data • BioVeL Appliance – A service packaged for DCI, deployed on-demand – Working with EGI Fedcloud on this – Could be deployed close to data but this only makes sense if this would be quicker than moving the data • So where is the break-even point? • Taverna Server deployments – In connection with Web Services hosting Taverna Server
  • 13. Access and security issues around managing protected services • We need a lightweight and standard solution for – User management & single sign-on to our Service Network – Permissions system for authorizing access to services • Same for Workspace Access Service (user workspace) User Contract SP Contract RP
  • 14. Access and security issues around managing protected services • We need a lightweight and standard solution for – User management & single sign-on to our Service Network – Permissions system for authorizing access to services • Same for Workspace Access Service (user workspace) • 3-legged OAuth, extended – resource / service is independent of BioVeL OAuth provider • Adopt from megx.net – marine ecological genomics
  • 15. Questions? BioVeL is funded by the European Commission 7th Framework Programme (FP7). It is part of its e-Infrastructures activity. BioVeL contributes to LifeWatch and GEO BON. BioVeL products are free to access. Under FP7, the e-Infrastructures activity is part of the Research Infrastructures programme, funded under the FP7 'Capacities' Specific Programme. It focuses on the further development and evolution of the high-capacity and high-performance communication network (GÉANT), distributed computing infrastructures (grids and clouds), supercomputer infrastructures, simulation software, scientific data infrastructures, e-Science services as well as on the adoption of e-Infrastructures by user communities.