SlideShare une entreprise Scribd logo
1  sur  38
BioCatalogue Joined project:   Aim:   Create a registry of  annotated  biological web services  & Funded by:
Timeline and Approach ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
In the Wild Cloud Data Services ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],http://www.mygrid.org.uk/wiki/Mygrid/BiologicalWebServices Variable sustainable stewardship
Digital Curation is… ,[object Object],[object Object],[object Object],http://en.wikipedia.org/wiki/Digital_curation
Curate Processes ,[object Object],[object Object],[object Object],[object Object],A registry A means to pool metadata about services  in the wild A means to discover and reuse those services A means to curate services A platform for service monitoring and analytics
Service and Workflow analytics and network analysis Recommendations and co-use. Social networks of third party externally hosted services Automated diagnostics, monitoring and metadata curation
Finding and Curating Services http://www.biocatalogue.org Drawing on 6 years experience in Taverna of semantic annotation of services using RDF and OWL ontologies. Drawing on experience at EBI in service provision. First pilot early November 2008, will cover major providers (EBI, NCBI, DDBJ) at “bronze” quality and show some at platinum.
Web Services in the Wild ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Writing Reusable stuff is DIFFICULT ,[object Object],[object Object],[object Object],[object Object]
Services Mutability and Preservation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Workflows and Services Curation by Experts Social Curation by the Crowd refine validate refine validate Self-Curation by Contributors seed seed refine validate seed refine validate seed Automated  Curation
Multiple Annotation Profiles User Profile Service Profile Profile Annotation Profile Annotation Profile Annotation Ranking Functions Group Profile
Service Profile Curation Model Quantitative  Content Tags Service Model Semantic  Content Model Ontologies Functional Provenance Operational Operational Metrics Conditions of Use Social Standing 6 facets Versioning QoS Usage
A.N.  Other Execution at Host Service Profile Finding WSDL WADL S-A.N.  Other SAWSDL SA-REST Analytics Ranking Browse/Shop Search Customised Services Workflows Monitoring Profiles Curation Quant’ve  Service Model Semantic  Content Model
Service Profile Facets Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational  Metrics Provenance
Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational  Metrics Provenance Multiply described  Third Party Aggregated Feeds Monitoring Multiple Sources Multiple Versions Dynamic Multiple Instances Discovery Interoperability Composition Reuse Trusted Authorities Policies Ontologies Controlled Vocabularies Tags Free  text Folksonomies Standards W*DL Atom Schemas
Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational  Metrics Provenance Multiply described  Third Party Aggregated Feeds Monitoring Multiple Sources Multiple Versions Dynamic Multiple Instances Discovery Interoperability Composition Reuse Trusted Authorities Policies Ranking
Pay as you Go, Emergent Curation Just enough, Just in Time, not Just in Case. What is the Return for the Investment? Gain Pain Very BAD Good, but Unlikely Just right Folksonomy  Tagging Hard Core full on  Ontology Curation Rich enough metadata for effective reuse
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
myGrid History - Feta
[object Object],[object Object],[object Object]
 
BioCatalogue: The pilot ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Service Coverage + EMBRACE
 
 
 
Roadmap – Perpetual Beta ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Importers Importers Ontology Editor Ontologist BioCatalogue Catalogue Manager Service Providers Service Provider Workbench Domain  Services Bio  Web Services Extraction Importers Curator Workbench Expert Curator Chameleon   change  handler Discovery Service EB-eye Search Scientists Ontology Exporter Curation and Acquisition Tools Discovery Services Backend Catalogue Services Ontology Services “ Shopping” Web  Interface Find-O-Matic Auto Annotation Advanced Finding Web Service Interface BioNanny Monitor Reviewing  Feedback Blogging Tags Service Providers Tool Developers Web Browser Tool Developers Tags Community analysis Service analysis Community Use  Monitor Community Tools + Tags Scientists EB-eye Ranking Matching
Sister Project Close partnership Social Curation Shared Code
Finding, curating and reusing workflows Connecting Scientists in the Wild A supermarket for workflow users. A toolbox for workflow creators. Social networking over commodities. Different disciplines. 1200+ members from 114 countries. 50000+ workflows downloads. 1500-2000 unique visitors / month 460+ workflows. 98 groups. 35+ packs. Running for just over a year. Joint Manchester and Southampton. Project leader: Prof David De Roure
[object Object],[object Object],[object Object],[object Object],[object Object],, http://myexperiment.org
Open and off the shelf….. ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Google Gadget Web 2.0 protocols, Open Archive Initiative,  Linked Open Data,  RESTful APIs, Global, persistent URIs
More Information ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
BioCatalogue Team Thomas Laurent Hamish McWilliams Franck Tanoh Jiten Bhagat Carole Goble Rodrigo Lopez Eric Nzuobontane
my Grid+ Team
Curation Sweatshop ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 

Contenu connexe

Tendances

It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...Violeta Ilik
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zenecaKerstin Forsberg
 
Building an Internet of Genomics
Building an Internet of GenomicsBuilding an Internet of Genomics
Building an Internet of GenomicsMarc Fiume
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Jamie Bisset
 
dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016dkNET
 
Invited talk @ ESIP summer meeting, 2009
Invited talk @ ESIP summer meeting, 2009Invited talk @ ESIP summer meeting, 2009
Invited talk @ ESIP summer meeting, 2009Paolo Missier
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET
 
Isf vivo2013
Isf vivo2013Isf vivo2013
Isf vivo2013mhaendel
 
The Case for Stable VIVO URIs
The Case for Stable VIVO URIsThe Case for Stable VIVO URIs
The Case for Stable VIVO URIsVioleta Ilik
 
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...ORCID, Inc
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Clare Dean
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...FAIRDOM
 

Tendances (14)

FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
Building an Internet of Genomics
Building an Internet of GenomicsBuilding an Internet of Genomics
Building an Internet of Genomics
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction) Publishing your research: Research Data Management (Introduction)
Publishing your research: Research Data Management (Introduction)
 
dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016dkNET ESP Meeting - February 2016
dkNET ESP Meeting - February 2016
 
Invited talk @ ESIP summer meeting, 2009
Invited talk @ ESIP summer meeting, 2009Invited talk @ ESIP summer meeting, 2009
Invited talk @ ESIP summer meeting, 2009
 
dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017dkNET Annual Meeting - June 2017
dkNET Annual Meeting - June 2017
 
Isf vivo2013
Isf vivo2013Isf vivo2013
Isf vivo2013
 
The Case for Stable VIVO URIs
The Case for Stable VIVO URIsThe Case for Stable VIVO URIs
The Case for Stable VIVO URIs
 
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
Improving Discoverability with Unique Identifiers: ORCID, ISNI, and Implement...
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018
 
Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...
 

Similaire à Biocatalogue Talk Slides

BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioCatalogue
 
BioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioCatalogue
 
Getting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueGetting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueBioCatalogue
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleCarole Goble
 
ISMB 2009 Demo Introduction
ISMB 2009 Demo IntroductionISMB 2009 Demo Introduction
ISMB 2009 Demo IntroductionBioCatalogue
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?myGrid team
 
Six Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsSix Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsDavid De Roure
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...Carole Goble
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networksalitora
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentDavid De Roure
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...ResearchSpace
 
Nectar cloud workshop ndj 20110331.2
Nectar cloud workshop ndj 20110331.2Nectar cloud workshop ndj 20110331.2
Nectar cloud workshop ndj 20110331.2Nick Jones
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Rafael C. Jimenez
 

Similaire à Biocatalogue Talk Slides (20)

BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
 
DCC Keynote 2007
DCC Keynote 2007DCC Keynote 2007
DCC Keynote 2007
 
BioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole GobleBioIT 2009 BioCatalogue slides by Carole Goble
BioIT 2009 BioCatalogue slides by Carole Goble
 
Getting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service CatalogueGetting Serious About A Community Bio Service Catalogue
Getting Serious About A Community Bio Service Catalogue
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
If we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote GobleIf we build it will they come? BOSC2012 Keynote Goble
If we build it will they come? BOSC2012 Keynote Goble
 
ISMB 2009 Demo Introduction
ISMB 2009 Demo IntroductionISMB 2009 Demo Introduction
ISMB 2009 Demo Introduction
 
If we build it will they come?
If we build it will they come?If we build it will they come?
If we build it will they come?
 
Six Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower ScientistsSix Principles of Software Design to Empower Scientists
Six Principles of Software Design to Empower Scientists
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Alitora Innovation Networks
Alitora Innovation NetworksAlitora Innovation Networks
Alitora Innovation Networks
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...
 
Nectar cloud workshop ndj 20110331.2
Nectar cloud workshop ndj 20110331.2Nectar cloud workshop ndj 20110331.2
Nectar cloud workshop ndj 20110331.2
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content TypesIlik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
Ilik - Beyond the Manuscript: Using IRs for Non Traditional Content Types
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
 

Plus de BioCatalogue

BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten Bhagat
BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten BhagatBioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten Bhagat
BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten BhagatBioCatalogue
 
BioCatalogue at EMBL-EBI SME Workshop
BioCatalogue at EMBL-EBI SME WorkshopBioCatalogue at EMBL-EBI SME Workshop
BioCatalogue at EMBL-EBI SME WorkshopBioCatalogue
 
ISMB 2010 BioCatalogue presentation
ISMB 2010 BioCatalogue presentationISMB 2010 BioCatalogue presentation
ISMB 2010 BioCatalogue presentationBioCatalogue
 
The Functional Units
The Functional UnitsThe Functional Units
The Functional UnitsBioCatalogue
 
BioCatalogue Poster
BioCatalogue PosterBioCatalogue Poster
BioCatalogue PosterBioCatalogue
 
AHM 2009 BioCatalogue Poster
AHM 2009 BioCatalogue PosterAHM 2009 BioCatalogue Poster
AHM 2009 BioCatalogue PosterBioCatalogue
 
BioCatalogue DILS & Enfin 2009 by Jits
BioCatalogue DILS & Enfin 2009 by JitsBioCatalogue DILS & Enfin 2009 by Jits
BioCatalogue DILS & Enfin 2009 by JitsBioCatalogue
 

Plus de BioCatalogue (7)

BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten Bhagat
BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten BhagatBioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten Bhagat
BioCatalogue Presentation @ Enabling Systems Biology 2011, by Jiten Bhagat
 
BioCatalogue at EMBL-EBI SME Workshop
BioCatalogue at EMBL-EBI SME WorkshopBioCatalogue at EMBL-EBI SME Workshop
BioCatalogue at EMBL-EBI SME Workshop
 
ISMB 2010 BioCatalogue presentation
ISMB 2010 BioCatalogue presentationISMB 2010 BioCatalogue presentation
ISMB 2010 BioCatalogue presentation
 
The Functional Units
The Functional UnitsThe Functional Units
The Functional Units
 
BioCatalogue Poster
BioCatalogue PosterBioCatalogue Poster
BioCatalogue Poster
 
AHM 2009 BioCatalogue Poster
AHM 2009 BioCatalogue PosterAHM 2009 BioCatalogue Poster
AHM 2009 BioCatalogue Poster
 
BioCatalogue DILS & Enfin 2009 by Jits
BioCatalogue DILS & Enfin 2009 by JitsBioCatalogue DILS & Enfin 2009 by Jits
BioCatalogue DILS & Enfin 2009 by Jits
 

Biocatalogue Talk Slides

  • 1. BioCatalogue Joined project: Aim: Create a registry of annotated biological web services & Funded by:
  • 2.
  • 3.
  • 4.
  • 5.
  • 6. Service and Workflow analytics and network analysis Recommendations and co-use. Social networks of third party externally hosted services Automated diagnostics, monitoring and metadata curation
  • 7. Finding and Curating Services http://www.biocatalogue.org Drawing on 6 years experience in Taverna of semantic annotation of services using RDF and OWL ontologies. Drawing on experience at EBI in service provision. First pilot early November 2008, will cover major providers (EBI, NCBI, DDBJ) at “bronze” quality and show some at platinum.
  • 8.
  • 9.
  • 10.
  • 11. Workflows and Services Curation by Experts Social Curation by the Crowd refine validate refine validate Self-Curation by Contributors seed seed refine validate seed refine validate seed Automated Curation
  • 12. Multiple Annotation Profiles User Profile Service Profile Profile Annotation Profile Annotation Profile Annotation Ranking Functions Group Profile
  • 13. Service Profile Curation Model Quantitative Content Tags Service Model Semantic Content Model Ontologies Functional Provenance Operational Operational Metrics Conditions of Use Social Standing 6 facets Versioning QoS Usage
  • 14. A.N. Other Execution at Host Service Profile Finding WSDL WADL S-A.N. Other SAWSDL SA-REST Analytics Ranking Browse/Shop Search Customised Services Workflows Monitoring Profiles Curation Quant’ve Service Model Semantic Content Model
  • 15. Service Profile Facets Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational Metrics Provenance
  • 16. Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational Metrics Provenance Multiply described Third Party Aggregated Feeds Monitoring Multiple Sources Multiple Versions Dynamic Multiple Instances Discovery Interoperability Composition Reuse Trusted Authorities Policies Ontologies Controlled Vocabularies Tags Free text Folksonomies Standards W*DL Atom Schemas
  • 17. Services Interface Neutral Functional Conditions of Use Operational Social Standing Operational Metrics Provenance Multiply described Third Party Aggregated Feeds Monitoring Multiple Sources Multiple Versions Dynamic Multiple Instances Discovery Interoperability Composition Reuse Trusted Authorities Policies Ranking
  • 18. Pay as you Go, Emergent Curation Just enough, Just in Time, not Just in Case. What is the Return for the Investment? Gain Pain Very BAD Good, but Unlikely Just right Folksonomy Tagging Hard Core full on Ontology Curation Rich enough metadata for effective reuse
  • 19.
  • 21.
  • 22.  
  • 23.
  • 25.  
  • 26.  
  • 27.  
  • 28.
  • 29. Importers Importers Ontology Editor Ontologist BioCatalogue Catalogue Manager Service Providers Service Provider Workbench Domain Services Bio Web Services Extraction Importers Curator Workbench Expert Curator Chameleon change handler Discovery Service EB-eye Search Scientists Ontology Exporter Curation and Acquisition Tools Discovery Services Backend Catalogue Services Ontology Services “ Shopping” Web Interface Find-O-Matic Auto Annotation Advanced Finding Web Service Interface BioNanny Monitor Reviewing Feedback Blogging Tags Service Providers Tool Developers Web Browser Tool Developers Tags Community analysis Service analysis Community Use Monitor Community Tools + Tags Scientists EB-eye Ranking Matching
  • 30. Sister Project Close partnership Social Curation Shared Code
  • 31. Finding, curating and reusing workflows Connecting Scientists in the Wild A supermarket for workflow users. A toolbox for workflow creators. Social networking over commodities. Different disciplines. 1200+ members from 114 countries. 50000+ workflows downloads. 1500-2000 unique visitors / month 460+ workflows. 98 groups. 35+ packs. Running for just over a year. Joint Manchester and Southampton. Project leader: Prof David De Roure
  • 32.
  • 33.
  • 34.
  • 35. BioCatalogue Team Thomas Laurent Hamish McWilliams Franck Tanoh Jiten Bhagat Carole Goble Rodrigo Lopez Eric Nzuobontane
  • 37.
  • 38.  

Notes de l'éditeur

  1. The plan for this talk was to highlight what BioCatalogue is and to Give a demo but unfortunately can’t do it because not ready. But will use some screen shot to show you what really going on or what to Expect next from BioCatalogue. Background of the talk: Lots of database and data resources Feta but can’t annotate all the services BioCatalogue
  2. Services are methods too.
  3. Fix, File and Forget is curation in a way….. Assets are used, we hope By applications and scientists who had anticipated using them. By applications and scientists that had not, or in ways that were unanticipated.
  4. Of course it isn’t as clean as that. And highly interrelated.
  5. Workflows are combinations of services. External Not self-contained or isolated Service and Workflow analytics and network analysis Service Diagnostics and monitoring Automated curation
  6. Get service providers involved, get the community involved 3500+ service operations, but only 700ish annotated in Feta. myGrid Service Ontology Annotation and curation pipeline Curation and Discovery tools Other registries: DAS Registry, BioMOBY Central, SeekDa …
  7. Scientists are naughty Reuse is Hard We have to try them to find out what they do… IVOA referred to this too. … I used it last time so it will work again the same way…damn! change location, capabilities and signatures (BioMART changed its interface three times in 2006). new ones appear and existing ones disappear (SeqHound) they decay and become outdated or unreliable.
  8. Services in the Wild are frequently, er, disappointing and hard to use. (Rubbish ™) . Writing reusable workflows is hard. Local services Permissions. Licences What does it DO? Writing reusable services is hard. What does it DO? Predicting the unknown required by the unknown. Finding workflows, services and tools is hard Where do you go?? What does it DO?? Creating web services is still a bottleneck. For quick solutions it is still seen as too much extra trouble.
  9. Ruin Not fix, file, forget Services are not deposited and preserved in software libraries. Rapid metadata heart-beat, especially on operational metadata. Could use previous slide in DCC talk. Shadows Method archives Shadows – what it was that can be used again. They are referred to. No SLA to be stable or standard. Constantly need tending or else they go stale. (cf. IVOA service validation, DAS). Not software libraries BioNanny – using Grid tools Versioning of workflows – Andrea. Regular health checks Use myExperiment to notify scientists with potential problems Use myExperiment to be smart about which services should be monitored. Workflows are deposited but…. Not self-contained. Linking to external services in flux. Or depend on software Incorporating services unavailable to others. Workflow fragility and hence decay. Workflows become plans and provenance rather than working scientific objects unless tended and updated.
  10. In particular a platform for research into curation practices As in the panel today Expert – Is library like Suppliers and crowd are the web side Automated is
  11. Group profile is the interrelationships between the services. Co-reference, Co-use,
  12. Curation includes versioning Analytics includes monitoring
  13. OAIS? From the model point of view. From the standoff annotation point of view. Metadata richness.
  14. Skipped all but the core in talk. OAIS? From the model point of view. From the standoff annotation point of view. Metadata richness.
  15. From the model point of view. From the standoff model neutral annotation point of view Bronze, silver, gold and platinum compliance levels.
  16. Frankly, is it worth it to do the detailed stuff?
  17. Richness spectrum Spoke to it but probably should have skipped The quality and completeness of metadata – graceful decay Platinum to bronz Semantic Web services IVOA talk asked – “why and when Semantics”. Here is an answer. Leads to multiple pipelines and multiple Scientist - Finding Simple classifications on a few properties. Simple queries, reduce search space, final decision with user Biological terms. Heavy use of provenance, reputation, usage patterns, operational properties, example configurations and boring stuff like that. Think Amazon. The interface is the thing. Automation – Validation and Execution Rich metadata for automatic service configuration, invocation and fault management Rich descriptions for reasoning: mismatches, debugging, repair Rich descriptions for reasoning: automated composition Hard and time-consuming
  18. Joint project Manchester-EBI
  19. Technical Infrastructure But its still not all joined up!! Feta keeps coming and going. Grid service descriptions are produced by annotating services with terms from the myGrid ontology, stored in a central registry, GRIMOIRES. Services are found using the Feta discovery service [5]. We have piloted expert manual annotation tools augmented by automated tools using information extraction techniques.
  20. These ae not our scientists or our projects. We have none. Its just scientists in the wild. 50% usa and uk Google analytics says: 1931 uniq visitors for 3rd sept to 3rd oct 1698 uniq visitors for 3rd aug to 2nd sept myExperiment currently has 1203 users , 98 groups , 460 workflows , 130 files and 36 packs Extreme Web 2.0 18 months old Built on Ruby on Rails BSD License Source code hosted on RubyForge Publicly available 2 core developers 50% in Southampton, 50% in Manchester User driven design and development 959 active users 1429 unique IP visits in last month 82 groups 248 group memberships 296 workflow entries, 425 workflow versions 101 files 1382 taggings 46,427 downloads 77,393 viewings 408 creditations 12 packs (with 237 total entries)
  21. Towards repeatable, reproducible, comparable and reusable research
  22. Didn’t go into details
  23. I have no picture of Dave Newman