SlideShare une entreprise Scribd logo
1  sur  16
Stian Soiland-Reyes myGrid, School of Computer Science University of Manchester, UK http://taverna.org.uk/ Scufl2: Because a workflow is more than its definition BOSC 2011 Vienna, 2011-07-16 http://www.mygrid.org.uk/ http://www.wf4ever-project.org/
Taverna workflows A set of (local and remote) services to analyze or manage data Nested workflows are also services Data-links connects services i.e. output from service A is input to service B and C Describes the desired dataflow instead of process coordination Automatic iterations Can customize list handling and control links Taverna 2.3 released 2011-07-14
Taverna workflow features Nested workflows Reuse existing components Implicit iterations With customizable list handling Pipelining Process partial iteration results early Parallelisation Run as soon as data is available Retries, failover, looping For stability and conditional testing Plugin-extensible execution control Ideas: caching, error detection, dynamic service lookup
Extensible workflow engine New service types/protocols Execution control like looping/branching/start/stop Service discovery at runtime
Taverna’s existing wf formats Taverna 1: SCUFL 2004-2007 Lightweight XML format Extensions in arbitrary formats (“Put your XML element here”) Parsed and written by many 3rd parties Taverna 2: t2flow 2007-2011 Supports control and service extensions of T2 Annotation system ‘Heavy’ XML serialisation of Java beans Tricky to use by 3rd parties
SCUFL2 motivations Easy to use for third-parties Semantic web/linked open data compatible identifiers and ontology Extensible in a controlled manner Plugins should provide a minimal ‘schema’ But parsers should not need to know about plugin Bundle relevant resources Input/output data, provenance, scripts, binaries Annotations on anything Reproducible workflow results
Research objects Reusable – used as part of new study; Repurposeable – reuse the pieces in a new (and different) study. Substitute alternative data sets, methods; Repeatable – repeat the study, possibly years later; Reproducible – a special case of repeatability with a complete set of information/results to work towards; Replayable – go back and see what happened; Referenceable – cite in publications; Revealable – provenance and audit; Re-interpretable – crossing boundaries; Respectful – appropriate credit and attribution; Retrievable – discover and acquire.
Research Object vision ROs: Aggregationsto support sharing/publication Incorporating methods, data, people Research Objects will allow us to conduct research inways that are Efficient: cheaper to borrow than recreate Effective: larger scale through reuse Ethical: Benefiting wider communities, not just individuals Could I have a copy of your Research Object please?
WfEver project Architecture/implementation for workflow preservation, sharingand reuse Research Object models Workflow Decay, Integrity and Authenticity Workflow Evolution and Recommendation Provenance Driven by Use Cases: Biology & Astronomy …technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows in a range of disciplines.
What is SCUFL2? File format for (Taverna) workflows and data Specification (wiki) Ontology (OWL) Schema (XSD) API (Java +Ruby?) Conversion Tool (Java)
SCUFL2 bundles ZIP archive mimetype META-INF/manifest.xml http://myexperiment.org/workflow/1337 META-INF/container.xml workflowBundle.rdf annotations/myExperiment.rdf http://example.com/largeData workflow/HelloWorld.rdf inputs.rdf workflow/someNestedWf.rdf inputs/data.txt profile/tavernaServer.rdf outputs.rdf http://example.net/tavernaServer outputs/result.txt profile/tavernaCloud.rdf resources/script.pl run/provenance.rdf http://example.org/binary
Bundle manifest “Structured” zip-file  (can be unpacked to be exposed on the web) the-workflow-bundle.scufl2 mimetype application/vnd.taverna.scufl2.workflow-bundle Self-documenting media type (OpenOffice ODF, ePub OCF, Adobe UCF) – for tools like file(1) and mime magic Root file: Primary document of the bundle (ePub OCF/Adobe UCF).  Alternative representations of same workflow bundle allowed (Turtle, JSON, HTML, etc) META-INF/container.xml <container version="1.0“xmlns="urn:oasis:names:tc:opendocument:xmlns:container">     <rootfiles>         <rootfilefull-path="workflowBundle.rdf" media-type="application/rdf+xml" />         <!-- <rootfile full-path="workflowBundle.ttl"          media-type="text/turtle" />  Alternative repr. -->      </rootfiles> </container>
Workflow bundle Unique identifier which can be used as prefix for all relative references in bundle workflowBundle.rdf Suggests main workflow and main profile, but executor could (e.g. by parameter) run a different workflow or profile <rdf:RDF xmlns="http://ns.taverna.org.uk/2010/scufl2#" xmlns:rdf=".."xmlns:rdfs=".."xmlns:xsi=".."xsi:schemaLocation="http://ns.taverna.org.uk/2010/scufl2# http://ns.taverna.org.uk/2010/scufl2/scufl2.xsd .."xsi:type="WorkflowBundleDocument" xml:base="./">       <WorkflowBundle rdf:about="">           <name>HelloWorld</name>           <sameBaseAsrdf:resource= "http://ns.taverna.org.uk/2010/workflowBundle/28f7c..a0ef731/" />                   <mainWorkflow rdf:resource="workflow/HelloWorld/" />           <workflow>             <Workflow rdf:about="workflow/HelloWorld/">               <rdfs:seeAlso rdf:resource="workflow/HelloWorld.rdf" />             </Workflow> <!-- plus each nested <Workflow> -->           </workflow>           <mainProfile rdf:resource="profile/tavernaWorkbench/" />           <profile>             <Profile rdf:about="profile/tavernaWorkbench/">               <rdfs:seeAlso rdf:resource="profile/tavernaWorkbench.rdf" />             </Profile> <!– plus optional alternative <Profile>-->           </profile>           <rdfs:seeAlso rdf:resource="annotation/user_annotations.rdf" />           <rdfs:seeAlso rdf:resource="annotation/myExperiment-wf-765.rdf" />       </WorkflowBundle> </rdf:RDF> Additional (non-executable) annotations and metadata
Technologies and standards OpenOffice ODF manifest ePub OCF/Adobe Universal Container Format (UCF) container rootfile, mimetype RDF/XML and OWL structure and annotations XML Schema for ‘structured’ RDF/XML Open Annotation Collaboration (OAC)? ORI-ORE? Aggregation Open Provenance Model (OPM)?
Status: Want people to try it out Open for suggestions and feedback Beta to be relased in Autumn 2011 Native format for upcoming Taverna 3 in 2012 Load/Save plugin for Taverna 2.3 2011 Q4 Future plans Explore further research object & provenance Converting to/from wf systems like Galaxy? Alpha
http://www.mygrid.org.uk/

Contenu connexe

Plus de Stian Soiland-Reyes

2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architecture2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architectureStian Soiland-Reyes
 
2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator project2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator projectStian Soiland-Reyes
 
2014-06-13 Research objects in the wild
2014-06-13 Research objects in the wild2014-06-13 Research objects in the wild
2014-06-13 Research objects in the wildStian Soiland-Reyes
 
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)Stian Soiland-Reyes
 
2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)Stian Soiland-Reyes
 
2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)Stian Soiland-Reyes
 
2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?Stian Soiland-Reyes
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objectsStian Soiland-Reyes
 
2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow system2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow systemStian Soiland-Reyes
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXTaverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXStian Soiland-Reyes
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Stian Soiland-Reyes
 
Bringing caBIG services together using Taverna
Bringing caBIG services together using TavernaBringing caBIG services together using Taverna
Bringing caBIG services together using TavernaStian Soiland-Reyes
 

Plus de Stian Soiland-Reyes (14)

2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architecture2014-10-31 Taverna 3 architecture
2014-10-31 Taverna 3 architecture
 
2014-10-30 Taverna 3 status
2014-10-30 Taverna 3 status2014-10-30 Taverna 3 status
2014-10-30 Taverna 3 status
 
2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator project2014-10-30 Taverna as an Apache Incubator project
2014-10-30 Taverna as an Apache Incubator project
 
2014-06-13 Research objects in the wild
2014-06-13 Research objects in the wild2014-06-13 Research objects in the wild
2014-06-13 Research objects in the wild
 
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
2013-07-19 myExperiment research objects, beyond workflows and packs (PPTX)
 
2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)
 
2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)2013 06-24 Wf4Ever: Annotating research objects (PPTX)
2013 06-24 Wf4Ever: Annotating research objects (PPTX)
 
2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?2013-03-21 What can provenance do for me?
2013-03-21 What can provenance do for me?
 
2013-01-17 Research Object
2013-01-17 Research Object2013-01-17 Research Object
2013-01-17 Research Object
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 
2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow system2011-06-08 Taverna workflow system
2011-06-08 Taverna workflow system
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXTaverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
 
Bringing caBIG services together using Taverna
Bringing caBIG services together using TavernaBringing caBIG services together using Taverna
Bringing caBIG services together using Taverna
 

Dernier

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Dernier (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

2011 07-16 SCUFL2 - because a workflow is more than its definition (BOSC 2011)

  • 1. Stian Soiland-Reyes myGrid, School of Computer Science University of Manchester, UK http://taverna.org.uk/ Scufl2: Because a workflow is more than its definition BOSC 2011 Vienna, 2011-07-16 http://www.mygrid.org.uk/ http://www.wf4ever-project.org/
  • 2. Taverna workflows A set of (local and remote) services to analyze or manage data Nested workflows are also services Data-links connects services i.e. output from service A is input to service B and C Describes the desired dataflow instead of process coordination Automatic iterations Can customize list handling and control links Taverna 2.3 released 2011-07-14
  • 3. Taverna workflow features Nested workflows Reuse existing components Implicit iterations With customizable list handling Pipelining Process partial iteration results early Parallelisation Run as soon as data is available Retries, failover, looping For stability and conditional testing Plugin-extensible execution control Ideas: caching, error detection, dynamic service lookup
  • 4. Extensible workflow engine New service types/protocols Execution control like looping/branching/start/stop Service discovery at runtime
  • 5. Taverna’s existing wf formats Taverna 1: SCUFL 2004-2007 Lightweight XML format Extensions in arbitrary formats (“Put your XML element here”) Parsed and written by many 3rd parties Taverna 2: t2flow 2007-2011 Supports control and service extensions of T2 Annotation system ‘Heavy’ XML serialisation of Java beans Tricky to use by 3rd parties
  • 6. SCUFL2 motivations Easy to use for third-parties Semantic web/linked open data compatible identifiers and ontology Extensible in a controlled manner Plugins should provide a minimal ‘schema’ But parsers should not need to know about plugin Bundle relevant resources Input/output data, provenance, scripts, binaries Annotations on anything Reproducible workflow results
  • 7. Research objects Reusable – used as part of new study; Repurposeable – reuse the pieces in a new (and different) study. Substitute alternative data sets, methods; Repeatable – repeat the study, possibly years later; Reproducible – a special case of repeatability with a complete set of information/results to work towards; Replayable – go back and see what happened; Referenceable – cite in publications; Revealable – provenance and audit; Re-interpretable – crossing boundaries; Respectful – appropriate credit and attribution; Retrievable – discover and acquire.
  • 8. Research Object vision ROs: Aggregationsto support sharing/publication Incorporating methods, data, people Research Objects will allow us to conduct research inways that are Efficient: cheaper to borrow than recreate Effective: larger scale through reuse Ethical: Benefiting wider communities, not just individuals Could I have a copy of your Research Object please?
  • 9. WfEver project Architecture/implementation for workflow preservation, sharingand reuse Research Object models Workflow Decay, Integrity and Authenticity Workflow Evolution and Recommendation Provenance Driven by Use Cases: Biology & Astronomy …technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows in a range of disciplines.
  • 10. What is SCUFL2? File format for (Taverna) workflows and data Specification (wiki) Ontology (OWL) Schema (XSD) API (Java +Ruby?) Conversion Tool (Java)
  • 11. SCUFL2 bundles ZIP archive mimetype META-INF/manifest.xml http://myexperiment.org/workflow/1337 META-INF/container.xml workflowBundle.rdf annotations/myExperiment.rdf http://example.com/largeData workflow/HelloWorld.rdf inputs.rdf workflow/someNestedWf.rdf inputs/data.txt profile/tavernaServer.rdf outputs.rdf http://example.net/tavernaServer outputs/result.txt profile/tavernaCloud.rdf resources/script.pl run/provenance.rdf http://example.org/binary
  • 12. Bundle manifest “Structured” zip-file (can be unpacked to be exposed on the web) the-workflow-bundle.scufl2 mimetype application/vnd.taverna.scufl2.workflow-bundle Self-documenting media type (OpenOffice ODF, ePub OCF, Adobe UCF) – for tools like file(1) and mime magic Root file: Primary document of the bundle (ePub OCF/Adobe UCF). Alternative representations of same workflow bundle allowed (Turtle, JSON, HTML, etc) META-INF/container.xml <container version="1.0“xmlns="urn:oasis:names:tc:opendocument:xmlns:container">     <rootfiles>         <rootfilefull-path="workflowBundle.rdf" media-type="application/rdf+xml" />         <!-- <rootfile full-path="workflowBundle.ttl"          media-type="text/turtle" /> Alternative repr. -->      </rootfiles> </container>
  • 13. Workflow bundle Unique identifier which can be used as prefix for all relative references in bundle workflowBundle.rdf Suggests main workflow and main profile, but executor could (e.g. by parameter) run a different workflow or profile <rdf:RDF xmlns="http://ns.taverna.org.uk/2010/scufl2#" xmlns:rdf=".."xmlns:rdfs=".."xmlns:xsi=".."xsi:schemaLocation="http://ns.taverna.org.uk/2010/scufl2# http://ns.taverna.org.uk/2010/scufl2/scufl2.xsd .."xsi:type="WorkflowBundleDocument" xml:base="./">       <WorkflowBundle rdf:about="">           <name>HelloWorld</name>           <sameBaseAsrdf:resource= "http://ns.taverna.org.uk/2010/workflowBundle/28f7c..a0ef731/" />                   <mainWorkflow rdf:resource="workflow/HelloWorld/" />           <workflow>             <Workflow rdf:about="workflow/HelloWorld/">               <rdfs:seeAlso rdf:resource="workflow/HelloWorld.rdf" />            </Workflow> <!-- plus each nested <Workflow> -->           </workflow>           <mainProfile rdf:resource="profile/tavernaWorkbench/" />           <profile>            <Profile rdf:about="profile/tavernaWorkbench/">              <rdfs:seeAlso rdf:resource="profile/tavernaWorkbench.rdf" />             </Profile> <!– plus optional alternative <Profile>-->           </profile>           <rdfs:seeAlso rdf:resource="annotation/user_annotations.rdf" />           <rdfs:seeAlso rdf:resource="annotation/myExperiment-wf-765.rdf" />       </WorkflowBundle> </rdf:RDF> Additional (non-executable) annotations and metadata
  • 14. Technologies and standards OpenOffice ODF manifest ePub OCF/Adobe Universal Container Format (UCF) container rootfile, mimetype RDF/XML and OWL structure and annotations XML Schema for ‘structured’ RDF/XML Open Annotation Collaboration (OAC)? ORI-ORE? Aggregation Open Provenance Model (OPM)?
  • 15. Status: Want people to try it out Open for suggestions and feedback Beta to be relased in Autumn 2011 Native format for upcoming Taverna 3 in 2012 Load/Save plugin for Taverna 2.3 2011 Q4 Future plans Explore further research object & provenance Converting to/from wf systems like Galaxy? Alpha
  • 18. Acknowledgements Funded by: EPSRC EP/G026238/1; European Commission’s 7th FWP FP7-ICT-2007-6 270192;  FP7-ICT-2007-6 270137
  • 19. More information Poster 21 http://www.taverna.org.uk/ http://www.wf4ever-project.org/ http://www.mygrid.org.uk/ http://github.com/mygrid/scufl2