SlideShare une entreprise Scribd logo
1  sur  21
Data sharing
Data management
The SysMO-SEEK
Story
Professor Carole Goble FREng FBCS CITP
University of Manchester, UK
carole.goble@manchester.ac.uk
13 teams
91 institutes, 300 scientists
Multi-site, multi-disciplinary
Each three year duration
Data generation
Data consumption
Data analysis
Data management:
Local – Shared – Long term
Pan European
Systems Biology
http://www.sysmo.net
Own data solutions. wikis, e-Groupware,
PHProjekt, BaseCamp, PLONE, Alfresco, bespoke
commercial … files and spreadsheets.
Extreme caution over sharing.
Modellers vs experimentalist tribalism
Many institutions, many projects, overlapping
memberships, changing membership. Projects
ending, starting, carrying on the same, carrying
on differently.
Legacy
Suspicion
Dynamics
Expert scientists, inexpert informaticians. Few
resources.
Skills
Patchy standards, incomparable data,
afterthought.
Data
Scientist Lab Collaborators Competitors
ProgrammePublished
Post-
Publication
Pre-
Publication
Data mine-ing
“my impression of researchers, and I can
criticize myself in this, is that we’re much
more interested in sharing data when we
mean sharing somebody else’s as opposed
[to] sharing ours.”
E-infrastructure - taking forward the strategy, RIN report, 2010
Competitive advantage.
Adoption.
Kudos & Credit.
Help.
Fame.
Reputation.
Being scooped.
Scrutiny.
Misinterpretation.
Cost.
Blame.
Reputation.
RewardsRisks
Nature 461, 145 (10 September 2009)
1. Sharing
“It’s not ready yet”
“I need to get (another) publication first”
“We don’t have the resources or skills to prepare
it for others, esp. now we finished that project”
“Its faster/easier to do it myself, and will keep the
credit/control too”
“Its not described enough to be usable”
“I don’t trust the quality. Its not reliable enough. Its
too noisy.
“Others won’t use it properly.”
“It’s not worth
my while”“They are my competitors!!”
Pseudo Sharing
2. Preparation for Use
Curation
Standards
Reusability
Reproducibility
Accountability & Quality
Data discipline Silo busting
CIMR Core Information for Metabolomics Reporting
MIABE Minimal Information About a Bioactive Entity
MIACA Minimal Information About a Cellular Assay
MIAME Minimum Information About a Microarray Experiment
MIAME/Env MIAME / Environmental transcriptomic experiment
MIAME/Nutr MIAME / Nutrigenomics
MIAME/Plant MIAME / Plant transcriptomics
MIAME/Tox MIAME / Toxicogenomics
MIAPA Minimum Information About a Phylogenetic Analysis
MIAPAR Minimum Information About a Protein Affinity Reagent
MIAPE Minimum Information About a Proteomics Experiment
MIARE Minimum Information About a RNAi Experiment
MIASE Minimum Information About a Simulation Experiment
MIENS Minimum Information about an ENvironmental Sequence
MIFlowCyt Minimum Information for a Flow Cytometry Experiment
MIGen Minimum Information about a Genotyping Experiment
MIGS Minimum Information about a Genome Sequence
MIMIx Minimum Information about a Molecular Interaction Experiment
MIMPP Minimal Information for Mouse Phenotyping Procedures
MINI Minimum Information about a Neuroscience Investigation
MINIMESS Minimal Metagenome Sequence Analysis Standard
MINSEQE Minimum Information about a high-throughput SeQuencing Experiment
MIPFE Minimal Information for Protein Functional Evaluation
MIQAS Minimal Information for QTLs and Association Studies
MIqPCR Minimum Information about a quantitative Polymerase Chain Reaction experiment
MIRIAM Minimal Information Required In the Annotation of biochemical Models
MISFISHIE Minimum Information Specification For In Situ Hybridization and Immunohistochemistry
Experiments
STRENDA Standards for Reporting Enzymology Data
TBC Tox Biology Checklist
BioPAX : Biological Pathways Exchange http://www.biopax.org/
FuGE Functional Genomics Experimenthttp://www.mibbi.org/index.php/MIBBI_portal
Minimum
Information for
Biological and
Biomedical
Investigations
Metadata Minefield
http://usefulchem.wikispaces.com/page/code/EXPLAN001
http://www.mygrid.org.uk/tools/taverna/
Publishing Process
models
software
methods
scripts
http://openwetware.org
standard operating
procedures
Community Curation
Responsiblity
Blue Collar Science
John Quackenbush
Difficult
and time
consuming
Poor Credit
or Reward
Shabby
Career
Paths &
Prospects
3. Credit Crisis
• Reward sharing, curation and
reuse rather than reinvention.
• Credit. Attribution. Citation.
• For software, methods and
standards too.
• Technical (DataCite.org).
• Cultural (Respected policy).
• Institutional.
• Funding bodies.
4. Infrastructure, Capability & Capacity
• Three year
PhD/project cycle
• Local data control
• Realistic paths to
adoption by busy
people.
• Spreadsheets, wikis,
catalogues and
yellow pages.
• Content and Tools
http://www.biosharing.org
Identity Management
Sharednames DataCite
LSID DOIs ORCID
5. Data Ecosystem
Resources
6. Sustained Resources
• Three year projects.
• Three year lifespan of data (and its software).
• Sunsets and Sustains
• Reinvention rewarded
• Institution.
• Funding councils.
• Funding panels.
• Publishers
• Libraries
• National data centres
• International data centres
Incentives.
Sensitivity to
Behaviours
Infrastructure
Community building
Trusted service
Coordination
Governance
Policy
Capability
Community
Integration
A Partnership
• Software engineers
• Computational scientists
• Experimental Scientists
• Domain informaticians
• Service providers
• Funding agencies
• But the community
credit crisis continues….
Summary
• Science is a complex social activity
undertaken by tribes of people and
dominated by trust issues.
• Infrastructure has to be there and fit for
purpose but its not the real the problem.
• Need a cultural shift (on all sides) that
truly honours data.

Contenu connexe

Tendances

Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesRothamsted Research, UK
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Michel Dumontier
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge DiscoveryMichel Dumontier
 
KnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network MinerKnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network MinerKeywan Hassani-Pak
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance
 
Introducing the KnetMiner Knowledge Graph: things, not strings
Introducing the KnetMiner Knowledge Graph: things, not stringsIntroducing the KnetMiner Knowledge Graph: things, not strings
Introducing the KnetMiner Knowledge Graph: things, not stringsKeywan Hassani-Pak
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to BioinformaticsLeighton Pritchard
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biologyPranavathiyani G
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsChristoph Steinbeck
 
Omics Logic - Bioinformatics 2.0
Omics Logic - Bioinformatics 2.0Omics Logic - Bioinformatics 2.0
Omics Logic - Bioinformatics 2.0Elia Brodsky
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsRai University
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata managementPistoia Alliance
 
Can machines understand the scientific literature
Can machines understand the scientific literatureCan machines understand the scientific literature
Can machines understand the scientific literaturepetermurrayrust
 
Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Sandra Gesing
 
Bioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and AdvantagesBioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and Advantagesbioinformatt
 

Tendances (20)

Interoperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use CasesInteroperable Data for KnetMiner and DFW Use Cases
Interoperable Data for KnetMiner and DFW Use Cases
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
DCC Keynote 2007
DCC Keynote 2007DCC Keynote 2007
DCC Keynote 2007
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
Mrr iti phar_mu
Mrr iti phar_muMrr iti phar_mu
Mrr iti phar_mu
 
KnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network MinerKnetMiner - Knowledge Network Miner
KnetMiner - Knowledge Network Miner
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Introducing the KnetMiner Knowledge Graph: things, not strings
Introducing the KnetMiner Knowledge Graph: things, not stringsIntroducing the KnetMiner Knowledge Graph: things, not strings
Introducing the KnetMiner Knowledge Graph: things, not strings
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...
 
Machine learning in biology
Machine learning in biologyMachine learning in biology
Machine learning in biology
 
Multi-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application DomainsMulti-Omics Bioinformatics across Application Domains
Multi-Omics Bioinformatics across Application Domains
 
Omics Logic - Bioinformatics 2.0
Omics Logic - Bioinformatics 2.0Omics Logic - Bioinformatics 2.0
Omics Logic - Bioinformatics 2.0
 
B.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformaticsB.sc biochem i bobi u-1 introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Can machines understand the scientific literature
Can machines understand the scientific literatureCan machines understand the scientific literature
Can machines understand the scientific literature
 
Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame Workflows for Biological Research at Notre Dame
Workflows for Biological Research at Notre Dame
 
Bioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and AdvantagesBioinformatics, Its Usage and Advantages
Bioinformatics, Its Usage and Advantages
 

En vedette

SEO Crash Course - Go from White Belt to Ninja in Search Optimization
SEO Crash Course - Go from White Belt to Ninja in Search OptimizationSEO Crash Course - Go from White Belt to Ninja in Search Optimization
SEO Crash Course - Go from White Belt to Ninja in Search OptimizationPercussion Software
 
Web Content Management Internal Selling Template
 Web Content Management Internal Selling Template Web Content Management Internal Selling Template
Web Content Management Internal Selling TemplatePercussion Software
 
Values of libraries - Relationships between provision, usage and research out...
Values of libraries - Relationships between provision, usage and research out...Values of libraries - Relationships between provision, usage and research out...
Values of libraries - Relationships between provision, usage and research out...Research Information Network
 
Mark zoekt samenwerking
Mark zoekt samenwerkingMark zoekt samenwerking
Mark zoekt samenwerkingmnieuwenhuizen
 
Pre-Cal 30S December 8, 2008
Pre-Cal 30S December 8, 2008Pre-Cal 30S December 8, 2008
Pre-Cal 30S December 8, 2008Darren Kuropatwa
 
The future of scholarly publishing: where do we go from here?
The future of scholarly publishing: where do we go from here? The future of scholarly publishing: where do we go from here?
The future of scholarly publishing: where do we go from here? Research Information Network
 

En vedette (8)

SEO Crash Course - Go from White Belt to Ninja in Search Optimization
SEO Crash Course - Go from White Belt to Ninja in Search OptimizationSEO Crash Course - Go from White Belt to Ninja in Search Optimization
SEO Crash Course - Go from White Belt to Ninja in Search Optimization
 
Web Content Management Internal Selling Template
 Web Content Management Internal Selling Template Web Content Management Internal Selling Template
Web Content Management Internal Selling Template
 
Values of libraries - Relationships between provision, usage and research out...
Values of libraries - Relationships between provision, usage and research out...Values of libraries - Relationships between provision, usage and research out...
Values of libraries - Relationships between provision, usage and research out...
 
Mark zoekt samenwerking
Mark zoekt samenwerkingMark zoekt samenwerking
Mark zoekt samenwerking
 
Book Rapper We Blog
Book Rapper We BlogBook Rapper We Blog
Book Rapper We Blog
 
Pre-Cal 30S December 8, 2008
Pre-Cal 30S December 8, 2008Pre-Cal 30S December 8, 2008
Pre-Cal 30S December 8, 2008
 
The future of scholarly publishing: where do we go from here?
The future of scholarly publishing: where do we go from here? The future of scholarly publishing: where do we go from here?
The future of scholarly publishing: where do we go from here?
 
What researchers want, and how to pay for it...
What researchers want, and how to pay for it...What researchers want, and how to pay for it...
What researchers want, and how to pay for it...
 

Similaire à Data sharing - Data management - The SysMO-SEEK Story

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...Chris Evelo
 
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun Sequences
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun SequencesTools for Metagenomics with 16S/ITS and Whole Genome Shotgun Sequences
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun SequencesSurya Saha
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global communityExternalEvents
 
DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...Núria Queralt Rosinach
 
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...Amazon Web Services
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
 
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...Fiona Nielsen
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksCarole Goble
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyMaté Ongenaert
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxxRowlet
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016Fiona Nielsen
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917GenomeInABottle
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptxHussainTaqi1
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterNeuro, McGill University
 

Similaire à Data sharing - Data management - The SysMO-SEEK Story (20)

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
 
WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...WikiPathways: how open source and open data can make omics technology more us...
WikiPathways: how open source and open data can make omics technology more us...
 
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun Sequences
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun SequencesTools for Metagenomics with 16S/ITS and Whole Genome Shotgun Sequences
Tools for Metagenomics with 16S/ITS and Whole Genome Shotgun Sequences
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global community
 
DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...DisGeNET: A discovery platform for the dynamical exploration of human disease...
DisGeNET: A discovery platform for the dynamical exploration of human disease...
 
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...
A Step to the Clouded Solution of Scalable Clinical Genome Sequencing (BDT308...
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Large scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biologyLarge scale machine learning challenges for systems biology
Large scale machine learning challenges for systems biology
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptx
 
Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
 
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating CenterInternational Cancer Genomics Consortium (ICGC) Data Coordinating Center
International Cancer Genomics Consortium (ICGC) Data Coordinating Center
 

Plus de Research Information Network

The Museum and the Researcher in the information age: problem – solution – b...
The Museum and the Researcher in the information age:  problem – solution – b...The Museum and the Researcher in the information age:  problem – solution – b...
The Museum and the Researcher in the information age: problem – solution – b...Research Information Network
 
The changing nature of scholarly communication - What does this mean for rese...
The changing nature of scholarly communication - What does this mean for rese...The changing nature of scholarly communication - What does this mean for rese...
The changing nature of scholarly communication - What does this mean for rese...Research Information Network
 
To share or not to share? Researchers' perspective on managing and sharing data
To share or not to share? Researchers' perspective on managing and sharing dataTo share or not to share? Researchers' perspective on managing and sharing data
To share or not to share? Researchers' perspective on managing and sharing dataResearch Information Network
 
UK researchers' behaviour and the drive for open knowledge
UK researchers' behaviour and the drive for open knowledgeUK researchers' behaviour and the drive for open knowledge
UK researchers' behaviour and the drive for open knowledgeResearch Information Network
 
A researcher perspective: what they want and how to pay for it
A researcher perspective: what they want and how to pay for itA researcher perspective: what they want and how to pay for it
A researcher perspective: what they want and how to pay for itResearch Information Network
 
Futures for scholarly journals: a researchers' perspective
Futures for scholarly journals: a researchers' perspectiveFutures for scholarly journals: a researchers' perspective
Futures for scholarly journals: a researchers' perspectiveResearch Information Network
 
RSS feeds what are they and how can they help my research?
RSS feeds what are they and how can they help my research?RSS feeds what are they and how can they help my research?
RSS feeds what are they and how can they help my research?Research Information Network
 
The changing scholarly communications landscape: what does this mean for peer...
The changing scholarly communications landscape: what does this mean for peer...The changing scholarly communications landscape: what does this mean for peer...
The changing scholarly communications landscape: what does this mean for peer...Research Information Network
 

Plus de Research Information Network (20)

The Museum and the Researcher in the information age: problem – solution – b...
The Museum and the Researcher in the information age:  problem – solution – b...The Museum and the Researcher in the information age:  problem – solution – b...
The Museum and the Researcher in the information age: problem – solution – b...
 
A publishers perpsective
A publishers perpsectiveA publishers perpsective
A publishers perpsective
 
The changing nature of scholarly communication - What does this mean for rese...
The changing nature of scholarly communication - What does this mean for rese...The changing nature of scholarly communication - What does this mean for rese...
The changing nature of scholarly communication - What does this mean for rese...
 
What researchers want
What researchers wantWhat researchers want
What researchers want
 
Bloomsbury Conference
Bloomsbury ConferenceBloomsbury Conference
Bloomsbury Conference
 
ELPUB 2010
ELPUB 2010ELPUB 2010
ELPUB 2010
 
Ucl e c ontent 2010 12 may 10
Ucl e c ontent 2010 12 may 10Ucl e c ontent 2010 12 may 10
Ucl e c ontent 2010 12 may 10
 
To share or not to share? Researchers' perspective on managing and sharing data
To share or not to share? Researchers' perspective on managing and sharing dataTo share or not to share? Researchers' perspective on managing and sharing data
To share or not to share? Researchers' perspective on managing and sharing data
 
To share or not to share?
To share or not to share?To share or not to share?
To share or not to share?
 
UK researchers' behaviour and the drive for open knowledge
UK researchers' behaviour and the drive for open knowledgeUK researchers' behaviour and the drive for open knowledge
UK researchers' behaviour and the drive for open knowledge
 
Open knowledge
Open knowledgeOpen knowledge
Open knowledge
 
A researcher perspective: what they want and how to pay for it
A researcher perspective: what they want and how to pay for itA researcher perspective: what they want and how to pay for it
A researcher perspective: what they want and how to pay for it
 
Transitions to open access
Transitions to open accessTransitions to open access
Transitions to open access
 
Futures for scholarly journals: a researchers' perspective
Futures for scholarly journals: a researchers' perspectiveFutures for scholarly journals: a researchers' perspective
Futures for scholarly journals: a researchers' perspective
 
Social Media And Career Development
Social Media And Career DevelopmentSocial Media And Career Development
Social Media And Career Development
 
RSS feeds what are they and how can they help my research?
RSS feeds what are they and how can they help my research?RSS feeds what are they and how can they help my research?
RSS feeds what are they and how can they help my research?
 
The changing scholarly communications landscape: what does this mean for peer...
The changing scholarly communications landscape: what does this mean for peer...The changing scholarly communications landscape: what does this mean for peer...
The changing scholarly communications landscape: what does this mean for peer...
 
STM Innovations Seminar 4 Dec 09
STM Innovations Seminar 4 Dec 09STM Innovations Seminar 4 Dec 09
STM Innovations Seminar 4 Dec 09
 
SCONUL Conference
SCONUL ConferenceSCONUL Conference
SCONUL Conference
 
SciTech Conference
SciTech ConferenceSciTech Conference
SciTech Conference
 

Dernier

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 

Dernier (20)

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 

Data sharing - Data management - The SysMO-SEEK Story

  • 1. Data sharing Data management The SysMO-SEEK Story Professor Carole Goble FREng FBCS CITP University of Manchester, UK carole.goble@manchester.ac.uk
  • 2. 13 teams 91 institutes, 300 scientists Multi-site, multi-disciplinary Each three year duration Data generation Data consumption Data analysis Data management: Local – Shared – Long term Pan European Systems Biology http://www.sysmo.net
  • 3.
  • 4. Own data solutions. wikis, e-Groupware, PHProjekt, BaseCamp, PLONE, Alfresco, bespoke commercial … files and spreadsheets. Extreme caution over sharing. Modellers vs experimentalist tribalism Many institutions, many projects, overlapping memberships, changing membership. Projects ending, starting, carrying on the same, carrying on differently. Legacy Suspicion Dynamics Expert scientists, inexpert informaticians. Few resources. Skills Patchy standards, incomparable data, afterthought. Data
  • 5. Scientist Lab Collaborators Competitors ProgrammePublished Post- Publication Pre- Publication
  • 6. Data mine-ing “my impression of researchers, and I can criticize myself in this, is that we’re much more interested in sharing data when we mean sharing somebody else’s as opposed [to] sharing ours.” E-infrastructure - taking forward the strategy, RIN report, 2010
  • 7. Competitive advantage. Adoption. Kudos & Credit. Help. Fame. Reputation. Being scooped. Scrutiny. Misinterpretation. Cost. Blame. Reputation. RewardsRisks Nature 461, 145 (10 September 2009) 1. Sharing
  • 8. “It’s not ready yet” “I need to get (another) publication first” “We don’t have the resources or skills to prepare it for others, esp. now we finished that project” “Its faster/easier to do it myself, and will keep the credit/control too” “Its not described enough to be usable” “I don’t trust the quality. Its not reliable enough. Its too noisy. “Others won’t use it properly.” “It’s not worth my while”“They are my competitors!!”
  • 10. 2. Preparation for Use Curation Standards Reusability Reproducibility Accountability & Quality Data discipline Silo busting
  • 11. CIMR Core Information for Metabolomics Reporting MIABE Minimal Information About a Bioactive Entity MIACA Minimal Information About a Cellular Assay MIAME Minimum Information About a Microarray Experiment MIAME/Env MIAME / Environmental transcriptomic experiment MIAME/Nutr MIAME / Nutrigenomics MIAME/Plant MIAME / Plant transcriptomics MIAME/Tox MIAME / Toxicogenomics MIAPA Minimum Information About a Phylogenetic Analysis MIAPAR Minimum Information About a Protein Affinity Reagent MIAPE Minimum Information About a Proteomics Experiment MIARE Minimum Information About a RNAi Experiment MIASE Minimum Information About a Simulation Experiment MIENS Minimum Information about an ENvironmental Sequence MIFlowCyt Minimum Information for a Flow Cytometry Experiment MIGen Minimum Information about a Genotyping Experiment MIGS Minimum Information about a Genome Sequence MIMIx Minimum Information about a Molecular Interaction Experiment MIMPP Minimal Information for Mouse Phenotyping Procedures MINI Minimum Information about a Neuroscience Investigation MINIMESS Minimal Metagenome Sequence Analysis Standard MINSEQE Minimum Information about a high-throughput SeQuencing Experiment MIPFE Minimal Information for Protein Functional Evaluation MIQAS Minimal Information for QTLs and Association Studies MIqPCR Minimum Information about a quantitative Polymerase Chain Reaction experiment MIRIAM Minimal Information Required In the Annotation of biochemical Models MISFISHIE Minimum Information Specification For In Situ Hybridization and Immunohistochemistry Experiments STRENDA Standards for Reporting Enzymology Data TBC Tox Biology Checklist BioPAX : Biological Pathways Exchange http://www.biopax.org/ FuGE Functional Genomics Experimenthttp://www.mibbi.org/index.php/MIBBI_portal Minimum Information for Biological and Biomedical Investigations Metadata Minefield
  • 14. Blue Collar Science John Quackenbush Difficult and time consuming Poor Credit or Reward Shabby Career Paths & Prospects
  • 15. 3. Credit Crisis • Reward sharing, curation and reuse rather than reinvention. • Credit. Attribution. Citation. • For software, methods and standards too. • Technical (DataCite.org). • Cultural (Respected policy). • Institutional. • Funding bodies.
  • 16. 4. Infrastructure, Capability & Capacity • Three year PhD/project cycle • Local data control • Realistic paths to adoption by busy people. • Spreadsheets, wikis, catalogues and yellow pages. • Content and Tools
  • 18. 6. Sustained Resources • Three year projects. • Three year lifespan of data (and its software). • Sunsets and Sustains • Reinvention rewarded • Institution. • Funding councils. • Funding panels. • Publishers • Libraries • National data centres • International data centres
  • 19. Incentives. Sensitivity to Behaviours Infrastructure Community building Trusted service Coordination Governance Policy Capability Community Integration
  • 20. A Partnership • Software engineers • Computational scientists • Experimental Scientists • Domain informaticians • Service providers • Funding agencies • But the community credit crisis continues….
  • 21. Summary • Science is a complex social activity undertaken by tribes of people and dominated by trust issues. • Infrastructure has to be there and fit for purpose but its not the real the problem. • Need a cultural shift (on all sides) that truly honours data.

Notes de l'éditeur

  1. Learn about JISC’s work in the area of shared services for STEM subjects, particularly the JANET network service and virtual research environments (i.e., web tools for helping research processes) Explore new opportunities for research being opened up via shared services, and also the economic savings this creates Consider the role their university might play in providing a shared service to other institutions
  2. Nor major data centres but long tail
  3. Data pipeline Data funnel Fuzzy line between collaborators and competitors Usb drives, wikis, databadsaes, Disributed in email etc.
  4. Sharing without fear
  5. MaDaM project Competitive advantage. Academic vanity. Adoption. Reputation. Acceleration. Novel insights. Help. Scrutiny. Being scooped. Misinterpretation. Reputation. Trust. Not comprehensible Competitive advantage. Academic vanity. Reputation. Adoption Scrutiny. Being scooped. Misinterpretation. New Reward Schemes But we have to aware of the drivers for collaboration. Competitive advantage. Be the first with the Nature paper. Academic vanity Credit, credibility, fame, acclaim, recognition, peer respect, reputation. Adoption Get my stuff adopted / recognised More funding Being found out Open to rigorous inspection. Being scooped Beaten by lab X Protecting my turf. Releasing results too early. Getting left behind. Being out of fashion. Looking stupid Being misinterpreted or misrepresented. Looking stupid. Losing control. Taking a risk
  6. Some excuses
  7. Genomics Standards Consortium http://gensc.org/gc_wiki/index.php/MIBBI_workshop All or nothing
  8. Scripts, workflows, simulations, experimental plans statistical models…. Repeatable, reproducible, comparable and reusable research. Propagate expertise Build reputation.
  9. Credit, Citation, Career Personal and institutional visibility Scholarly citation metrics
  10. contribute, curate, review, reuse. Data is not respected . John Quackenbush - John Quackenbush - Professor of Computational Biology and Bioinformatics - Department of Biostatistics - Harvard School of Public Health.
  11. 58% developed by students, 24% stated not maintained (Schultheiss et al. (2010) PLoS Comp Biol (in review)) Tools, commons Preparing data for sharing is free like puppies are free
  12. National Centre for BioOntologies The Open Biological and Biomedical Ontologies Standardise messages not structures Only as good as your data services Minimum models and Controlled vocabularies 63% 47%
  13. 58% developed by students, 24% stated not maintained (Schultheiss et al. (2010) PLoS Comp Biol (in review)) Tools, commons Preparing data for sharing is free like puppies are free Doi’s cost
  14. Hard core are the PALs Commons-based Cleanup ● Manual and automated curation workflows ● Curators emergent and assigned ● Curation tools Incentives Right time right place – also email! Third party curation is really hard Expert curation Classification Weeding Added value Structured metadata Prompting Classification Filtering Facetted browsing Time to get organised One example workflow can be found at: http://www.myexperiment.org/workflows/16 This the the old example workflow, but I have tagged as a benchmark. You can see the breakdown of tags given to this at: http://www.myexperiment.org/workflows/16/curation ... or by clicking on the breakdown section (see attached image). 14 curation tags Some are slightly ambiguous and others have little meaning These were:    * test workflow    * component - part of whole solution    * whole solution    * tutorial / example    * incomplete    * junk    * obsolete - deprecated    * runnable    * not runnable    * requires description    * requires credit / attribution    * requires example input data    * description; [Description Text]    * example data; [port : value] Each tag was preceeded by a "c:" so that it would be picked up by the myExperiment plugin and could be differentiated from other myExperiment tags. If some example data was known, I tried to add it to using the example tag "example data; [port : value]", where the port name is given, along with the data to be put into the port. The whole process was very time consuming, as I had to try and open each workflow in T2, run it using some example data (or figure out what it did and run it with lots of test data), and then add each comment (checking each workflow on myExperiment to see if it had complete properly.
  15. Add url here
  16. E-Lab and Taverna – all my software - elephants ---- elephant in the room, blind men and elephants, danger of being white elephants? SysMO And other e-Science projects Each of these apply to all our projects. Just one of them is not enough. Not even for Taverna. To sustain it as a service we must sustain the software and the content in its repositories