Reproducibility, dissemination,
and management of modeling results

17 February 2014, Braunschweig

Dagmar Waltemath

http://sems.uni-rostock.de
Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data Management

“We’ve been hearing a common theme from
the academic community – researchers are
having difficulty managing and accessing their
data. It seems to be an ongoing problem for
research scientists, at any stage of their
careers.”
(Nature Blogs: Of Schemes and Dreams (2014) Nine Worrying Stats on the Effect of Poor Scientific Data
Management)

Outline

reproducibility

dissemination

management

“People can’t share knowledge if they don’t
speak a common language”
Tom Davenport, Lawrence Prusak (2000) Working Knowledge

Reproducible modeling results :: Standards

Model
Entities, network of reactions, math
Fig.: Goldbeter (1991), http://www.ncbi.nlm.nih.gov/pubmed/1833774

Annotations
Compartment: Cell GO:0005623
Publication: Goldbeter PMID:1833774
M = inactive CDC2 kinase: UniProt:CDK1A_XENLA
Fig.: BioModels Database
Behavior: Oscillation TEDDY_0000006
Algorithm: Gillespie KiSAO:0000029

Protocols
Fig.: BioModels Database
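
The annotations on this slide pair a model element with a term from an external resource. A minimal sketch of how such pairs might be stored and turned into resolvable links, assuming the identifiers.org compact-identifier pattern (`https://identifiers.org/<prefix>:<accession>`). The `Annotation` class, its field names, and the element labels are illustrative and not taken from the talk:

```python
from dataclasses import dataclass

# Assumed resolver base: identifiers.org resolves "prefix:accession" compact identifiers.
RESOLVER = "https://identifiers.org/"


@dataclass(frozen=True)
class Annotation:
    element: str    # model element being annotated (illustrative label)
    qualifier: str  # relation between element and term, e.g. "is", "isDescribedBy"
    prefix: str     # resource namespace, e.g. "GO", "pubmed"
    accession: str  # identifier within that resource

    def uri(self) -> str:
        """Build the resolvable link for this annotation."""
        return f"{RESOLVER}{self.prefix}:{self.accession}"


# Two of the annotations listed on the slide.
annotations = [
    Annotation("compartment: cell", "is", "GO", "0005623"),
    Annotation("model", "isDescribedBy", "pubmed", "1833774"),
]

for a in annotations:
    print(f"{a.element} --{a.qualifier}--> {a.uri()}")
```

Keeping the qualifier next to the identifier matters for later queries: "is" and "isDescribedBy" say different things about the same model.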

Reproducible modeling results :: Towards publication

[Figure: five numbered elements (1 to 5); labels not recoverable]

Following: Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer

[Quantitative] models will be only as useful as their access and reuse
is easy for all scientists.
Nicolas Le Novère (2006) Model storage, exchange and integration. BMC Neuroscience
Dissemination :: Model curation and annotation

Fig.: Li et al (2010) BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models. BMC Systems Biology
Dissemination :: Public model repositories

1. Higher visibility of research
2. Long-term availability
3. Link to other resources
4. Quality checks

Fig.: Piwowar and Vision (2013) Data reuse and the open data citation advantage. PeerJ
Dissemination :: Quality checks with functional curation

Fig.: Example for functional curation on heart model, http://travis.cs.ox.ac.uk/FunctionalCuration/db.html

Fig.: Cooper et al (under review) Through models to knowledge with virtual experiments
Martin Scharm

“And that’s why we need model management.”
Following: http://www.indiana.edu/~hperp200/images/WhyWeNeedComputer_thumb.png

Management :: Integration of model-related data

“Which models are annotated with ‘Adenosine tri-phosphate’?”
“Which models contain reactions with ATP as reactant and ADP as product?”

• Relations between entities
• Links to concepts in bio-ontologies
• Graph store (Neo4J database)

[Figure: graph of the model document “Tyson1991 Cell Cycle 6 var”. Node labels: Document, Reaction3, C2, CP, pM, Cell. Edge labels: isDescribedBy, hasPart, isContainedIn, asReactant, asProduct, is, isVersionOf. Linked annotations: Pubmed:1831270, Kegg Pathway sce04111, EC-Code:3.1.3.16, Uniprot:P04551, Interpro:IPR006670, GO:0005623]

Fig.: Henkel et al (2012) Considerations of graph-based concepts to manage of computational biology models and associated simulations. INFORMATIK 2012, Braunschweig

Ron Henkel
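
The second question on this slide is a pattern match over the model graph. The sketch below mimics it on a small in-memory edge list instead of a Neo4J store. The relation names (`hasPart`, `asReactant`, `asProduct`) follow the figure labels, but the edge direction (reaction to species), the model names, and the function names are assumptions made for illustration:

```python
from collections import defaultdict

# Toy edge list: (source, relation, target). Model and reaction names are
# made up; only the relation names follow the slide.
EDGES = [
    ("ModelA", "hasPart", "ModelA/r1"),
    ("ModelA/r1", "asReactant", "ATP"),
    ("ModelA/r1", "asProduct", "ADP"),
    ("ModelB", "hasPart", "ModelB/r1"),
    ("ModelB/r1", "asReactant", "ADP"),
    ("ModelB/r1", "asProduct", "ATP"),
    ("ModelC", "hasPart", "ModelC/r7"),
    ("ModelC/r7", "asReactant", "ATP"),
    ("ModelC/r7", "asReactant", "Glucose"),
    ("ModelC/r7", "asProduct", "ADP"),
]


def build_index(edges):
    """Index targets by (source, relation) for direct neighbour lookup."""
    index = defaultdict(set)
    for source, relation, target in edges:
        index[(source, relation)].add(target)
    return index


def models_with_reaction(edges, reactant, product):
    """Return models with at least one reaction that consumes `reactant`
    and produces `product`, sorted by name."""
    index = build_index(edges)
    models = {source for source, relation, _ in edges if relation == "hasPart"}
    hits = set()
    for model in models:
        for reaction in index[(model, "hasPart")]:
            consumes = reactant in index[(reaction, "asReactant")]
            produces = product in index[(reaction, "asProduct")]
            if consumes and produces:
                hits.add(model)
    return sorted(hits)


print(models_with_reaction(EDGES, "ATP", "ADP"))  # ['ModelA', 'ModelC']
```

In a real graph store the species nodes would carry ontology links, so the match could be made on the annotated concept and not on the literal name "ATP".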
Management :: Integration of model-related data

[Figure: the model graph (Document “Tyson1991 Cell Cycle 6 var” with Reaction3, C2, CP, pM, Cell) shown together with a SEDML document (Modelreference, Simulation, Task, Datagenerator, Output, Variable; time, C2, CP) and with the KISAO and SBO ontologies. Edge labels: is_connected, is_mapped_to, isA, isDescribedBy, hasPart, isContainedIn, asReactant, asProduct, is, isVersionOf. Linked annotations: Pubmed:1831270, Kegg Pathway sce04111, EC-Code:3.1.3.16, Uniprot:P04551, Interpro:IPR006670, GO:0005623]

Fig.: Henkel et al (in preparation)
Management :: Combination of methods

Keywords describing a model of interest.

[Figure: workflow over the combined model and simulation graph. Step labels: search, retrieve, select simulation description, add simulation description to simulation software, simulate, compare with paper. Ranked result list (Rank, Name, Format): Tyson’91, Novak’97, Maex’98]

ID: BIOMD0000000005
Authors: Tyson JJ.
Date: 13 Sep 2005 12:31:08
Publication: pubmed:1831270
Species: cdc2k, cyclin …
Reaction: cyclin_cdc2k_dissociation, …

Name: Tyson’91 ODE plot
Model: BIOMD0000000005
Algorithm: ODE solver
Type: time course
Output: plot

Fig.: Following Waltemath et al (2013) Reproducibility of model-based results in systems biology. Springer.
Henkel et al (2010) Ranked retrieval of Computational Biology models. BMC Bioinformatics

Ron Henkel
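
The simulation description on this slide (name, model, algorithm, type, output) is what makes a retrieved model executable. A rough sketch of that idea follows, with the description as a plain record handed to a solver. The numerical settings (`t_end`, `steps`), the function names, and the one-variable decay model are hypothetical stand-ins. The decay model is not the Tyson model, and the fixed-step Runge-Kutta routine is just one possible "ODE solver":

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SimulationDescription:
    name: str
    model: str
    algorithm: str
    sim_type: str
    output: str
    # The slide gives no numerical settings; these defaults are hypothetical.
    t_end: float = 1.0
    steps: int = 100


def rk4_time_course(rhs, x0, t_end, steps):
    """Fixed-step classical Runge-Kutta for a scalar ODE dx/dt = rhs(t, x)."""
    h = t_end / steps
    t, x = 0.0, x0
    course = [(t, x)]
    for _ in range(steps):
        k1 = rhs(t, x)
        k2 = rhs(t + h / 2, x + h * k1 / 2)
        k3 = rhs(t + h / 2, x + h * k2 / 2)
        k4 = rhs(t + h, x + h * k3)
        x += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
        course.append((t, x))
    return course


def run(description, rhs, x0):
    """Execute a description; this sketch handles ODE time courses only."""
    if description.sim_type != "time course" or description.algorithm != "ODE solver":
        raise ValueError("unsupported simulation description")
    return rk4_time_course(rhs, x0, description.t_end, description.steps)


desc = SimulationDescription(
    name="Tyson'91 ODE plot",
    model="BIOMD0000000005",
    algorithm="ODE solver",
    sim_type="time course",
    output="plot",
)

# Stand-in model: first-order decay dx/dt = -x, NOT the Tyson equations.
course = run(desc, lambda t, x: -x, x0=1.0)
print(course[-1])
```

Because the description is data, the same record can be stored next to the model, retrieved with it, and rerun to compare against the published plot.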
Management :: Provenance
“Give me the best matching model published on the Cell Cycle
and considering cdk1.”

Lucene: species:cdk1, compartment:cell, …

Fig.: Waltemath et al (2013) Improving the reuse of computational models through version control. Bioinformatics
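
The slide turns a natural-language request into a fielded query (`species:cdk1, compartment:cell, …`). The sketch below parses such a query and ranks models by how many clauses they match. It is a simplified stand-in, not Lucene's scoring. The index entries, the field weights, and the function names are invented for illustration:

```python
# Hypothetical index entries; field values are illustrative, not repository data.
MODELS = {
    "model-1": {"species": {"cdk1", "cyclin"}, "compartment": {"cell"}},
    "model-2": {"species": {"cdk1"}, "compartment": {"nucleus"}},
    "model-3": {"species": {"calcium"}, "compartment": {"cell"}},
}


def parse_query(text):
    """Split 'field:value, field:value' into lower-cased (field, value) pairs."""
    pairs = []
    for clause in text.split(","):
        field, _, value = clause.strip().partition(":")
        if field and value:
            pairs.append((field.strip().lower(), value.strip().lower()))
    return pairs


def rank(models, query, weights=None):
    """Score each model by the weighted number of matching clauses, best first."""
    weights = weights or {}
    clauses = parse_query(query)
    scored = []
    for model_id, fields in models.items():
        score = sum(
            weights.get(field, 1.0)
            for field, value in clauses
            if value in fields.get(field, set())
        )
        if score > 0:
            scored.append((score, model_id))
    # Descending score, then model id so that ties come out in a stable order.
    return [model_id for _, model_id in sorted(scored, key=lambda p: (-p[0], p[1]))]


print(rank(MODELS, "species:cdk1, compartment:cell"))
```

Per-field weights are one simple way to express that a match on species should count differently from a match on compartment.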
Management :: Model version control

Fig.: courtesy Martin Scharm, BudHat, http://sems.uni-rostock.de/budhat
Martin Scharm
Summary :: SEMS projects & Contributions

ensure reproducibility
foster dissemination
improve management

[Figure: model graph of “Tyson1991 Cell Cycle 6 var”, as on the slide “Management :: Integration of model-related data”]
Thank you for your attention.
Collaborators
Nicolas Le Novère
Christian Rosenke
David Nickerson
Wolfgang Müller
Jonathan Cooper
Falk Schreiber
Jon Olav Vik
SED-ML Editorial Board
Tommy Yu
SBML Editorial Board

HARMONY 2015
Wittenberg

HERMES-Forschungsförderung der Universität Rostock

@SemsProject
