Computational Approaches to Systems Biology
Michael Hucka, Ph.D.
Department of Computing + Mathematical Sciences
California Institute of Technology
Pasadena, CA, USA
The Kinghorn Cancer Centre, Australia, August 2013
Email: mhucka@caltech.edu Twitter: @mhucka
Outline
Background and introduction
The Systems Biology Markup Language (SBML)
Complementary efforts: MIRIAM and SED-ML
COMBINE: the Computational Modeling in Biology Network
Conclusion
Background and introduction
Research today: experimentation, computation, cogitation
“The nature of systems biology”, Bruggeman & Westerhoff, Trends Microbiol. 15 (2007).
Large-scale integrative models are growing
Many models have traditionally been published this way
Problems:
• Errors in printing
• Missing information
• Dependencies on implementation
• Outright errors
• Can be a huge effort to recreate
Is it enough to communicate the model in a paper?
Is it enough to make your (software X) code available?
It’s vital for good science:
• Someone with access to the same software can try to run it, understand it, verify the computational results, build on them, etc.
• Opinion: you should always do this in any case
But it’s still not ideal for communication of scientific results:
• Doesn’t necessarily encode biological semantics of the model
• What if they don’t have access to the same software?
• What if they don’t want to use that software?
• What if they want to use a different conceptual framework?
• And how will people be able to relate the model to other work?
Different tools mean different interfaces & languages
The Systems Biology Markup Language (SBML)
SBML: a lingua franca for software
SBML = Systems Biology Markup Language
Format for representing computational models of biological processes
• Data structures + usage principles + serialization to XML
• (Mostly) declarative, not procedural; not a scripting language
Neutral with respect to modeling framework
• E.g., ODE, stochastic systems, etc.
Important: software reads/writes SBML, not humans
The raw SBML (as XML)
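For a sense of what raw SBML looks like, here is a minimal hand-written sketch. It is not the model shown in the talk: the ids (toy_export, mRNAn, mRNAc, export) are made up for illustration, and the fragment assumes SBML Level 3 Version 1 Core. The Python around it uses only the standard library to pull out the species and the reaction, which is roughly the kind of reading a tool does on a modeler's behalf.

```python
import xml.etree.ElementTree as ET

# Hand-written illustration; every id below is hypothetical.
SBML_TEXT = """<sbml xmlns="http://www.sbml.org/sbml/level3/version1/core" level="3" version="1">
  <model id="toy_export">
    <listOfCompartments>
      <compartment id="n" constant="true"/>
      <compartment id="c" constant="true"/>
    </listOfCompartments>
    <listOfSpecies>
      <species id="mRNAn" compartment="n" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
      <species id="mRNAc" compartment="c" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
    </listOfSpecies>
    <listOfReactions>
      <reaction id="export" reversible="false" fast="false">
        <listOfReactants>
          <speciesReference species="mRNAn" stoichiometry="1" constant="true"/>
        </listOfReactants>
        <listOfProducts>
          <speciesReference species="mRNAc" stoichiometry="1" constant="true"/>
        </listOfProducts>
      </reaction>
    </listOfReactions>
  </model>
</sbml>
"""

NS = {"sbml": "http://www.sbml.org/sbml/level3/version1/core"}


def summarize(sbml_text):
    """Return ({species id: compartment id}, {reaction id: (reactants, products)})."""
    root = ET.fromstring(sbml_text)
    model = root.find("sbml:model", NS)
    species = {
        s.get("id"): s.get("compartment")
        for s in model.findall("sbml:listOfSpecies/sbml:species", NS)
    }
    reactions = {}
    for r in model.findall("sbml:listOfReactions/sbml:reaction", NS):
        reactants = [ref.get("species") for ref in
                     r.findall("sbml:listOfReactants/sbml:speciesReference", NS)]
        products = [ref.get("species") for ref in
                    r.findall("sbml:listOfProducts/sbml:speciesReference", NS)]
        reactions[r.get("id")] = (reactants, products)
    return species, reactions


species, reactions = summarize(SBML_TEXT)
```

Generic XML parsing like this does no SBML validation; a dedicated library is the better choice for real work.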
Core SBML concepts are fairly simple
The process is central
• Literally called a “reaction” in SBML
• Participants are pools of entities (biochemical species)
Models can further include:
• Compartments
• Other constants & variables
• Discontinuous events
• Other, explicit math
• Unit definitions
• Annotations
Some basics of SBML core model encoding
[Diagram: two compartments, c and n, holding the species pools gene, mRNAn, mRNAc, protein A, and protein B]
• Compartments are well-stirred
• Species pools are located in compartments
• Reactions can involve any species anywhere
• Reactions can cross compartment boundaries
• Reaction/process rates can be (almost) arbitrary formulas
- Shown in the diagram as f1(x), f2(x), f3(x), f4(x), f5(x)
• “Rules”: equations expressing relationships in addition to the reaction system
- Shown in the diagram as g1(x), g2(x), ...
• “Events”: discontinuous actions triggered by system conditions
- Event1: when (...condition...), do (...assignments...)
- Event2: when (...condition...), do (...assignments...)
- ...
• Annotations: machine-readable semantics and links to other resources
- “This event represents ...”
- “This is identified by GO id # ...”
- “This is an enzymatic reaction with EC # ...”
- “This is a transport into the nucleus ...”
- “This compartment represents the nucleus ...”
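To make the rate formulas, rules, and events above concrete, here is a minimal sketch in plain Python. It is not SBML and not how any particular simulator works: the rate constant, the initial amounts, the event threshold, and the fixed-step Euler update are all assumptions chosen for illustration. One reaction moves mRNA from mRNAn to mRNAc, a rule derives the total, and an event fires each time its trigger switches from false to true.

```python
def simulate(t_end=10.0, dt=0.01, k_export=0.5):
    """Toy fixed-step simulation of one reaction, one rule, and one event."""
    # Species pools (hypothetical initial amounts).
    state = {"mRNAn": 10.0, "mRNAc": 0.0}
    events_fired = 0
    previous_trigger = False

    for _ in range(int(round(t_end / dt))):
        # Reaction rate: here mass action, but it could be any formula of the state.
        rate = k_export * state["mRNAn"]
        state["mRNAn"] -= rate * dt
        state["mRNAc"] += rate * dt

        # Event: "when mRNAn < 1, do mRNAn := mRNAn + 5".
        # It fires only on a false -> true transition of the trigger.
        trigger = state["mRNAn"] < 1.0
        if trigger and not previous_trigger:
            state["mRNAn"] += 5.0
            events_fired += 1
        previous_trigger = trigger

    # Rule: a quantity defined by an equation rather than by a reaction.
    total_mrna = state["mRNAn"] + state["mRNAc"]
    return {"state": state, "total_mRNA": total_mrna, "events_fired": events_fired}


result = simulate()
```

Because the reaction only moves material between pools, the total changes only when the event adds to mRNAn, which gives a simple sanity check on the run.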
BioModels Database
http://biomodels.net/biomodels
Contents of BioModels Database
Contents today:
• 142,000+ pathway models (converted from KEGG)
• 460+ hand-curated quantitative models
• 460+ non-curated quantitative models
[Pie chart of model categories; slice sizes: 8%, 2%, 3%, 6%, 6%, 7%, 8%, 9%, 24%, 27%]
Categories shown:
• signal transduction
• metabolic process
• multicellular organismal process
• rhythmic process
• cell cycle
• homeostatic process
• response to stimulus
• cell death
• localization
• others (e.g., developmental process)
Database data from 2013
Find software in the SBML Software Guide
Results of 2011 survey of SBML-compatible software
Question: Which of the following categories best describe your software? (Check all that apply.)
Out of 81 responses
• Simulation software
• Analysis s/w (in addition, or instead of, simulation)
• Creation/model development software
• Visualization/display/formatting software
• Utility software (e.g., format conversion)
• Data integration and management software
• Repository or database
• Framework or library (for use in developing s/w)
• S/w for interactive env. (e.g., MATLAB, R, ...)
• Annotation software
[Bar chart: number of responses per category, axis 0 to 80; bar values 11, 13, 13, 14, 16, 23, 31, 31, 40, 42]
Some particularly full-featured, general simulation tools
COPASI: ODE & stochastic simulation, parameter scanning, plotting
Virtual Cell: web-based environment, spatial models
iBioSim: special features for genetic circuit models for synthetic biology
SBW (Systems Biology Workbench): component-based toolkit
SBMLsimulator: Java-based simulator, web-start or stand-alone
CellDesigner: graphical editing, SBGN support, SABIO-RK integration
Free software libraries – libSBML
Reads, writes, validates SBML
Can check & convert units
Written in portable C++
Runs on Linux, Mac, Windows
APIs for C, C++, C#, Java, Octave,
Perl, Python, R, Ruby, MATLAB
Well documented API
Open-source (LGPL)
http://sbml.org/Software/libSBML
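As a rough illustration of the "reads, writes, validates" role, here is a short sketch using libSBML's Python bindings. It assumes the python-libsbml package is installed and returns None otherwise. The model text is a hand-written toy with made-up ids, not a model from the talk.

```python
try:
    import libsbml  # provided by the python-libsbml package, if installed
except ImportError:
    libsbml = None

# Hand-written toy model (SBML Level 3 Version 1 Core); all ids are hypothetical.
SBML_TEXT = """<?xml version="1.0" encoding="UTF-8"?>
<sbml xmlns="http://www.sbml.org/sbml/level3/version1/core" level="3" version="1">
  <model id="toy_export">
    <listOfCompartments>
      <compartment id="n" constant="true"/>
      <compartment id="c" constant="true"/>
    </listOfCompartments>
    <listOfSpecies>
      <species id="mRNAn" compartment="n" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
      <species id="mRNAc" compartment="c" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
    </listOfSpecies>
    <listOfReactions>
      <reaction id="export" reversible="false" fast="false">
        <listOfReactants>
          <speciesReference species="mRNAn" stoichiometry="1" constant="true"/>
        </listOfReactants>
        <listOfProducts>
          <speciesReference species="mRNAc" stoichiometry="1" constant="true"/>
        </listOfProducts>
      </reaction>
    </listOfReactions>
  </model>
</sbml>
"""


def summarize(sbml_text):
    """Return a small summary dict, or None if libSBML is missing or no model was read."""
    if libsbml is None:
        return None
    doc = libsbml.readSBMLFromString(sbml_text)
    model = doc.getModel()
    if model is None:
        return None
    return {
        "problems_reported": doc.getNumErrors(),  # issues logged while reading
        "species": [model.getSpecies(i).getId()
                    for i in range(model.getNumSpecies())],
        "reactions": [model.getReaction(i).getId()
                      for i in range(model.getNumReactions())],
    }


summary = summarize(SBML_TEXT)
```

Reading a file from disk works the same way with libsbml.readSBML and a path in place of the string.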
Evolution of SBML continues
Today: SBML Level 3
• Level 3 Core provides framework for common models
• Level 3 packages add additional constructs to the Core
Level 3 package                   What it enables                                   Status
Hierarchical model composition    Models containing submodels                       ✔
Flux balance constraints          Constraint-based models                           ✔
Qualitative models                Petri net models, Boolean models                  ✔
Graph layout                      Diagrams of models                                ✔
Multicomponent/state species      Entities w/ structure; also rule-based models     draft
Spatial                           Nonhomogeneous spatial models                     draft
Graph rendering                   Diagrams of models                                draft
Groups                            Arbitrary grouping of components                  draft
Distributions                     Numerical values as statistical distributions     in dev
Arrays & sets                     Arrays or sets of entities                        in dev
Dynamic structures                Creation & destruction of components              in dev
Annotations                       Richer annotation syntax
SBML funding sources over the past 13+ years
National Institute of General Medical Sciences (USA)
European Molecular Biology Laboratory (EMBL)
JST ERATO Kitano Symbiotic Systems Project (Japan) (to 2003)
JST ERATO-SORST Program (Japan)
ELIXIR (UK)
Beckman Institute, Caltech (USA)
Keio University (Japan)
International Joint Research Program of NEDO (Japan)
Japanese Ministry of Agriculture
Japanese Ministry of Educ., Culture, Sports, Science and Tech.
BBSRC (UK)
National Science Foundation (USA)
DARPA IPTO Bio-SPICE Bio-Computation Program (USA)
Air Force Office of Scientific Research (USA)
STRI, University of Hertfordshire (UK)
Molecular Sciences Institute (USA)
Complementary efforts: MIRIAM and SED-ML
Modelers want to use their own conventions
• No standard identifiers
• Low info content
Raw models alone are insufficient
Need standard schemes for machine-readable annotations
• Identify entities
• Mathematical semantics
• Links to other data resources
• Authorship & pub. info
MIRIAM (Minimum Information Requested In the Annotation of Models)
Addresses 2 general areas of annotation needs:
• Requirements for reference correspondence
• Scheme for encoding annotations
- Annotations for attributing model creators & sources
- Annotations for referring to external data resources
MIRIAM is not specific to SBML
Example of a problem that can be solved with annotations
http://www.ebi.ac.uk/chebi
• Low info content
• Example: salicylic acid
• Known by different names: do you want to write all of them into your model?
MIRIAM annotations for external references
Goal: link model constituents to corresponding entities in
bioinformatics resources (e.g., databases, controlled vocabularies)
• Supports:
- Precise identification of model constituents
- Discovery of models that concern the same thing
- Comparison of model constituents between different models
MIRIAM approach avoids putting data content directly in the model
• Instead, it points at external resources that contain the data
How do we create globally unique identifiers consistently?
Long story short: developed by the Le Novère group at the EBI
• Resource identifiers (URIs) combine 2 parts:
- Namespace: identifies a dataset
- Entity identifier: identifies a datum within the dataset
• There’s a registry for namespaces: MIRIAM Registry
- Allows people & software to use same namespace identifiers
• There’s a URI resolution service: MIRIAM Resources & identifiers.org
- Allows people & software to take a given identifier and figure out what it points to
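The two-part structure of a resource identifier can be sketched in a few lines of Python. This assumes the identifiers.org path layout of namespace followed by entity identifier. The ChEBI accession below is a placeholder that has not been looked up, and the function names are made up for this illustration.

```python
IDENTIFIERS_ORG = "http://identifiers.org"


def make_uri(namespace, entity_id):
    """Combine a namespace (the dataset) and an entity identifier (the datum)."""
    if not namespace or not entity_id:
        raise ValueError("both a namespace and an entity identifier are required")
    return f"{IDENTIFIERS_ORG}/{namespace}/{entity_id}"


def split_uri(uri):
    """Recover (namespace, entity identifier) from a URI built by make_uri."""
    prefix = IDENTIFIERS_ORG + "/"
    if not uri.startswith(prefix):
        raise ValueError(f"not an identifiers.org URI: {uri}")
    namespace, _, entity_id = uri[len(prefix):].partition("/")
    if not namespace or not entity_id:
        raise ValueError(f"missing namespace or entity identifier: {uri}")
    return namespace, entity_id


# Placeholder accession, for illustration only.
uri = make_uri("chebi", "CHEBI:12345")
parts = split_uri(uri)
```

A model then only needs to carry the URI; the names and other data stay in the external resource it points to.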
Another problem: software can’t read figure legends
BIOMD0000000319 in BioModels Database
Decroly & Goldbeter, PNAS, 1982
SED-ML = Simulation Experiment Description ML
Application-independent format
• Captures procedures, algorithms, parameter values
Can be used for
• Simulation experiments encoding parametrizations & perturbations
• Simulations using more than one model and/or method
• Data manipulations to produce plot(s)
http://sedml.org
[Diagram: SED-ML components: Model, Simulation, Task, Data generators, Reports]
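One way to picture how the five components fit together is as a small set of cross-referencing records. The sketch below is a plain-Python mirror of the diagram's boxes, not SED-ML's XML syntax. The field names, ids, and file name are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Experiment:
    """Toy stand-in for a simulation experiment description."""
    models: dict = field(default_factory=dict)           # model id -> source (e.g. an SBML file)
    simulations: dict = field(default_factory=dict)      # simulation id -> settings
    tasks: dict = field(default_factory=dict)            # task id -> (model id, simulation id)
    data_generators: dict = field(default_factory=dict)  # generator id -> (task id, variable name)
    reports: dict = field(default_factory=dict)          # report id -> list of generator ids

    def dangling_references(self):
        """Return descriptions of ids that are referenced but never defined."""
        problems = []
        for task_id, (model_id, sim_id) in self.tasks.items():
            if model_id not in self.models:
                problems.append(f"task {task_id}: unknown model {model_id}")
            if sim_id not in self.simulations:
                problems.append(f"task {task_id}: unknown simulation {sim_id}")
        for gen_id, (task_id, _variable) in self.data_generators.items():
            if task_id not in self.tasks:
                problems.append(f"data generator {gen_id}: unknown task {task_id}")
        for report_id, gen_ids in self.reports.items():
            for gen_id in gen_ids:
                if gen_id not in self.data_generators:
                    problems.append(f"report {report_id}: unknown data generator {gen_id}")
        return problems


# Hypothetical experiment: one model, one time-course run, one report.
experiment = Experiment(
    models={"m1": "model.xml"},
    simulations={"s1": {"kind": "time course", "start": 0.0, "end": 100.0, "points": 1000}},
    tasks={"t1": ("m1", "s1")},
    data_generators={"time": ("t1", "time"), "mRNAc": ("t1", "mRNAc")},
    reports={"r1": ["time", "mRNAc"]},
)
broken = Experiment(tasks={"t1": ("m1", "s1")})
```

Keeping the model, the simulation settings, and the output definitions as separate records is what lets one experiment reuse several models or methods.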
Efforts like SED-ML improve reproducibility of publications
Waltemath et al.,
BMC Sys Bio 5, 2011.
COMBINE: the Computational Modeling in Biology Network
Need interoperable formats, but developing them is not easy
Need people with a diverse set of knowledge & skills
• Scientific needs
• Technical implementation skills
• Practical experience
Need to manage multiple phases of a standardization effort
• Creation
• Evolution
• Support
This is just for the specification of the standards, to say nothing of the necessary software and other infrastructure!
Motivations for the creation of COMBINE
Realizations about the state of affairs in the late 2000s
• Many standardization efforts overlapped, but lacked coordination
• Efforts were inventing their own processes from scratch
• Many individual meetings meant more travel for many people
• Limited and fragile funding didn’t support solid, coherent base
COMBINE = Computational Modeling in Biology Network
• Coordinate standards development
• Develop common procedures & tools (but not impose them!)
• Coordinate meetings
• Provide a recognized voice
Standardization efforts represented in COMBINE today
[Figure: logos grouped as COMBINE Standards, Associated Standardization Efforts, and Related Standardization Efforts; legible names include BioPAX, Qualifiers, and GPML]
COMBINE formats cover many types of models
– from Nicolas Le Novère
Examples of community organization
Two main annual meetings, plus ad hoc workshops
• COMBINE meeting: status updates, presentations, outreach
- Next COMBINE: Paris, Sep 16–20, 2013
• HARMONY: Hackathon on Resources for Modeling in Biology
- Software development, interoperability hacking
[Photos: COMBINE 2012, Toronto; COMBINE 2011, Heidelberg]
COMBINE is open to all—and COMBINE needs you!
http://co.mbine.org
Current coordinators:
• Nicolas Le Novère, Mike Hucka, Falk Schreiber, Gary Bader
Conclusion
Some things we (maybe?) got right with SBML
Time it well
• Too early and too late are bad
Start with actual stakeholders
• Address real needs, not perceived ones
Start with small team of dedicated developers
• Can work faster, more focused; also avoids “designed-by-committee”
Engage people constantly, in many ways
• Electronic forums, email, electronic voting, surveys, hackathons
Make the results free and open-source
• Makes people comfortable knowing it will always be available
Be creative about seeking funding
Some things we certainly got wrong
Not waiting for implementations before freezing specifications
• Sometimes finalized specification before implementations tested it
- Especially bad when we failed to do a good job
‣ E.g., “forward thinking” features, or “elegant” designs
Not formalizing the development process sufficiently
• Especially early in the history, did not have a very open process
Not resolving intellectual property issues from the beginning
• Industrial users ask “who has the right to give any rights to this?”
This work was made possible thanks to a great community
Original PIs: John C. Doyle, Hiroaki Kitano
SBML Team: Mike Hucka, Sarah Keating, Frank Bergmann, Lucian Smith, Andrew Finney, Herbert Sauro, Hamid Bolouri, Ben Bornstein, Bruce Shapiro, Akira Funahashi, Akiya Juraku, Ben Kovitz
SBML Editors: Mike Hucka, Nicolas Le Novère, Sarah Keating, Frank Bergmann, Lucian Smith, Chris Myers, Stefan Hoops, Sven Sahle, James Schaff, Darren Wilkinson
BioModels DB: Nicolas Le Novère, Henning Hermjakob, Camille Laibe, Chen Li, Lukas Endler, Nico Rodriguez, Marco Donizelli, Viji Chelliah, Mélanie Courtot, Harish Dharuri
[Photo: Attendees at SBML 10th Anniversary Symposium, Edinburgh, 2010]
And a huge thanks to many others in the COMBINE community
URLs
SBML: http://sbml.org
BioModels Database: http://biomodels.net/biomodels
MIRIAM: http://biomodels.net/miriam
identifiers.org: http://identifiers.org
SED-ML: http://biomodels.net/sed-ml
SBO: http://biomodels.net/sbo
SBGN: http://sbgn.org
COMBINE: http://co.mbine.org
I’d like your feedback!
You can use this anonymous form:
http://tinyurl.com/mhuckafeedback

Contenu connexe

Tendances (20)

Protein structure prediction (1)
Protein structure prediction (1)Protein structure prediction (1)
Protein structure prediction (1)
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
 
Bioinformatics and functional genomics
Bioinformatics and functional genomicsBioinformatics and functional genomics
Bioinformatics and functional genomics
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)
 
The uni prot knowledgebase
The uni prot knowledgebaseThe uni prot knowledgebase
The uni prot knowledgebase
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Prosite
PrositeProsite
Prosite
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Protein Threading
Protein ThreadingProtein Threading
Protein Threading
 
Est database
Est databaseEst database
Est database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)Brief Introduction of Protein-Protein Interactions (PPIs)
Brief Introduction of Protein-Protein Interactions (PPIs)
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
System's Biology
System's Biology System's Biology
System's Biology
 
Proteome analysis
Proteome analysisProteome analysis
Proteome analysis
 

Similaire à Computational Approaches to Systems Biology

Creating a new language to support open innovation
Creating a new language to support open innovationCreating a new language to support open innovation
Creating a new language to support open innovationMike Hucka
 
A new language for a new biology: How SBML and other tools are transforming m...
A new language for a new biology: How SBML and other tools are transforming m...A new language for a new biology: How SBML and other tools are transforming m...
A new language for a new biology: How SBML and other tools are transforming m...Mike Hucka
 
Recent developments in the world of SBML (the Systems Biology Markup Language)
Recent developments in the world of SBML (the Systems Biology Markup Language) Recent developments in the world of SBML (the Systems Biology Markup Language)
Recent developments in the world of SBML (the Systems Biology Markup Language) Mike Hucka
 
A summary of various COMBINE standardization activities
A summary of various COMBINE standardization activitiesA summary of various COMBINE standardization activities
A summary of various COMBINE standardization activitiesMike Hucka
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesMike Hucka
 
SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)Mike Hucka
 
Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...Mike Hucka
 
Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...Mike Hucka
 
Brief Review of Common Modeling Formalisms and Representation Approaches
Brief Review of Common Modeling Formalisms and Representation ApproachesBrief Review of Common Modeling Formalisms and Representation Approaches
Brief Review of Common Modeling Formalisms and Representation ApproachesMike Hucka
 
Recent software and services to support the SBML community
Recent software and services to support the SBML community Recent software and services to support the SBML community
Recent software and services to support the SBML community Mike Hucka
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environmentJonathan Blakes
 
2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dcc.titus.brown
 
A status update on COMBINE standardization activities, with a focus on SBML
A status update on COMBINE standardization activities, with a focus on SBMLA status update on COMBINE standardization activities, with a focus on SBML
A status update on COMBINE standardization activities, with a focus on SBMLMike Hucka
 
SBML: What Is It About?
SBML: What Is It About?SBML: What Is It About?
SBML: What Is It About?Mike Hucka
 
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and more
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and moreSBML, SBML Packages, SED-ML, 
 COMBINE Archive, and more
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and moreMike Hucka
 
Recent Developments in SBML
Recent Developments in SBMLRecent Developments in SBML
Recent Developments in SBMLMike Hucka
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsFrancesca Lazzeri, PhD
 
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Sri Ambati
 
Reference Representation in Large Metamodel-based Datasets
Reference Representation in Large Metamodel-based DatasetsReference Representation in Large Metamodel-based Datasets
Reference Representation in Large Metamodel-based DatasetsMarkus Scheidgen
 

Similaire à Computational Approaches to Systems Biology (20)

Creating a new language to support open innovation
Creating a new language to support open innovationCreating a new language to support open innovation
Creating a new language to support open innovation
 
A new language for a new biology: How SBML and other tools are transforming m...
A new language for a new biology: How SBML and other tools are transforming m...A new language for a new biology: How SBML and other tools are transforming m...
A new language for a new biology: How SBML and other tools are transforming m...
 
Recent developments in the world of SBML (the Systems Biology Markup Language)
Recent developments in the world of SBML (the Systems Biology Markup Language) Recent developments in the world of SBML (the Systems Biology Markup Language)
Recent developments in the world of SBML (the Systems Biology Markup Language)
 
A summary of various COMBINE standardization activities
A summary of various COMBINE standardization activitiesA summary of various COMBINE standardization activities
A summary of various COMBINE standardization activities
 
SBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resourcesSBML (the Systems Biology Markup Language), model databases, and other resources
SBML (the Systems Biology Markup Language), model databases, and other resources
 
SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)
 
Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...Common ground between modelers and simulation software: the Systems Biology M...
Common ground between modelers and simulation software: the Systems Biology M...
 
Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...Standards and software: practical aids for reproducibility of computational r...
Standards and software: practical aids for reproducibility of computational r...
 
Brief Review of Common Modeling Formalisms and Representation Approaches
Brief Review of Common Modeling Formalisms and Representation ApproachesBrief Review of Common Modeling Formalisms and Representation Approaches
Brief Review of Common Modeling Formalisms and Representation Approaches
 
Recent software and services to support the SBML community
Recent software and services to support the SBML community Recent software and services to support the SBML community
Recent software and services to support the SBML community
 
20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment
 
2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc2013 nas-ehs-data-integration-dc
2013 nas-ehs-data-integration-dc
 
A status update on COMBINE standardization activities, with a focus on SBML
A status update on COMBINE standardization activities, with a focus on SBMLA status update on COMBINE standardization activities, with a focus on SBML
A status update on COMBINE standardization activities, with a focus on SBML
 
SBML: What Is It About?
SBML: What Is It About?SBML: What Is It About?
SBML: What Is It About?
 
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and more
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and moreSBML, SBML Packages, SED-ML, 
 COMBINE Archive, and more
SBML, SBML Packages, SED-ML, 
 COMBINE Archive, and more
 
Recent Developments in SBML
Recent Developments in SBMLRecent Developments in SBML
Recent Developments in SBML
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
 
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
 
SECh78
SECh78SECh78
SECh78
 
Reference Representation in Large Metamodel-based Datasets
Reference Representation in Large Metamodel-based DatasetsReference Representation in Large Metamodel-based Datasets
Reference Representation in Large Metamodel-based Datasets
 

Plus de Mike Hucka

Caltech DIBS: Digital Borrowing System
Caltech DIBS: Digital Borrowing SystemCaltech DIBS: Digital Borrowing System
Caltech DIBS: Digital Borrowing SystemMike Hucka
 
Finding the right wheel
Finding the right wheelFinding the right wheel
Finding the right wheelMike Hucka
 
Introduction to Satellite Meeting on Overview and Use of Standards and Format...
Introduction to Satellite Meeting on Overview and Use of Standards and Format...Introduction to Satellite Meeting on Overview and Use of Standards and Format...
Introduction to Satellite Meeting on Overview and Use of Standards and Format...Mike Hucka
 
What is "COMBINE"?
What is "COMBINE"?What is "COMBINE"?
What is "COMBINE"?Mike Hucka
 
Some SBML-related resources at SBML.org
Some SBML-related resources at SBML.orgSome SBML-related resources at SBML.org
Some SBML-related resources at SBML.orgMike Hucka
 
Reproducibility of computational research: methods to avoid madness (Session ...
Reproducibility of computational research: methods to avoid madness (Session ...Reproducibility of computational research: methods to avoid madness (Session ...
Reproducibility of computational research: methods to avoid madness (Session ...Mike Hucka
 
Update on SBML for Tuesday Sep. 17 (COMBINE 2013)
Update on SBML for Tuesday Sep. 17 (COMBINE 2013)Update on SBML for Tuesday Sep. 17 (COMBINE 2013)
Update on SBML for Tuesday Sep. 17 (COMBINE 2013)Mike Hucka
 
Systems Biology Systems
Systems Biology SystemsSystems Biology Systems
Systems Biology SystemsMike Hucka
 
A Profile of Today's SBML-Compatible Software
A Profile of Today's SBML-Compatible SoftwareA Profile of Today's SBML-Compatible Software
A Profile of Today's SBML-Compatible SoftwareMike Hucka
 
Retrospective about SBML on the occasion of the 10th Anniversary of SBML
Retrospective about SBML on the occasion of the 10th Anniversary of SBMLRetrospective about SBML on the occasion of the 10th Anniversary of SBML
Retrospective about SBML on the occasion of the 10th Anniversary of SBMLMike Hucka
 
SBML and related resources 
and standardization efforts
SBML and related resources 
and standardization effortsSBML and related resources 
and standardization efforts
SBML and related resources 
and standardization effortsMike Hucka
 
SBML (the Systems Biology Markup Language), BioModels Database, and related r...
SBML (the Systems Biology Markup Language), BioModels Database, and related r...SBML (the Systems Biology Markup Language), BioModels Database, and related r...
SBML (the Systems Biology Markup Language), BioModels Database, and related r...Mike Hucka
 
Finding common ground between modelers and simulation software in systems bio...
Finding common ground between modelers and simulation software in systems bio...Finding common ground between modelers and simulation software in systems bio...
Finding common ground between modelers and simulation software in systems bio...Mike Hucka
 
SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)SBML (the Systems Biology Markup Language)
SBML (the Systems Biology Markup Language)Mike Hucka
 
General updates about SBML and SBML Team activities
General updates about SBML and SBML Team activitiesGeneral updates about SBML and SBML Team activities
General updates about SBML and SBML Team activitiesMike Hucka
 
Software for SBML Today
Software for SBML TodaySoftware for SBML Today
Software for SBML TodayMike Hucka
 



Computational Approaches to Systems Biology

  • 1. Computational Approaches to Systems Biology Michael Hucka, Ph.D. Department of Computing + Mathematical Sciences California Institute of Technology Pasadena, CA, USA The Kinghorn Cancer Centre, Australia, August 2013 Email: mhucka@caltech.edu Twitter: @mhucka
  • 2. Outline Background and introduction The Systems Biology Markup Language (SBML) Complementary efforts: MIRIAM and SED-ML COMBINE: the Computational Modeling in Biology Network Conclusion
  • 4. Research today: experimentation, computation, cogitation
  • 5. “The nature of systems biology,” Bruggeman & Westerhoff, Trends Microbiol. 15 (2007).
  • 7. Many models have traditionally been published this way Problems: • Errors in printing • Missing information • Dependencies on implementation • Outright errors • Can be a huge effort to recreate Is it enough to communicate the model in a paper?
  • 8. Is it enough to make your (software X) code available? It’s vital for good science: • Someone with access to the same software can try to run it, understand it, verify the computational results, build on them, etc. • Opinion: you should always do this in any case
  • 9. Is it enough to make your (software X) code available? It’s vital for good science— • Someone with access to the same software can try to run it, understand it, build on it, etc. • Opinion: you should always do this in any case But it’s still not ideal for communication of scientific results: • Doesn’t necessarily encode biological semantics of the model • What if they don’t have access to the same software? • What if they don’t want to use that software? • What if they want to use a different conceptual framework? • And how will people be able to relate the model to other work?
  • 10. Different tools, different interfaces & languages
  • 11. Outline Background and introduction The Systems Biology Markup Language (SBML) Complementary efforts: MIRIAM and SED-ML COMBINE: the Computational Modeling in Biology Network Conclusion
  • 13. SBML = Systems Biology Markup Language Format for representing computational models of biological processes • Data structures + usage principles + serialization to XML • (Mostly) declarative, not procedural; not a scripting language Neutral with respect to modeling framework • E.g., ODE, stochastic systems, etc. Important: software reads/writes SBML, not humans
  • 14. The raw SBML (as XML)
  • 15. Core SBML concepts are fairly simple The process is central • Literally called a “reaction” in SBML • Participants are pools of entities (biochemical species) Models can further include: • Compartments • Other constants & variables • Discontinuous events • Other, explicit math • Unit definitions • Annotations
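As a rough illustration of the core concepts on slide 15, the sketch below embeds a tiny hand-written SBML Level 3 document and lists its compartments, species, and reactions using only Python's standard library. The model content (ids such as `nucleus`, `mRNAn`, and `export`) is invented for illustration and does not come from the talk; a real application would more likely rely on a dedicated library such as libSBML.

```python
import xml.etree.ElementTree as ET

# Namespace of SBML Level 3 Version 1 Core.
SBML_NS = "http://www.sbml.org/sbml/level3/version1/core"

# Hypothetical toy model: one transport reaction between two compartments.
EXAMPLE_SBML = """<?xml version="1.0" encoding="UTF-8"?>
<sbml xmlns="http://www.sbml.org/sbml/level3/version1/core" level="3" version="1">
  <model id="toy_model">
    <listOfCompartments>
      <compartment id="nucleus" constant="true"/>
      <compartment id="cytoplasm" constant="true"/>
    </listOfCompartments>
    <listOfSpecies>
      <species id="mRNAn" compartment="nucleus" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
      <species id="mRNAc" compartment="cytoplasm" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
    </listOfSpecies>
    <listOfReactions>
      <reaction id="export" reversible="false" fast="false">
        <listOfReactants>
          <speciesReference species="mRNAn" stoichiometry="1" constant="true"/>
        </listOfReactants>
        <listOfProducts>
          <speciesReference species="mRNAc" stoichiometry="1" constant="true"/>
        </listOfProducts>
      </reaction>
    </listOfReactions>
  </model>
</sbml>
"""


def summarize(sbml_text):
    """Return (compartment ids, {species id: compartment}, {reaction id: (reactants, products)})."""
    root = ET.fromstring(sbml_text.encode("utf-8"))
    ns = {"s": SBML_NS}
    model = root.find("s:model", ns)
    compartments = [c.get("id") for c in model.findall("s:listOfCompartments/s:compartment", ns)]
    species = {
        sp.get("id"): sp.get("compartment")
        for sp in model.findall("s:listOfSpecies/s:species", ns)
    }
    reactions = {}
    for rxn in model.findall("s:listOfReactions/s:reaction", ns):
        reactants = [r.get("species") for r in rxn.findall("s:listOfReactants/s:speciesReference", ns)]
        products = [p.get("species") for p in rxn.findall("s:listOfProducts/s:speciesReference", ns)]
        reactions[rxn.get("id")] = (reactants, products)
    return compartments, species, reactions


if __name__ == "__main__":
    print(summarize(EXAMPLE_SBML))
```

The reaction here has its reactant in one compartment and its product in another, which is the kind of boundary-crossing process the following slides describe.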
  • 16. Some basics of SBML core model encoding: well-stirred compartments [Figure: two compartments labeled c and n]
  • 17. Species pools are located in compartments [Figure: compartments c and n containing the species protein A, protein B, gene, mRNAn, and mRNAc]
  • 18. Reactions can involve any species anywhere [Figure: the same compartments and species]
  • 19. Reactions can cross compartment boundaries [Figure: the same compartments and species]
  • 20. Reaction/process rates can be (almost) arbitrary formulas [Figure: the same compartments and species, with rate formulas f1(x) through f5(x)]
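To make the idea of a rate formula concrete, here is a minimal sketch that simulates one hypothetical transport reaction, mRNAn → mRNAc, with a user-supplied rate function and a simple forward-Euler step. The function names, the mass-action formula, and all numeric values are assumptions chosen for illustration; the slides do not specify any particular rate law or integrator, and SBML-aware simulators use far more robust numerical methods.

```python
import math


def simulate_transport(mrna_n0, mrna_c0, rate_fn, dt, steps):
    """Forward-Euler simulation of a single reaction mRNAn -> mRNAc.

    rate_fn(mrna_n, mrna_c) plays the role of the reaction's rate formula f(x);
    it can be (almost) any expression of the current state.
    """
    mrna_n, mrna_c = mrna_n0, mrna_c0
    for _ in range(steps):
        flux = rate_fn(mrna_n, mrna_c)  # amount transferred per unit time
        mrna_n -= flux * dt             # reactant pool shrinks
        mrna_c += flux * dt             # product pool grows by the same amount
    return mrna_n, mrna_c


def mass_action(k_export):
    """Hypothetical rate law: flux proportional to the nuclear mRNA amount."""
    return lambda mrna_n, mrna_c: k_export * mrna_n


if __name__ == "__main__":
    n, c = simulate_transport(100.0, 0.0, mass_action(0.5), dt=0.01, steps=1000)
    print(n, c, 100.0 * math.exp(-0.5 * 10.0))  # Euler result vs. exact decay
```

Swapping `mass_action` for another callable changes the kinetics without touching the simulation loop, which mirrors how a declarative model keeps formulas separate from the solver.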
  • 21. “Rules”: equations expressing relationships in addition to the reaction system [Figure: the same compartments and species, with rate formulas f1(x) through f5(x) and rules g1(x), g2(x), ...]
  • 22. “Events”: discontinuous actions triggered by system conditions Event1: when (...condition...), do (...assignments...) Event2: when (...condition...), do (...assignments...) ... [Figure: the same compartments and species, with rate formulas f1(x) through f5(x) and rules g1(x), g2(x), ...]
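The "when (condition), do (assignments)" pattern can be sketched as a small loop in which each event fires when its trigger changes from false to true. Everything below (the function name, the state dictionary, the reset-at-threshold example) is hypothetical and only illustrates the pattern; it is not how any particular SBML simulator implements events.

```python
def run_with_events(state, step_fn, events, steps):
    """Advance `state` with step_fn and fire events on false -> true trigger transitions.

    events is a list of (trigger, assign) pairs:
      trigger(state) -> bool   the event's condition
      assign(state)  -> None   the assignments performed when the event fires
    Returns a list of (step index, event index) pairs recording each firing.
    """
    previous = [trigger(state) for trigger, _ in events]
    fired = []
    for i in range(steps):
        step_fn(state)
        for j, (trigger, assign) in enumerate(events):
            now = trigger(state)
            if now and not previous[j]:  # condition just became true
                assign(state)
                fired.append((i, j))
            previous[j] = now
    return fired


def grow(state):
    """Hypothetical continuous dynamics, reduced to a unit increment per step."""
    state["x"] += 1


def reset(state):
    """Hypothetical event assignment: reset x to zero."""
    state["x"] = 0


if __name__ == "__main__":
    state = {"x": 0}
    fired = run_with_events(state, grow, [(lambda s: s["x"] >= 3, reset)], steps=10)
    print(fired, state)
```

Tracking the previous trigger value is what makes the action discontinuous and one-shot: the assignment runs once per transition rather than on every step where the condition holds.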
  • 23. Annotations: machine-readable semantics and links to other resources • “This event represents ...” • “This is identified by GO id # ...” • “This is an enzymatic reaction with EC # ...” • “This is a transport into the nucleus ...” • “This compartment represents the nucleus ...” [Figure: the same diagram and events as on the previous slide, with these annotations attached to model elements]
  • 25. Contents of BioModels Database Contents today (database data from 2013): • 142,000+ pathway models (converted from KEGG) • 460+ hand-curated quantitative models • 460+ non-curated quantitative models [Pie chart of model categories: signal transduction, metabolic process, multicellular organismal process, rhythmic process, cell cycle, homeostatic process, response to stimulus, cell death, localization, others (e.g., developmental process); slice values as extracted: 8%, 2%, 3%, 6%, 6%, 7%, 8%, 9%, 24%, 27%]
  • 26. Find software in the SBML Software Guide
  • 27. Find SBML software Find software in the SBML Software Guide
  • 28. Results of 2011 survey of SBML-compatible software Question: Which of the following categories best describe your software? (Check all that apply.) Out of 81 responses [Bar chart categories: Simulation software; Analysis s/w (in addition, or instead of, simulation); Creation/model development software; Visualization/display/formatting software; Utility software (e.g., format conversion); Data integration and management software; Repository or database; Framework or library (for use in developing s/w); S/w for interactive env. (e.g., MATLAB, R, ...); Annotation software. Bar values as extracted: 11, 13, 13, 14, 16, 23, 31, 31, 40, 42]
  • 29. Some particularly full-featured, general simulation tools • COPASI: ODE & stochastic simulation, parameter scanning, plotting • Virtual Cell: web-based environment, spatial models • iBioSim: special features for genetic circuit models for synthetic biology • SBW (Systems Biology Workbench): component-based toolkit • SBMLsimulator: Java-based simulator, web-start or stand-alone • CellDesigner: graphical editing, SBGN support, SABIO-RK integration
  • 30. Free software libraries – libSBML • Reads, writes, validates SBML • Can check & convert units • Written in portable C++ • Runs on Linux, Mac, Windows • APIs for C, C++, C#, Java, Octave, Perl, Python, R, Ruby, MATLAB • Well documented API • Open-source (LGPL) http://sbml.org/Software/libSBML
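For readers who want to try libSBML's Python API, the following sketch reads a small SBML string and reports a few counts. It assumes the `python-libsbml` package is installed and returns `None` otherwise; the toy model and the function name `summarize_with_libsbml` are invented for illustration, and the libSBML documentation remains the authoritative reference for the API.

```python
# Hypothetical minimal model: one compartment containing one species.
TINY_SBML = """<?xml version="1.0" encoding="UTF-8"?>
<sbml xmlns="http://www.sbml.org/sbml/level3/version1/core" level="3" version="1">
  <model id="tiny">
    <listOfCompartments>
      <compartment id="c" constant="true"/>
    </listOfCompartments>
    <listOfSpecies>
      <species id="A" compartment="c" hasOnlySubstanceUnits="false"
               boundaryCondition="false" constant="false"/>
    </listOfSpecies>
  </model>
</sbml>
"""


def summarize_with_libsbml(sbml_text):
    """Return basic counts for an SBML document, or None if libSBML is unavailable."""
    try:
        import libsbml  # assumption: installed, e.g. via the python-libsbml package
    except ImportError:
        return None
    doc = libsbml.readSBMLFromString(sbml_text)
    model = doc.getModel()
    if model is None:
        return {"read_errors": doc.getNumErrors(), "compartments": 0, "species": 0, "reactions": 0}
    return {
        "read_errors": doc.getNumErrors(),          # problems logged while reading
        "compartments": model.getNumCompartments(),
        "species": model.getNumSpecies(),
        "reactions": model.getNumReactions(),
    }


if __name__ == "__main__":
    print(summarize_with_libsbml(TINY_SBML))
```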
  • 31. Evolution of SBML continues Today: SBML Level 3 • Level 3 Core provides framework for common models • Level 3 packages add additional constructs to the Core
  • 32. Level 3 package: what it enables (status) • Hierarchical model composition: models containing submodels (✔) • Flux balance constraints: constraint-based models (✔) • Qualitative models: Petri net models, Boolean models (✔) • Graph layout: diagrams of models (✔) • Multicomponent/state species: entities w/ structure; also rule-based models (draft) • Spatial: nonhomogeneous spatial models (draft) • Graph rendering: diagrams of models (draft) • Groups: arbitrary grouping of components (draft) • Distributions: numerical values as statistical distributions (in dev) • Arrays & sets: arrays or sets of entities (in dev) • Dynamic structures: creation & destruction of components (in dev) • Annotations: richer annotation syntax
  • 33. SBML funding sources over the past 13+ years • National Institute of General Medical Sciences (USA) • European Molecular Biology Laboratory (EMBL) • JST ERATO Kitano Symbiotic Systems Project (Japan) (to 2003) • JST ERATO-SORST Program (Japan) • ELIXIR (UK) • Beckman Institute, Caltech (USA) • Keio University (Japan) • International Joint Research Program of NEDO (Japan) • Japanese Ministry of Agriculture • Japanese Ministry of Educ., Culture, Sports, Science and Tech. • BBSRC (UK) • National Science Foundation (USA) • DARPA IPTO Bio-SPICE Bio-Computation Program (USA) • Air Force Office of Scientific Research (USA) • STRI, University of Hertfordshire (UK) • Molecular Sciences Institute (USA)
  • 34. Outline Background and introduction The Systems Biology Markup Language (SBML) Complementary efforts: MIRIAM and SED-ML COMBINE: the Computational Modeling in Biology Network Conclusion
  • 38. Raw models alone are insufficient Need standard schemes for machine-readable annotations • Identify entities • Mathematical semantics • Links to other data resources • Authorship & pub. info Modelers want to use their own conventions • Low info content • No standard identifiers
  • 39. MIRIAM (Minimum Information Requested In the Annotation of Models) Addresses 2 general areas of annotation needs: • Requirements for reference correspondence • Scheme for encoding annotations - Annotations for attributing model creators & sources - Annotations for referring to external data resources MIRIAM is not specific to SBML
  • 40. MIRIAM (Minimum Information Requested In the Annotation of Models) Addresses 2 general areas of annotation needs: • Requirements for reference correspondence • Scheme for encoding annotations - Annotations for attributing model creators & sources - Annotations for referring to external data resources MIRIAM is not specific to SBML [Highlighted on this slide: annotations for referring to external data resources]
  • 41. Example of a problem that can be solved with annotations http://www.ebi.ac.uk/chebi Low info content
  • 42. Example of a problem that can be solved with annotations Low info content. The same compound (here, salicylic acid) is known by different names: do you want to write all of them into your model? http://www.ebi.ac.uk/chebi
  • 43. MIRIAM annotations for external references Goal: link model constituents to corresponding entities in bioinformatics resources (e.g., databases, controlled vocabularies) • Supports: - Precise identification of model constituents - Discovery of models that concern the same thing - Comparison of model constituents between different models MIRIAM approach avoids putting data content directly in the model • Instead, it points at external resources that contain the data
  • 44. How do we create globally unique identifiers consistently? Long story short: developed by the Le Novère group at the EBI • Resource identifiers (URIs) combine 2 parts: - namespace: identifies a dataset - entity identifier: identifies a datum within the dataset • There’s a registry for namespaces: MIRIAM Registry - Allows people & software to use same namespace identifiers • There’s a URI resolution service: MIRIAM Resources & identifiers.org - Allows people & software to take a given identifier and figure out what it points to
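The two-part structure of these identifiers can be sketched with a pair of helper functions that join and split a namespace and an entity identifier. The URI pattern `http://identifiers.org/<namespace>/<entity>`, the namespace string `go`, and the example term `GO:0005634` (the Gene Ontology term for the nucleus) are assumptions used for illustration; the actual namespace strings should be taken from the registry, and the function names are hypothetical.

```python
# Assumed base of the resolvable URIs; check the registry for the current form.
IDENTIFIERS_ORG_PREFIX = "http://identifiers.org/"


def make_uri(namespace, entity_id):
    """Combine a dataset namespace and an entity identifier into one URI."""
    if not namespace or "/" in namespace:
        raise ValueError("namespace must be a non-empty string without '/'")
    if not entity_id:
        raise ValueError("entity identifier must be non-empty")
    return f"{IDENTIFIERS_ORG_PREFIX}{namespace}/{entity_id}"


def split_uri(uri):
    """Recover (namespace, entity identifier) from a URI built by make_uri."""
    if not uri.startswith(IDENTIFIERS_ORG_PREFIX):
        raise ValueError("not an identifiers.org-style URI")
    namespace, _, entity_id = uri[len(IDENTIFIERS_ORG_PREFIX):].partition("/")
    if not namespace or not entity_id:
        raise ValueError("URI is missing a namespace or an entity identifier")
    return namespace, entity_id


if __name__ == "__main__":
    uri = make_uri("go", "GO:0005634")
    print(uri)
    print(split_uri(uri))
```

Because the model stores only the URI, two models that annotate a compartment with the same URI can be recognized as referring to the same thing without any name matching.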
  • 45. Another problem: software can’t read figure legends [Figure: BIOMD0000000319 in BioModels Database; Decroly & Goldbeter, PNAS, 1982]
  • 46. SED-ML = Simulation Experiment Description ML Application-independent format • Captures procedures, algorithms, parameter values Can be used for • Simulation experiments encoding parametrizations & perturbations • Simulations using more than one model and/or method • Data manipulations to produce plot(s) http://sedml.org [Figure labels: Simulation, Model, Task, Data generators, Reports]
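One way to picture how the five parts named on the slide relate is a plain data structure in which tasks pair a model with a simulation, data generators draw on tasks, and reports collect data generators. The sketch below is not SED-ML syntax and not the API of any SED-ML library; the class, field names, and example values (such as `model.xml`) are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Experiment:
    """Illustrative container mirroring the parts of a simulation experiment."""
    models: dict = field(default_factory=dict)           # model id -> model source
    simulations: dict = field(default_factory=dict)      # simulation id -> settings
    tasks: dict = field(default_factory=dict)            # task id -> (model id, simulation id)
    data_generators: dict = field(default_factory=dict)  # generator id -> (task id, variable)
    reports: dict = field(default_factory=dict)          # report id -> list of generator ids


def dangling_references(exp):
    """List references that point at ids not defined in the experiment."""
    problems = []
    for task_id, (model_id, sim_id) in exp.tasks.items():
        if model_id not in exp.models:
            problems.append(f"task {task_id}: unknown model {model_id}")
        if sim_id not in exp.simulations:
            problems.append(f"task {task_id}: unknown simulation {sim_id}")
    for gen_id, (task_id, _variable) in exp.data_generators.items():
        if task_id not in exp.tasks:
            problems.append(f"data generator {gen_id}: unknown task {task_id}")
    for report_id, gen_ids in exp.reports.items():
        for gen_id in gen_ids:
            if gen_id not in exp.data_generators:
                problems.append(f"report {report_id}: unknown data generator {gen_id}")
    return problems


if __name__ == "__main__":
    exp = Experiment(
        models={"m1": "model.xml"},  # hypothetical file name
        simulations={"s1": {"start": 0.0, "end": 10.0, "points": 100}},
        tasks={"t1": ("m1", "s1")},
        data_generators={"g1": ("t1", "mRNAc")},
        reports={"r1": ["g1"]},
    )
    print(dangling_references(exp))
```

Keeping the model, the simulation settings, and the outputs as separate, cross-referenced pieces is what lets one experiment description reuse several models or methods.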
  • 47. Efforts like SED-ML improve reproducibility of publications Waltemath et al., BMC Sys Bio 5, 2011.
  • 48. Outline Background and introduction The Systems Biology Markup Language (SBML) Complementary efforts: MIRIAM and SED-ML COMBINE: the Computational Modeling in Biology Network Conclusion
  • 49. Need interoperable formats, but developing them is not easy Need people with a diverse set of knowledge & skills • Scientific needs • Technical implementation skills • Practical experience Need to manage multiple phases of a standardization effort • Creation • Evolution • Support
  • 50. Need interoperable formats, but developing them is not easy Need people with a diverse set of knowledge & skills • Scientific needs • Technical implementation skills • Practical experience Need to manage multiple phases of a standardization effort • Creation • Evolution • Support This is just for the specification of the standards, to say nothing of the necessary software and other infrastructure!
  • 51. Motivations for the creation of COMBINE Realizations about the state of affairs in the late 2000s • Many standardization efforts overlapped, but lacked coordination • Efforts were inventing their own processes from scratch • Many individual meetings meant more travel for many people • Limited and fragile funding didn’t support solid, coherent base COMBINE = Computational Modeling in Biology Network • Coordinate standards development • Develop common procedures & tools (but not impose them!) • Coordinate meetings • Provide a recognized voice
  • 52. Standardization efforts represented in COMBINE today [Figure: logos grouped as COMBINE Standards, Associated Standardization Efforts, and Related Standardization Efforts; legible labels include BioPAX, Qualifiers, GPML]
  • 53. COMBINE formats cover many types of models – from Nicolas Le Novère
  • 54. Examples of community organization Two main annual meetings, plus ad hoc workshops • COMBINE meeting: status updates, presentations, outreach - Next COMBINE: Paris, Sep 16–20, 2013 • HARMONY: Hackathon on Resources for Modeling in Biology - Software development, interoperability hacking [Photos: COMBINE 2012, Toronto; COMBINE 2011, Heidelberg]
  • 55. COMBINE is open to all—and COMBINE needs you! http://co.mbine.org Current coordinators: • Nicolas Le Novère, Mike Hucka, Falk Schreiber, Gary Bader
  • 56. Outline Background and introduction The Systems Biology Markup Language (SBML) Complementary efforts: MIRIAM and SED-ML COMBINE: the Computational Modeling in Biology Network Conclusion
  • 57. Some things we (maybe?) got right with SBML Time it well • Too early and too late are bad Start with actual stakeholders • Address real needs, not perceived ones Start with small team of dedicated developers • Can work faster, more focused; also avoids “designed-by-committee” Engage people constantly, in many ways • Electronic forums, email, electronic voting, surveys, hackathons Make the results free and open-source • Makes people comfortable knowing it will always be available Be creative about seeking funding
  • 58. Some things we certainly got wrong Not waiting for implementations before freezing specifications • Sometimes finalized specification before implementations tested it - Especially bad when we failed to do a good job (e.g., “forward thinking” features, or “elegant” designs) Not formalizing the development process sufficiently • Especially early in the history, did not have a very open process Not resolving intellectual property issues from the beginning • Industrial users ask “who has the right to give any rights to this?”
  • 59. This work was made possible thanks to a great community Original PIs: John C. Doyle, Hiroaki Kitano SBML Team: Mike Hucka, Sarah Keating, Frank Bergmann, Lucian Smith, Andrew Finney, Herbert Sauro, Hamid Bolouri, Ben Bornstein, Bruce Shapiro, Akira Funahashi, Akiya Juraku, Ben Kovitz SBML Editors: Mike Hucka, Nicolas Le Novère, Sarah Keating, Frank Bergmann, Lucian Smith, Chris Myers, Stefan Hoops, Sven Sahle, James Schaff, Darren Wilkinson BioModels DB: Nicolas Le Novère, Henning Hermjakob, Camille Laibe, Chen Li, Lukas Endler, Nico Rodriguez, Marco Donizelli, Viji Chelliah, Mélanie Courtot, Harish Dharuri And a huge thanks to many others in the COMBINE community [Photo: attendees at SBML 10th Anniversary Symposium, Edinburgh, 2010]
  • 61. URLs • SBML: http://sbml.org • BioModels Database: http://biomodels.net/biomodels • MIRIAM: http://biomodels.net/miriam • identifiers.org: http://identifiers.org • SED-ML: http://biomodels.net/sed-ml • SBO: http://biomodels.net/sbo • SBGN: http://sbgn.org • COMBINE: http://co.mbine.org
  • 62. I’d like your feedback! You can use this anonymous form: http://tinyurl.com/mhuckafeedback