SlideShare une entreprise Scribd logo
1  sur  42
Télécharger pour lire hors ligne
Approaches for the Integration of Visual and
Computational Analysis of Biomedical Data
HARVARD MEDICAL SCHOOL
DEPARTMENT OF BIOMEDICAL INFORMATICS
NILS GEHLENBORG
@nils_gehlenborg
http://gehlenborglab.org
FRITZ LEKSCHAS
HARVARD MEDICAL SCHOOL
BIG PILES OF DATA …
Data Repositories
general specialized
ArrayExpress
GEO
Metabolights
PRIDE
dbGAP
…
ENCODE
Roadmap
Epigenomics
…
… OFFER OPPORTUNITIES …
SINGLE OR FEW DATA SETS
Test hypotheses without generating new data.
Use published data as supporting evidence for findings based on
our your own data sets.
MANY DATA SETS
Conduct meta analyses, e.g. characterize expression patterns in
human tissues or to link diseases.
M. Lukk, et al., Nature Biotechnology, 28(4):322–324 (2010)
S. Suthram et al.,PLoS Computational Biology 6(2)(2010)
SINGLE OR FEW DATA SETS
Test hypotheses without generating new data.
Use published data as supporting evidence for findings based on
our your own data sets.
MANY DATA SETS
Conduct meta analyses, e.g. characterize expression patterns in
human tissues or to link diseases.
COMMON BEHAVIOR OF RESEARCH PARASITES!
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
ANALYSIS PIPELINES
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
ANALYSIS PIPELINES
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
GALAXY Toolshed
Workflow Editor
Tools
REST
API
ANALYSIS PIPELINES
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
GALAXY Toolshed
Workflow Editor
Tools
REST
API
Workflow Inputs
Workflow Outputs
N Gehlenborg et al. , manuscript in preparation
|
DATA REPOSITORY
VISUALIZATION TOOLS
ANALYSIS PIPELINES
http://www.refinery-platform.org
… BUT NOT SO FAST!
Z
Text-Bas
Data Sets
Metadata
Data Files
X Y Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
Keywords
Z
Text-Based Search
Data Sets
Metadata
Data Files
X Y
Ontologies
Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
Terminal
Root
subclassof
Keywords
Z
Text-Based Search
Data Sets
Metadata
Data Files
X Y
Ontologies
Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
Terminal
Root
subclassof
Keywords
Z
Text-Based Search
Data Sets
Metadata
Data Files
X Y
Ontologies
Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
Terminal
Root
subclassof
Keywords
Z
Text-Based Search
Data Sets
Metadata
Data Files
X Y
Ontologies
Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
Terminal
Root
subclassof
Keywords
X
Semantic Visual
Exploration
Y
Z
Text-Based Search
Data Sets
Metadata
Data Files
X Y
Ontologies
Z
A1
X Y
Z
A2
A3
A4
X Y
Z- -
K K K K
L M L M
Free Text
Annotation
Mapping
K
L, M
X, Y
Z
X YZX Y
SATORI
Terminal
Root
subclassof
Keywords
YX
Z
Z
X
SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories
http://satori.refinery-platform.org
D
R
C
Data Analyst Group Leader Data Curator
D
R
C
Data Analyst Group Leader Data Curator
D
R
C
Data Analyst Group Leader Data Curator
D
R
C
Data Analyst Group Leader Data Curator
Need 1

find data sets that match certain experimental characteristics.
Need 2

find data sets that are similar (or dissimilar) to given data sets.
Need 3

get an overview of the distribution of the experimental characteristics
across a collection of data sets.
Need 4

get an overview of the annotation term hierarchy and term usage.
Peter Pirolli and Stu Card
SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories
http://satori.refinery-platform.org
C
A B
C
List graph
B C
B
Tree
Tree map A
A B
C
Data sets
B
C
B
C
B
C
CB
CB
A B
C
Scenario 1:
Scenario 2:
Scenario 3:
AnnotationsTerm
1 2 3 4
SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories
http://satori.refinery-platform.org
SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories
http://satori.refinery-platform.org
The Art Institute of Chicago
HARVARD MEDICAL SCHOOL
JOHANNES KEPLER UNIVERSITY LINZ Stefan Luger, Holger Stitz, Marc Streit
Web
http://satori.refinery-platform.org · http://refinery-platform.org
Acknowledgements
Peter J Park & all members of the Computational Genomics Lab
Fritz Lekschas, Jennifer K Marx, Scott Ouellette, Anton Xue,
Psalm Haseley
HARVARD SCHOOL OF PUBLIC HEALTH Ilya Sytchev, Shannan Ho Sui
UNIVERSITY OF SHEFFIELD David R Jones, Winston Hide
Funding
NIH/NHGRI R00 HG007583, Harvard Stem Cell Institute
We are hiring postdocs & developers!
HARVARD MEDICAL SCHOOL
DEPARTMENT OF BIOMEDICAL INFORMATICS
See http://gehlenborglab.org or http://dbmi.med.harvard.edu for details.
Data visualization, analysis, and management for:
• genomic structural variants
• dynamics of the 3D genome
• cancer subtypes in patient cohorts
• exploration tools for data repositories
• provenance graphs
X
B
A
D
A
X XX Term Terminal term To be deleted
A
A
X To be duplicated
A A
C
ABA
C
B
C'
0 0 00 5 5 5 5
0 5
1 5
5 10 5 10
Term size Cumulative sizeX1 2
2 7
2 7
1 5
D
C
F D
C
F
F'
1. Global 2. Tree Map 3. Node-Link Diagram
5 10
1 5 1 105 5
0 10
G G
BB
B
C
C
C E EA'C

Contenu connexe

Tendances

FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_PresentationYatpang Cheung
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final ReportShruthi Choudary
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Michel Dumontier
 
Global phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoveryGlobal phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoverymhaendel
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discoverymhaendel
 
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databasescschlos2
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 
GigaScience: data and beta-database launch. Announcing GigaDB
GigaScience: data and beta-database launch. Announcing GigaDBGigaScience: data and beta-database launch. Announcing GigaDB
GigaScience: data and beta-database launch. Announcing GigaDBGigaScience, BGI Hong Kong
 
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseRothamsted Research, UK
 
Ondex: Data integration and visualisation
Ondex: Data integration and visualisationOndex: Data integration and visualisation
Ondex: Data integration and visualisationBiogeeks
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge DiscoveryMichel Dumontier
 
An integrated dataset for in silico drug discovery
An integrated dataset for in silico drug discoveryAn integrated dataset for in silico drug discovery
An integrated dataset for in silico drug discoverySimon Cockell
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET
 
Claudia medina: Linking Health Records for Population Health Research in Brazil.
Claudia medina: Linking Health Records for Population Health Research in Brazil.Claudia medina: Linking Health Records for Population Health Research in Brazil.
Claudia medina: Linking Health Records for Population Health Research in Brazil.Flávio Codeço Coelho
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryJeremy Yang
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princetonCyndy Parr
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance
 
is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.Chris Evelo
 

Tendances (20)

FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
 
Bioinformatics Final Report
Bioinformatics Final ReportBioinformatics Final Report
Bioinformatics Final Report
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 
Global phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discoveryGlobal phenotypic data sharing standards to maximize diagnostic discovery
Global phenotypic data sharing standards to maximize diagnostic discovery
 
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease DiscoveryData Translator: an Open Science Data Platform for Mechanistic Disease Discovery
Data Translator: an Open Science Data Platform for Mechanistic Disease Discovery
 
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databases
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
GigaScience: data and beta-database launch. Announcing GigaDB
GigaScience: data and beta-database launch. Announcing GigaDBGigaScience: data and beta-database launch. Announcing GigaDB
GigaScience: data and beta-database launch. Announcing GigaDB
 
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
 
Ondex: Data integration and visualisation
Ondex: Data integration and visualisationOndex: Data integration and visualisation
Ondex: Data integration and visualisation
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
An integrated dataset for in silico drug discovery
An integrated dataset for in silico drug discoveryAn integrated dataset for in silico drug discovery
An integrated dataset for in silico drug discovery
 
DCC Keynote 2007
DCC Keynote 2007DCC Keynote 2007
DCC Keynote 2007
 
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
dkNET Webinar: "The Microphysiology Systems Database (MPS-Db): A Platform For...
 
Claudia medina: Linking Health Records for Population Health Research in Brazil.
Claudia medina: Linking Health Records for Population Health Research in Brazil.Claudia medina: Linking Health Records for Population Health Research in Brazil.
Claudia medina: Linking Health Records for Population Health Research in Brazil.
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.is there life between standards? Data interoperability for AI.
is there life between standards? Data interoperability for AI.
 

En vedette

Computational Analysis in an extended model of E. Coli
Computational Analysis in an extended model of E. ColiComputational Analysis in an extended model of E. Coli
Computational Analysis in an extended model of E. ColiSteven Stadler
 
Computational Analysis NCP ICM - Copy
Computational Analysis NCP ICM - CopyComputational Analysis NCP ICM - Copy
Computational Analysis NCP ICM - CopyVernon D Dutch Jr
 
Computational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioningComputational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioningLars Juhl Jensen
 
A Computational Analysis of Agenda Setting Theory
A Computational Analysis of Agenda Setting TheoryA Computational Analysis of Agenda Setting Theory
A Computational Analysis of Agenda Setting TheoryAlice Oh
 
Computational Analysis Of A Thin Plate
Computational Analysis Of A Thin PlateComputational Analysis Of A Thin Plate
Computational Analysis Of A Thin PlateDavid Parker
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Designbaoilleach
 
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK IAEME Publication
 

En vedette (7)

Computational Analysis in an extended model of E. Coli
Computational Analysis in an extended model of E. ColiComputational Analysis in an extended model of E. Coli
Computational Analysis in an extended model of E. Coli
 
Computational Analysis NCP ICM - Copy
Computational Analysis NCP ICM - CopyComputational Analysis NCP ICM - Copy
Computational Analysis NCP ICM - Copy
 
Computational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioningComputational Biology - Signaling networks and drug repositioning
Computational Biology - Signaling networks and drug repositioning
 
A Computational Analysis of Agenda Setting Theory
A Computational Analysis of Agenda Setting TheoryA Computational Analysis of Agenda Setting Theory
A Computational Analysis of Agenda Setting Theory
 
Computational Analysis Of A Thin Plate
Computational Analysis Of A Thin PlateComputational Analysis Of A Thin Plate
Computational Analysis Of A Thin Plate
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Design
 
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK
COMPUTATIONAL ANALYSIS OF STEPPED AND STRAIGHT MICROCHANNEL HEAT SINK
 

Similaire à Approaches for the Integration of Visual and Computational Analysis of Biomedical Data

Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...Istituto nazionale di statistica
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksCarole Goble
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chainPaul Groth
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and KnowledgeIan Foster
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptxvijayapraba1
 
The Ondex Data Integration Framework
The Ondex Data Integration FrameworkThe Ondex Data Integration Framework
The Ondex Data Integration Frameworkbosc
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Nucl. Acids Res.-2014-Howe-nar-gku1244
Nucl. Acids Res.-2014-Howe-nar-gku1244Nucl. Acids Res.-2014-Howe-nar-gku1244
Nucl. Acids Res.-2014-Howe-nar-gku1244Yasel Cruz
 
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationBlerina Spahiu
 
OVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Solutions
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...Araport
 

Similaire à Approaches for the Integration of Visual and Computational Analysis of Biomedical Data (20)

Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
Session III Census and registers - M. Scannapieco,The Italian Integrated Syst...
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and Knowledge
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
The Ondex Data Integration Framework
The Ondex Data Integration FrameworkThe Ondex Data Integration Framework
The Ondex Data Integration Framework
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Satya Sahoo Thesis Defense
Satya Sahoo Thesis DefenseSatya Sahoo Thesis Defense
Satya Sahoo Thesis Defense
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Nucl. Acids Res.-2014-Howe-nar-gku1244
Nucl. Acids Res.-2014-Howe-nar-gku1244Nucl. Acids Res.-2014-Howe-nar-gku1244
Nucl. Acids Res.-2014-Howe-nar-gku1244
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
Semantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life SciencesSemantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life Sciences
 
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
 
OVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Bioinformatic Solutions
OVium Bioinformatic Solutions
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
 

Plus de Nils Gehlenborg

Power to the People: Data Visualization in Biology and Medicine
Power to the People: Data Visualization in Biology and MedicinePower to the People: Data Visualization in Biology and Medicine
Power to the People: Data Visualization in Biology and MedicineNils Gehlenborg
 
Cancer Genomics Visualization across Scales: Nucleotides to Cohorts
Cancer Genomics Visualization across Scales: Nucleotides to CohortsCancer Genomics Visualization across Scales: Nucleotides to Cohorts
Cancer Genomics Visualization across Scales: Nucleotides to CohortsNils Gehlenborg
 
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...A Unified Approach to Exploration, Authoring, and Communication with Reproduc...
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...Nils Gehlenborg
 
EMBL John Kendrew Award Lecture 2018
EMBL John Kendrew Award Lecture 2018EMBL John Kendrew Award Lecture 2018
EMBL John Kendrew Award Lecture 2018Nils Gehlenborg
 
Mining Gems from the Data Visualization Literature
Mining Gems from the Data Visualization LiteratureMining Gems from the Data Visualization Literature
Mining Gems from the Data Visualization LiteratureNils Gehlenborg
 
Patients, Genomes, Time: Visualizing Disease Cohorts
Patients, Genomes, Time: Visualizing Disease CohortsPatients, Genomes, Time: Visualizing Disease Cohorts
Patients, Genomes, Time: Visualizing Disease CohortsNils Gehlenborg
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeNils Gehlenborg
 
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeVisualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeNils Gehlenborg
 
Visualization of 3D Genome Data
Visualization of 3D Genome DataVisualization of 3D Genome Data
Visualization of 3D Genome DataNils Gehlenborg
 
Bayer Data Science Meetup
Bayer Data Science MeetupBayer Data Science Meetup
Bayer Data Science MeetupNils Gehlenborg
 
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...Nils Gehlenborg
 
Relaxation Techniques for the Upset Data Scientist
Relaxation Techniques for the Upset Data ScientistRelaxation Techniques for the Upset Data Scientist
Relaxation Techniques for the Upset Data ScientistNils Gehlenborg
 
Multi-Scale Visualization Tools for Exploration of Chromosome Interaction ...
Multi-Scale  Visualization Tools for  Exploration of  Chromosome Interaction ...Multi-Scale  Visualization Tools for  Exploration of  Chromosome Interaction ...
Multi-Scale Visualization Tools for Exploration of Chromosome Interaction ...Nils Gehlenborg
 
SMC-RNA BioVis Data Visualization DREAM Challenge Preview
SMC-RNA BioVis Data Visualization DREAM Challenge PreviewSMC-RNA BioVis Data Visualization DREAM Challenge Preview
SMC-RNA BioVis Data Visualization DREAM Challenge PreviewNils Gehlenborg
 
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...Nils Gehlenborg
 
BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015Nils Gehlenborg
 
Visualization Tools for the Refinery Platform - Supporting reproducible resea...
Visualization Tools for the Refinery Platform - Supporting reproducible resea...Visualization Tools for the Refinery Platform - Supporting reproducible resea...
Visualization Tools for the Refinery Platform - Supporting reproducible resea...Nils Gehlenborg
 
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherNils Gehlenborg
 
Biological Visualization Community Meetup 2014
Biological Visualization Community Meetup 2014Biological Visualization Community Meetup 2014
Biological Visualization Community Meetup 2014Nils Gehlenborg
 

Plus de Nils Gehlenborg (20)

HiGlass & Friends
HiGlass & FriendsHiGlass & Friends
HiGlass & Friends
 
Power to the People: Data Visualization in Biology and Medicine
Power to the People: Data Visualization in Biology and MedicinePower to the People: Data Visualization in Biology and Medicine
Power to the People: Data Visualization in Biology and Medicine
 
Cancer Genomics Visualization across Scales: Nucleotides to Cohorts
Cancer Genomics Visualization across Scales: Nucleotides to CohortsCancer Genomics Visualization across Scales: Nucleotides to Cohorts
Cancer Genomics Visualization across Scales: Nucleotides to Cohorts
 
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...A Unified Approach to Exploration, Authoring, and Communication with Reproduc...
A Unified Approach to Exploration, Authoring, and Communication with Reproduc...
 
EMBL John Kendrew Award Lecture 2018
EMBL John Kendrew Award Lecture 2018EMBL John Kendrew Award Lecture 2018
EMBL John Kendrew Award Lecture 2018
 
Mining Gems from the Data Visualization Literature
Mining Gems from the Data Visualization LiteratureMining Gems from the Data Visualization Literature
Mining Gems from the Data Visualization Literature
 
Patients, Genomes, Time: Visualizing Disease Cohorts
Patients, Genomes, Time: Visualizing Disease CohortsPatients, Genomes, Time: Visualizing Disease Cohorts
Patients, Genomes, Time: Visualizing Disease Cohorts
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
 
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and TimeVisualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
Visualizing Patient Cohorts: Integrating Data Types, Relationships, and Time
 
Visualization of 3D Genome Data
Visualization of 3D Genome DataVisualization of 3D Genome Data
Visualization of 3D Genome Data
 
Bayer Data Science Meetup
Bayer Data Science MeetupBayer Data Science Meetup
Bayer Data Science Meetup
 
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...
HiGlass + HiPiler: Making Sense of Chromosome Interaction Data with Multi-Sca...
 
Relaxation Techniques for the Upset Data Scientist
Relaxation Techniques for the Upset Data ScientistRelaxation Techniques for the Upset Data Scientist
Relaxation Techniques for the Upset Data Scientist
 
Multi-Scale Visualization Tools for Exploration of Chromosome Interaction ...
Multi-Scale  Visualization Tools for  Exploration of  Chromosome Interaction ...Multi-Scale  Visualization Tools for  Exploration of  Chromosome Interaction ...
Multi-Scale Visualization Tools for Exploration of Chromosome Interaction ...
 
SMC-RNA BioVis Data Visualization DREAM Challenge Preview
SMC-RNA BioVis Data Visualization DREAM Challenge PreviewSMC-RNA BioVis Data Visualization DREAM Challenge Preview
SMC-RNA BioVis Data Visualization DREAM Challenge Preview
 
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...
Tracing the Origins of Data and Ideas - Provenance Visualization for Biomedic...
 
BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015BioVis Meetup @ IEEE VIS 2015
BioVis Meetup @ IEEE VIS 2015
 
Visualization Tools for the Refinery Platform - Supporting reproducible resea...
Visualization Tools for the Refinery Platform - Supporting reproducible resea...Visualization Tools for the Refinery Platform - Supporting reproducible resea...
Visualization Tools for the Refinery Platform - Supporting reproducible resea...
 
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All Together
 
Biological Visualization Community Meetup 2014
Biological Visualization Community Meetup 2014Biological Visualization Community Meetup 2014
Biological Visualization Community Meetup 2014
 

Dernier

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 

Dernier (20)

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 

Approaches for the Integration of Visual and Computational Analysis of Biomedical Data

  • 1. Approaches for the Integration of Visual and Computational Analysis of Biomedical Data HARVARD MEDICAL SCHOOL DEPARTMENT OF BIOMEDICAL INFORMATICS NILS GEHLENBORG @nils_gehlenborg http://gehlenborglab.org
  • 3. BIG PILES OF DATA …
  • 6. SINGLE OR FEW DATA SETS Test hypotheses without generating new data. Use published data as supporting evidence for findings based on our your own data sets. MANY DATA SETS Conduct meta analyses, e.g. characterize expression patterns in human tissues or to link diseases.
  • 7. M. Lukk, et al., Nature Biotechnology, 28(4):322–324 (2010)
  • 8. S. Suthram et al.,PLoS Computational Biology 6(2)(2010)
  • 9. SINGLE OR FEW DATA SETS Test hypotheses without generating new data. Use published data as supporting evidence for findings based on our your own data sets. MANY DATA SETS Conduct meta analyses, e.g. characterize expression patterns in human tissues or to link diseases. COMMON BEHAVIOR OF RESEARCH PARASITES!
  • 10. N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES
  • 11. N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES
  • 12. ANALYSIS PIPELINES N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES
  • 13. ANALYSIS PIPELINES N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES GALAXY Toolshed Workflow Editor Tools REST API
  • 14. ANALYSIS PIPELINES N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES GALAXY Toolshed Workflow Editor Tools REST API Workflow Inputs Workflow Outputs
  • 15. N Gehlenborg et al. , manuscript in preparation | DATA REPOSITORY VISUALIZATION TOOLS ANALYSIS PIPELINES http://www.refinery-platform.org
  • 16. … BUT NOT SO FAST!
  • 17. Z Text-Bas Data Sets Metadata Data Files X Y Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y Keywords
  • 18. Z Text-Based Search Data Sets Metadata Data Files X Y Ontologies Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y Terminal Root subclassof Keywords
  • 19. Z Text-Based Search Data Sets Metadata Data Files X Y Ontologies Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y Terminal Root subclassof Keywords
  • 20. Z Text-Based Search Data Sets Metadata Data Files X Y Ontologies Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y Terminal Root subclassof Keywords
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. Z Text-Based Search Data Sets Metadata Data Files X Y Ontologies Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y Terminal Root subclassof Keywords
  • 26. X Semantic Visual Exploration Y Z Text-Based Search Data Sets Metadata Data Files X Y Ontologies Z A1 X Y Z A2 A3 A4 X Y Z- - K K K K L M L M Free Text Annotation Mapping K L, M X, Y Z X YZX Y SATORI Terminal Root subclassof Keywords YX Z Z X
  • 27. SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories http://satori.refinery-platform.org
  • 28. D R C Data Analyst Group Leader Data Curator
  • 29. D R C Data Analyst Group Leader Data Curator
  • 30. D R C Data Analyst Group Leader Data Curator
  • 31. D R C Data Analyst Group Leader Data Curator
  • 32. Need 1
 find data sets that match certain experimental characteristics. Need 2
 find data sets that are similar (or dissimilar) to given data sets. Need 3
 get an overview of the distribution of the experimental characteristics across a collection of data sets. Need 4
 get an overview of the annotation term hierarchy and term usage.
  • 33. Peter Pirolli and Stu Card
  • 34. SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories http://satori.refinery-platform.org
  • 35. C A B C List graph B C B Tree Tree map A A B C Data sets B C B C B C CB CB A B C Scenario 1: Scenario 2: Scenario 3: AnnotationsTerm 1 2 3 4
  • 36. SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories http://satori.refinery-platform.org
  • 37. SATORI: A System for Ontology-Guided Visual Exploration of Biomedical Data Repositories http://satori.refinery-platform.org
  • 38.
  • 39. The Art Institute of Chicago
  • 40. HARVARD MEDICAL SCHOOL JOHANNES KEPLER UNIVERSITY LINZ Stefan Luger, Holger Stitz, Marc Streit Web http://satori.refinery-platform.org · http://refinery-platform.org Acknowledgements Peter J Park & all members of the Computational Genomics Lab Fritz Lekschas, Jennifer K Marx, Scott Ouellette, Anton Xue, Psalm Haseley HARVARD SCHOOL OF PUBLIC HEALTH Ilya Sytchev, Shannan Ho Sui UNIVERSITY OF SHEFFIELD David R Jones, Winston Hide Funding NIH/NHGRI R00 HG007583, Harvard Stem Cell Institute
  • 41. We are hiring postdocs & developers! HARVARD MEDICAL SCHOOL DEPARTMENT OF BIOMEDICAL INFORMATICS See http://gehlenborglab.org or http://dbmi.med.harvard.edu for details. Data visualization, analysis, and management for: • genomic structural variants • dynamics of the 3D genome • cancer subtypes in patient cohorts • exploration tools for data repositories • provenance graphs
  • 42. X B A D A X XX Term Terminal term To be deleted A A X To be duplicated A A C ABA C B C' 0 0 00 5 5 5 5 0 5 1 5 5 10 5 10 Term size Cumulative sizeX1 2 2 7 2 7 1 5 D C F D C F F' 1. Global 2. Tree Map 3. Node-Link Diagram 5 10 1 5 1 105 5 0 10 G G BB B C C C E EA'C