Biolayout Marblar Feb13

M
Big Data goes 3D: BioLayout Express3D

         Prof Tom Freeman
        University of Edinburgh
Network Graphs of (Biological) Relationships
Many types of data, biological or otherwise, can best be
viewed and interrogated as networks, best visualised as so-
called network graphs.

In biology these may include:

• Social interactions between individuals                     Spread of TB via contact tracing



• Transmission of disease

• Relationship (evolutionary, homology) between genes and
  proteins

• Interactions between proteins (data, co-citation, pathway          Protein homology

  models)

• ‘omics data




                                                 Pathways           Protein interaction
Example: Microarray Gene Expression Data




• Can sequence and measure tissue-specific activity of 23,000          Microarrays

  genes in human body

• Microarrays comprised of 1000s/millions of DNA probes –
  routinely used to measure activity across the genome

• Produce highly complex data – analysis/visualisation is         Display of statistical hits
  challenging

• BioLayout Express3D developed originally to analyse this kind
  of data through use of 3D network graphs




                                                                      Display of clusters
Example (cont.): Steps Involved in Analyzing Gene Expression Data

• Microarray data (many measurements over many samples)
  imported
• Co-expression defined using correlation measure (read: is gene A
  upregulated in the same samples as gene B?)
• Genes (nodes) are connected to each other in a network based on
  their level of co-expression (edges) (read: pretty graphs!)



                                       1.25 billion
                       50,000




                                       calculations
                                                      r>


                                   50,000
                                Correlation
                                  matrix
Example (cont.): The program’s work-flow in detail

              Data quality control,
          normalisation and annotation


               Gene-to-gene Pearson
           correlation calculated for every
               probe set on the array


  Filter correlations file based on user defined
threshold (0 - 1.0), i.e. exclude weak correlations


    Edges drawn between nodes (genes) based
     on correlations > than selected threshold


              2D or 3D visualisation



        Clustering and visual exploration
                                                      CPU or GPU parallelization used for all
                                                      computationally intensive algorithms
Example Graphs Derived from
     Expression Data
Advantages of Graph-based Analyses of Complex Data using
                    BioLayout Express3D


• Rapid calculation of networks from primary data

• Support for the visualization of large (10s of thousands of nodes, millions
  of edges) network graphs
• Rendering of the networks in 3D space with real-time interactive
  navigation
• Full range of tools for network visualization, inspection, querying and
  analysis
• Rapid calculations as CPU and GPU are used for parallel calculations

• Can in principle use to visualise data from all kinds of fields as well as
  linking to primary data manipulation programs such as Excel
Modelling and Visualization of Stochastic Flow through Large
           Network Systems – e.g. biological pathways


• Standardized graphical notation system depicts the complex network of
  relationships within e.g. biological pathways
• Previously no way of using these models as a basis for the computational
  modeling of pathway function
• Biolayout can dynamically model the stochastic flow of ‘activity’ through large
  networks/pathways
• Can represent this flow visually

Basically: Can model and animate how components of a complex network
   influence each other over time & compare to real data to test the model.
Modelling and Visualization of Stochastic Flow through Large
                     Network Systems

                                           1. Pathway models drawn in yEd
                                           graph editor, parameterized and
                                           saved as .graphml files


                                                             2. Models imported
                                                             into BioLayout and
                                                             used to calculate time-
                                                             dependent stochastic
                                                             flow through network




    3. The results of flow simulations
    can be visualised as graphs (mouse-
    over function) or viewed as real-
    time animations where the size and
    colour of nodes is used to represent
    their activity
What we’re looking for


The code is open source for non-commercial use – we’d love for you to use it in your
research, be it in biology or anywhere else
•Where do you see this tool making an impact in a research setting?

Maybe you’re a programmer who’d like to get involved in adapting the software for:
•Adapting it to new applications
•Integrating it with other tools
•Exploring the visualisation capabilities of the tool in new setting

We’re also looking to develop the technology commercially
•Can you think of any great market opportunities for BioLayout?
•Who should we be partnering with to develop the tool for this application? Who might
want to license it?


Either way, we’d love to hear from you!
BioLayout Express3D Team


      The Roslin Institute
          Tim Angus
         Derek Wright
         Tom Freeman


           EMBL-EBI
         Anton Enright
       Stijn van Dongen


Thanks to the challenge sponsors
1 sur 11

Recommandé

Ppt manqing par
Ppt manqingPpt manqing
Ppt manqingXiang Zhang
162 vues21 diapositives
High dimesional data (FAST clustering ALG) PPT par
High dimesional data (FAST clustering ALG) PPTHigh dimesional data (FAST clustering ALG) PPT
High dimesional data (FAST clustering ALG) PPTdeepan v
1K vues21 diapositives
A fast clustering based feature subset selection algorithm for high-dimension... par
A fast clustering based feature subset selection algorithm for high-dimension...A fast clustering based feature subset selection algorithm for high-dimension...
A fast clustering based feature subset selection algorithm for high-dimension...IEEEFINALYEARPROJECTS
2K vues8 diapositives
A fast clustering based feature subset selection algorithm for high-dimension... par
A fast clustering based feature subset selection algorithm for high-dimension...A fast clustering based feature subset selection algorithm for high-dimension...
A fast clustering based feature subset selection algorithm for high-dimension...IEEEFINALYEARPROJECTS
8.2K vues10 diapositives
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w... par
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...
Implementation of Fuzzy Logic for the High-Resolution Remote Sensing Images w...IOSR Journals
412 vues5 diapositives
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex... par
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...
Biclustering using Parallel Fuzzy Approach for Analysis of Microarray Gene Ex...CSCJournals
235 vues13 diapositives

Contenu connexe

Tendances

F017533540 par
F017533540F017533540
F017533540IOSR Journals
197 vues6 diapositives
Prediction Model Using Web Usage Mining Techniques par
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesEditor IJCATR
436 vues4 diapositives
Term Paper Presentation par
Term Paper PresentationTerm Paper Presentation
Term Paper PresentationShubham Singh
381 vues43 diapositives
IRJET-Multimodal Image Classification through Band and K-Means Clustering par
IRJET-Multimodal Image Classification through Band and K-Means ClusteringIRJET-Multimodal Image Classification through Band and K-Means Clustering
IRJET-Multimodal Image Classification through Band and K-Means ClusteringIRJET Journal
42 vues6 diapositives
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi... par
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...JIEMS Akkalkuwa
42 vues13 diapositives
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm" par
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm""Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"Government of India and Tata Trusts
101 vues23 diapositives

Tendances(19)

Prediction Model Using Web Usage Mining Techniques par Editor IJCATR
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining Techniques
Editor IJCATR436 vues
IRJET-Multimodal Image Classification through Band and K-Means Clustering par IRJET Journal
IRJET-Multimodal Image Classification through Band and K-Means ClusteringIRJET-Multimodal Image Classification through Band and K-Means Clustering
IRJET-Multimodal Image Classification through Band and K-Means Clustering
IRJET Journal42 vues
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi... par JIEMS Akkalkuwa
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...
Review Paper on Shared and Distributed Memory Parallel Algorithms to Solve Bi...
JIEMS Akkalkuwa42 vues
The effect of gamma value on support vector machine performance with differen... par IJECEIAES
The effect of gamma value on support vector machine performance with differen...The effect of gamma value on support vector machine performance with differen...
The effect of gamma value on support vector machine performance with differen...
IJECEIAES18 vues
16-mmap-ml-sigmod par Dezhi Fang
16-mmap-ml-sigmod16-mmap-ml-sigmod
16-mmap-ml-sigmod
Dezhi Fang120 vues
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I... par ijcsit
Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...Q UANTUM  C LUSTERING -B ASED  F EATURE SUBSET  S ELECTION FOR MAMMOGRAPHIC I...
Q UANTUM C LUSTERING -B ASED F EATURE SUBSET S ELECTION FOR MAMMOGRAPHIC I...
ijcsit183 vues
A genetic algorithm approach for predicting ribonucleic acid sequencing data ... par TELKOMNIKA JOURNAL
A genetic algorithm approach for predicting ribonucleic acid sequencing data ...A genetic algorithm approach for predicting ribonucleic acid sequencing data ...
A genetic algorithm approach for predicting ribonucleic acid sequencing data ...
ROBUST TEXT DETECTION AND EXTRACTION IN NATURAL SCENE IMAGES USING CONDITIONA... par ijiert bestjournal
ROBUST TEXT DETECTION AND EXTRACTION IN NATURAL SCENE IMAGES USING CONDITIONA...ROBUST TEXT DETECTION AND EXTRACTION IN NATURAL SCENE IMAGES USING CONDITIONA...
ROBUST TEXT DETECTION AND EXTRACTION IN NATURAL SCENE IMAGES USING CONDITIONA...
16-model-compare-hilda par Dezhi Fang
16-model-compare-hilda16-model-compare-hilda
16-model-compare-hilda
Dezhi Fang157 vues
Hybridization of Meta-heuristics for Optimizing Routing protocol in VANETs par IJERA Editor
Hybridization of Meta-heuristics for Optimizing Routing protocol in VANETsHybridization of Meta-heuristics for Optimizing Routing protocol in VANETs
Hybridization of Meta-heuristics for Optimizing Routing protocol in VANETs
IJERA Editor44 vues
An Integrated Inductive-Deductive Framework for Data Mapping in Wireless Sens... par M H
An Integrated Inductive-Deductive Framework for Data Mapping in Wireless Sens...An Integrated Inductive-Deductive Framework for Data Mapping in Wireless Sens...
An Integrated Inductive-Deductive Framework for Data Mapping in Wireless Sens...
M H584 vues
Pattern recognition system based on support vector machines par Alexander Decker
Pattern recognition system based on support vector machinesPattern recognition system based on support vector machines
Pattern recognition system based on support vector machines
Alexander Decker345 vues
Bayesian-Network-Based Algorithm Selection with High Level Representation Fee... par ITIIIndustries
Bayesian-Network-Based Algorithm Selection with High Level Representation Fee...Bayesian-Network-Based Algorithm Selection with High Level Representation Fee...
Bayesian-Network-Based Algorithm Selection with High Level Representation Fee...
ITIIIndustries35 vues

Similaire à Biolayout Marblar Feb13

Cloud data management par
Cloud data managementCloud data management
Cloud data managementambitlick
579 vues53 diapositives
Cytoscape Talk 2010 par
Cytoscape Talk 2010Cytoscape Talk 2010
Cytoscape Talk 2010Stewart MacArthur
1.6K vues27 diapositives
Poster (1) par
Poster (1)Poster (1)
Poster (1)Daniel Osei
65 vues1 diapositive
Graph Signal Processing for Machine Learning A Review and New Perspectives - ... par
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...lauratoni4
198 vues69 diapositives
CLIM Program: Remote Sensing Workshop, An Introduction to Systems and Softwar... par
CLIM Program: Remote Sensing Workshop, An Introduction to Systems and Softwar...CLIM Program: Remote Sensing Workshop, An Introduction to Systems and Softwar...
CLIM Program: Remote Sensing Workshop, An Introduction to Systems and Softwar...The Statistical and Applied Mathematical Sciences Institute
210 vues23 diapositives
Pathway and network analysis par
Pathway and network analysisPathway and network analysis
Pathway and network analysisManar Al-Eslam Mattar
733 vues46 diapositives

Similaire à Biolayout Marblar Feb13(20)

Cloud data management par ambitlick
Cloud data managementCloud data management
Cloud data management
ambitlick579 vues
Graph Signal Processing for Machine Learning A Review and New Perspectives - ... par lauratoni4
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
lauratoni4198 vues
Handwritten Text Recognition Using Machine Learning par IRJET Journal
Handwritten Text Recognition Using Machine LearningHandwritten Text Recognition Using Machine Learning
Handwritten Text Recognition Using Machine Learning
IRJET Journal16 vues
Unit i introduction to grid computing par sudha kar
Unit i   introduction to grid computingUnit i   introduction to grid computing
Unit i introduction to grid computing
sudha kar13.6K vues
From Simulation to Online Gaming: the need for adaptive solutions par Gabriele D'Angelo
From Simulation to Online Gaming: the need for adaptive solutions From Simulation to Online Gaming: the need for adaptive solutions
From Simulation to Online Gaming: the need for adaptive solutions
IRJET - Network Traffic Monitoring and Botnet Detection using K-ANN Algorithm par IRJET Journal
IRJET - Network Traffic Monitoring and Botnet Detection using K-ANN AlgorithmIRJET - Network Traffic Monitoring and Botnet Detection using K-ANN Algorithm
IRJET - Network Traffic Monitoring and Botnet Detection using K-ANN Algorithm
IRJET Journal10 vues
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4... par Keiichiro Ono
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...
Keiichiro Ono7.6K vues
NetBioSIG2013-KEYNOTE Benno Schwikowski par Alexander Pico
NetBioSIG2013-KEYNOTE Benno SchwikowskiNetBioSIG2013-KEYNOTE Benno Schwikowski
NetBioSIG2013-KEYNOTE Benno Schwikowski
Alexander Pico2K vues
Scalable Similarity-Based Neighborhood Methods with MapReduce par sscdotopen
Scalable Similarity-Based Neighborhood Methods with MapReduceScalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduce
sscdotopen2.8K vues
Online stream mining approach for clustering network traffic par eSAT Journals
Online stream mining approach for clustering network trafficOnline stream mining approach for clustering network traffic
Online stream mining approach for clustering network traffic
eSAT Journals124 vues
CS8603_Notes_003-1_edubuzz360.pdf par KishaKiddo
CS8603_Notes_003-1_edubuzz360.pdfCS8603_Notes_003-1_edubuzz360.pdf
CS8603_Notes_003-1_edubuzz360.pdf
KishaKiddo127 vues
20090219 The case for another systems biology modelling environment par Jonathan Blakes
20090219 The case for another systems biology modelling environment20090219 The case for another systems biology modelling environment
20090219 The case for another systems biology modelling environment
Jonathan Blakes494 vues

Plus de marblar

Droplet Orchestrator par
Droplet OrchestratorDroplet Orchestrator
Droplet Orchestratormarblar
2.7K vues11 diapositives
Svaya tech overview for marblar nanoreactors v2 par
Svaya tech overview for marblar   nanoreactors v2Svaya tech overview for marblar   nanoreactors v2
Svaya tech overview for marblar nanoreactors v2marblar
2.3K vues10 diapositives
Polar bear positioning par
Polar bear positioningPolar bear positioning
Polar bear positioningmarblar
746 vues6 diapositives
CLaDS Marblar Feb 13 par
CLaDS Marblar Feb 13CLaDS Marblar Feb 13
CLaDS Marblar Feb 13marblar
2.4K vues12 diapositives
Titanium dioxide - Svaya Feb 2013 par
Titanium dioxide - Svaya Feb 2013Titanium dioxide - Svaya Feb 2013
Titanium dioxide - Svaya Feb 2013marblar
1.7K vues9 diapositives
Offshore access vessel slides gm par
Offshore access vessel slides   gmOffshore access vessel slides   gm
Offshore access vessel slides gmmarblar
924 vues8 diapositives

Plus de marblar(20)

Droplet Orchestrator par marblar
Droplet OrchestratorDroplet Orchestrator
Droplet Orchestrator
marblar2.7K vues
Svaya tech overview for marblar nanoreactors v2 par marblar
Svaya tech overview for marblar   nanoreactors v2Svaya tech overview for marblar   nanoreactors v2
Svaya tech overview for marblar nanoreactors v2
marblar2.3K vues
Polar bear positioning par marblar
Polar bear positioningPolar bear positioning
Polar bear positioning
marblar746 vues
CLaDS Marblar Feb 13 par marblar
CLaDS Marblar Feb 13CLaDS Marblar Feb 13
CLaDS Marblar Feb 13
marblar2.4K vues
Titanium dioxide - Svaya Feb 2013 par marblar
Titanium dioxide - Svaya Feb 2013Titanium dioxide - Svaya Feb 2013
Titanium dioxide - Svaya Feb 2013
marblar1.7K vues
Offshore access vessel slides gm par marblar
Offshore access vessel slides   gmOffshore access vessel slides   gm
Offshore access vessel slides gm
marblar924 vues
Optical tweezers Jan 13 par marblar
Optical tweezers Jan 13Optical tweezers Jan 13
Optical tweezers Jan 13
marblar1.4K vues
Polar bear challenge Jan 13 par marblar
Polar bear challenge Jan 13Polar bear challenge Jan 13
Polar bear challenge Jan 13
marblar482 vues
Surfuzion - Marblar Nov 12 par marblar
Surfuzion - Marblar Nov 12Surfuzion - Marblar Nov 12
Surfuzion - Marblar Nov 12
marblar874 vues
Light my carbons - Nov 12 par marblar
Light my carbons - Nov 12Light my carbons - Nov 12
Light my carbons - Nov 12
marblar306 vues
Nanoparticle assay par marblar
Nanoparticle assayNanoparticle assay
Nanoparticle assay
marblar1.7K vues
Superman-vision par marblar
Superman-visionSuperman-vision
Superman-vision
marblar541 vues
Natures drill bit - Oct 12 par marblar
Natures drill bit - Oct 12Natures drill bit - Oct 12
Natures drill bit - Oct 12
marblar1.8K vues
microFTS Oct 2012 par marblar
microFTS Oct 2012microFTS Oct 2012
microFTS Oct 2012
marblar1.2K vues
Oxygen sensor Oct 2012 par marblar
Oxygen sensor Oct 2012Oxygen sensor Oct 2012
Oxygen sensor Oct 2012
marblar835 vues
Viral UPS - Oct 12 par marblar
Viral UPS - Oct 12Viral UPS - Oct 12
Viral UPS - Oct 12
marblar334 vues
Energy Harvester Oct 12 par marblar
Energy Harvester Oct 12Energy Harvester Oct 12
Energy Harvester Oct 12
marblar937 vues
CyMap Sept 12 par marblar
CyMap Sept 12CyMap Sept 12
CyMap Sept 12
marblar499 vues
SlipChip - Oct 2012 par marblar
SlipChip - Oct 2012SlipChip - Oct 2012
SlipChip - Oct 2012
marblar3.9K vues

Biolayout Marblar Feb13

  • 1. Big Data goes 3D: BioLayout Express3D Prof Tom Freeman University of Edinburgh
  • 2. Network Graphs of (Biological) Relationships Many types of data, biological or otherwise, can best be viewed and interrogated as networks, best visualised as so- called network graphs. In biology these may include: • Social interactions between individuals Spread of TB via contact tracing • Transmission of disease • Relationship (evolutionary, homology) between genes and proteins • Interactions between proteins (data, co-citation, pathway Protein homology models) • ‘omics data Pathways Protein interaction
  • 3. Example: Microarray Gene Expression Data • Can sequence and measure tissue-specific activity of 23,000 Microarrays genes in human body • Microarrays comprised of 1000s/millions of DNA probes – routinely used to measure activity across the genome • Produce highly complex data – analysis/visualisation is Display of statistical hits challenging • BioLayout Express3D developed originally to analyse this kind of data through use of 3D network graphs Display of clusters
  • 4. Example (cont.): Steps Involved in Analyzing Gene Expression Data • Microarray data (many measurements over many samples) imported • Co-expression defined using correlation measure (read: is gene A upregulated in the same samples as gene B?) • Genes (nodes) are connected to each other in a network based on their level of co-expression (edges) (read: pretty graphs!) 1.25 billion 50,000 calculations r> 50,000 Correlation matrix
  • 5. Example (cont.): The program’s work-flow in detail Data quality control, normalisation and annotation Gene-to-gene Pearson correlation calculated for every probe set on the array Filter correlations file based on user defined threshold (0 - 1.0), i.e. exclude weak correlations Edges drawn between nodes (genes) based on correlations > than selected threshold 2D or 3D visualisation Clustering and visual exploration CPU or GPU parallelization used for all computationally intensive algorithms
  • 6. Example Graphs Derived from Expression Data
  • 7. Advantages of Graph-based Analyses of Complex Data using BioLayout Express3D • Rapid calculation of networks from primary data • Support for the visualization of large (10s of thousands of nodes, millions of edges) network graphs • Rendering of the networks in 3D space with real-time interactive navigation • Full range of tools for network visualization, inspection, querying and analysis • Rapid calculations as CPU and GPU are used for parallel calculations • Can in principle use to visualise data from all kinds of fields as well as linking to primary data manipulation programs such as Excel
  • 8. Modelling and Visualization of Stochastic Flow through Large Network Systems – e.g. biological pathways • Standardized graphical notation system depicts the complex network of relationships within e.g. biological pathways • Previously no way of using these models as a basis for the computational modeling of pathway function • Biolayout can dynamically model the stochastic flow of ‘activity’ through large networks/pathways • Can represent this flow visually Basically: Can model and animate how components of a complex network influence each other over time & compare to real data to test the model.
  • 9. Modelling and Visualization of Stochastic Flow through Large Network Systems 1. Pathway models drawn in yEd graph editor, parameterized and saved as .graphml files 2. Models imported into BioLayout and used to calculate time- dependent stochastic flow through network 3. The results of flow simulations can be visualised as graphs (mouse- over function) or viewed as real- time animations where the size and colour of nodes is used to represent their activity
  • 10. What we’re looking for The code is open source for non-commercial use – we’d love for you to use it in your research, be it in biology or anywhere else •Where do you see this tool making an impact in a research setting? Maybe you’re a programmer who’d like to get involved in adapting the software for: •Adapting it to new applications •Integrating it with other tools •Exploring the visualisation capabilities of the tool in new setting We’re also looking to develop the technology commercially •Can you think of any great market opportunities for BioLayout? •Who should we be partnering with to develop the tool for this application? Who might want to license it? Either way, we’d love to hear from you!
  • 11. BioLayout Express3D Team The Roslin Institute Tim Angus Derek Wright Tom Freeman EMBL-EBI Anton Enright Stijn van Dongen Thanks to the challenge sponsors