SlideShare a Scribd company logo
1 of 31
Automation in Cytomics: A Modern
          RDBMS Based Platform for Image
         Analysis and Management in High-
         Throughput Screening Experiments

    HIS 2012 : The 1st. International Conference on Health Information
    Science

Enrique Larios, Kuan Yan, Fons J. Verbeek (LIACS, Leiden University, The Netherlands)
Ying Zhang, Fabian Groffen (CWI, Amsterdam, The Netherlands)
Zi Di, Sylvia LeDévédec (Department of Toxicology, Leiden University, The Netherlands)




                                                                             Centrum Wiskunde
                                                                               & Informatica

                                                    Leiden University. The university to discover.
Introduction
§  The current approach in                              Time Lapse
                                     Static
                                                          Sequence
    cytomics is to conduct          Images
                                                           Images
    High Throughput
    Screening (HTS)
    experiments so that cells         2D                    2D + T
    can be tested under
    different experimental
    conditions.
§  In HTS experiments, as            3D                    3D + T
    more experiments are lined
    up, the amount of data and                 Cytomics
    computation needed to
    analyze these increases
    rapidly.
                                                                             2

                                 Leiden University. The university to discover.
Workflow in HTS experiments


            Experiment               Plate design
             planning
Scientist
                         Scientist




                                                                      3

                          Leiden University. The university to discover.
Workflow in HTS experiments
                          HTS process          Storage




   Setting up
       the                              tiff files
   microscope

                BD Pathway
                                        ics & ids
                                           files
                                        tiff files

                Nikon 1
                                        nd2 files
                                        ics & ids
Scientist                                  files
                                        tiff files
                                                                         File Server
                Nikon 2
                                        nd2 files


                                        tiff files

                                        nd2 files
                Nikon 3

                                                                                       4

                                        Leiden University. The university to discover.
Workflow in HTS experiments


                                                                              Image
                                                                             Analysis
                                        Cell masks and motion trajectories
            Bioinformaticians         High-throughput image analysis
                                      provides an automated quantification
                                      of dynamic cell behavior in both
                                      cellular level and structural level.




                                                                              Data
                                                                             Analysis
Scientist
                                     By employing pattern recognition
                                     theorem, the system provides objective
                      Data Map       statistical conclusions to support
                                     biological hypothesizes.


                                                                                   5

                                 Leiden University. The university to discover.
Problems identified

                                         Software            Component
  Duration of           Data
                                        tools used            s used in
      the           (images and
                                          are not                the
  experiments       metadata) is
                                        suitable for         experiment
    can take          not linked
                                         the work              are not
    months.           properly.
                                        performed.           integrated.



 There is no platform that can facilitate Scientists to learn from the
      experience. Lack of a Knowledge Discovery System.




                                                                                6

                                    Leiden University. The university to discover.
Objectives


                                           Develop an
                                           integrated platform
                                           to automate data
                    Design a               management and
                    database to            image analysis of
                    store almost all       cytomic HTS
                    data produced          experiments.
    Establish an    and used in
    automated       the HTS
    workflow        experiments.
    system of the
    HTS
    experiments.                                                       7

                           Leiden University. The university to discover.
Workflow of the HTS System




                                                           8

               Leiden University. The university to discover.
Which data should be stored in
the database?

                    Experiment details




  Users
                                                         Plates & Wells



                   HTS Database




 Results of Data                                        Results of Image
    Analysis                                                Analysis


                      Raw images                                           9

                               Leiden University. The university to discover.
Description of the System
Architecture
GUI layer                      HTS Analysis GUI




            Plate Design   Image Analysis       Pattern recognition tools
                 API            API             API
  Web
Services
 layer

               Glassfish                                      - IIS


  Data
storage /                                              Scientific Super
Processin
 g layer
                                                            Computer


                                                                            10

                               Leiden University. The university to discover.
}    + easy to add/modify a record           }    + only need to read in relevant data
}    - might read in unnecessary             }    - tuple writes require multiple
      data.                                         accesses.
         }    Suitable for read-mostly, read-intensive, large data
               repositories.
         }    MonetDB is a open-source database system for high-
               performance applications in data mining, OLAP, GIS, XML
               Query, text and multimedia retrieval. MonetDB often achieves
               a 10-fold raw speed improvement for SQL and XQuery over
               competitor RDBMSs.                          by Peter Boncz (CWI)
                                                                                           11

                                                Leiden University. The university to discover.
ROW STORAGE    COLUMN STORAGE




     STRIPE                  STRIPE




                                by Peter Boncz (CWI)
                                                         12

              Leiden University. The university to discover.
HTS System Database Schema




                                                        13

             Leiden University. The university to discover.
How data is organized
in the schema?

                    Users




         Experiment details




                                                                         14

                              Leiden University. The university to discover.
How data is organized
in the schema?



      Plates & Wells




                                                                  15

                       Leiden University. The university to discover.
How data is organized in the
schema?




                              Raw images




                                                          16

               Leiden University. The university to discover.
How data is organized in the
schema?


                      Results of Image
                          Analysis




                       Results of Data
                         Analysis
                                                          17

               Leiden University. The university to discover.
How the platform works?
                                                                                        Authentication
                           Decision        New idea       Web User
                           making                         interface
                                                            (GUI)
   Users

User Roles


               System      §      Audit, maintenance of
             Administrator         users, roles, conditions.

                             §    Create Projects,
             Administrator         Experiments, Plates,
                                   Upload the images from
                                   the microscope, and                      Plate layout design (GUI)
                                   perform data and image
                             §    analysis. images from
                                   Upload the                    §    Every user need to log in in the
             Expert User                                               platform and is administrator of their
                                   the microscope, and
                                   perform data and image              own Projects-Experiments.
                             §    analysis.data and image
                                   Perform                       §    A user can also grant to other users a
             Analyst User          analysis and link the               specific role (Administrator, Expert
                                   results to the experiment.          User or Analyst user) and create a
                                                                       collaborative environment.

                                                                                                           18

                                                         Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




     Administration option:
     •  Create / Edit / Delete users
     •  Assign Roles to a user




                                                                                  19

                                       Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




            Project option:
            •  Create, Edit, Delete Projects
            •  Visualize Project’s metadata




                                                                                   20

                                        Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




            Experiments option:
            •  Create, Edit, Delete Experiments
            •  Visualize Experiment’s metadata




                                                                          21

                               Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




            Conditions option:
            •  Create, Edit, Delete, Import
               Coating parameters, Cell line
               tissues, Compounds, siRNA,
               and Antibodies/reagents.




                                                                 22

                      Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




              Plates option:
              •  Create, Edit, Delete Plates
              •  Visualize Plate’s metadata.




                                                        23

             Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




                     Reports option:
                     •  Perform custom queries
                        through different datasets.
                     •  Visualize predefined reports
                        about Projects/
                        Experiments/ Plates/ Well
                        metadata.




                                                        24

             Leiden University. The university to discover.
How the platform works?
Web User
interface
  (GUI)




                     Analysis option:
                     •  Invoke the Data and
                        Image Analysis APIs .
                     •  Visualize the results of the
                        data and image analysis.




                                                        25

             Leiden University. The university to discover.
How the platform works?
Steps in the new Workflow
          System

  §  Create a Project
  §  Create an Experiment
  §  Design the layout of a culture
      plate (4x6 wells, 6x8 wells , 8x12
      wells, etc.).
  §  Assign the experimental
      conditions applied to the wells
      (drag and drop).
  §  Allow access to your project to
      other users assigning them a
      specific Wet lab experiment
               role.
                using the plate design

                                                                         Time-Lapse
                                                                            Image
                                                                         Sequence /
                                         HTS                                Static
                                                                           Images




                                                                                          26

                                               Leiden University. The university to discover.
How the platform works?
                                                                Upload HTS
                                                                 Images
                                        HTS

                           Time-Lapse          §  The files generated by the
                              Image
                           Sequence /              microscope have a standard
                              Static
                             Images
                                                   named convention.
                                               §  Through the GUI, the images are
                                                   uploaded to the platform.
                  Raw Images                   §  The platform links the imported
                                                   images to the experiment and the
q    2D (XY): [1] Frame     [1] Image
      [1..n] Channels
                                                   plate designed previously.
                                               §  The platform also reads from the
q    2D+T (XY+T): [1] Video [1..n]                header of the files information
      Frames   [1] Image    [1..n] Channels
                                                   associated to the microscope
q    3D (XYZ): [1] Frame   [1..n] Sections        settings.
      [1] Image [1..n] Channels                §  According to the microscope used,
q    3D+T (XYZ+T): [1] Video [1..n]               the image’s metadata has a
      Frame [1..n] Sections [1] Image              particular structure that is also
      [1..n] Channel                               stored in the database.
                                                                                         27

                                              Leiden University. The university to discover.
How the platform works?
                                                                     Image
                                                                    Analysis
 Images
uploaded                                            §  Through the GUI it is
                                                        possible to invoke the
                                                        API for the image
                                                        analysis process.
                                                    §  As a result of the image
                                                        analysis, auxiliary
                                                        images are generated:
                                                        binary masks or
                                                        trajectories.
                                                    §  These auxiliary images
                                                        are linked to the plates
                                                        – wells and raw images
                                                        in the GUI.
                                                        Auxiliary
                                                         images


           Binary mask   Trajectories

                                                                                   28

                                        Leiden University. The university to discover.
How the platform works?
                                                                         Data
                                                                        Analysis
Images
Analysis
                                                         §  Measurements
                                                             extracted from the
                                                             image analysis are
                                                             further analyzed using
                                                             Patter recognition tools.
                                                         §  Through the GUI it is
                                                             possible to invoke the
             Binary mask          Trajectories
                                                             API for the data
                                                             analysis process.
                                                         §  As a result, it is
                                                             generated CSV files
                                                             which are stored in the
                                                             database in order to
                                                             have later graphical
                                                             representations.
                                                              Example
                                                              Results
    Cell migration analysis   Structure dynamic
                                   analysis
                                                                                        29

                                             Leiden University. The university to discover.
Conclusions
}    Using this platform for image analysis and management in HTS it is
      possible to avoid typical man-made errors in the experiments.
}    Using this system the time invested in post experiment analysis has
      been reduced considerably. Now takes less than a week to accomplish the
      data analysis that previously easily took more than a month with commercial
      software, or a year by manual observation.
}    The platform allows end-users to perform high-profile cytomics with a
      minimum level of a prior experience on image analysis and machine
      learning.
}    The system uses web services, therefore, the framework is very flexible
      as it allows the connection to other web services.
}    The platform can eventually evolve into a sophisticated interdisciplinary
      platform for cytomics.
}    Having the HTS information comprehensively organized in a
      sophisticated and scalable database is a fertile ground for knowledge
      discovery.


                                                                                      30

                                           Leiden University. The university to discover.
Questions ?


Sponsors:


                                                            31

                 Leiden University. The university to discover.

More Related Content

Similar to Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and Management in High-Throughput Screening Experiments

Developments in datamanagement
Developments in datamanagementDevelopments in datamanagement
Developments in datamanagementSURFnet
 
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013rossTnick
 
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsDeep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsValery Tkachenko
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science James Hendler
 
Towards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingTowards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingOla Spjuth
 
Devnology Back to School: Empirical Evidence on Modeling in Software Development
Devnology Back to School: Empirical Evidence on Modeling in Software DevelopmentDevnology Back to School: Empirical Evidence on Modeling in Software Development
Devnology Back to School: Empirical Evidence on Modeling in Software DevelopmentDevnology
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
Performing Large Scale Repeatable Software Engineering Studies
Performing Large Scale Repeatable Software Engineering StudiesPerforming Large Scale Repeatable Software Engineering Studies
Performing Large Scale Repeatable Software Engineering StudiesGeorgios Gousios
 
Live memory analysis tools and techniques in linux environment tech foring
Live memory analysis tools and techniques in linux environment tech foringLive memory analysis tools and techniques in linux environment tech foring
Live memory analysis tools and techniques in linux environment tech foringSheikh Foyjul Islam
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaBEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaTutors India
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentationelasticdave
 
Hattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsHattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsJason Hattrick-Simpers
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceKrzysztof Gorgolewski
 
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesIan Foster
 

Similar to Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and Management in High-Throughput Screening Experiments (20)

Developments in datamanagement
Developments in datamanagementDevelopments in datamanagement
Developments in datamanagement
 
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013
Ross Tredinnick - Rebecca J. Holz Research Data Management Talk 4/16/2013
 
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictionsDeep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
Deep Learning on nVidia GPUs for QSAR, QSPR and QNAR predictions
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
Brain Imaging Data Structure
Brain Imaging Data StructureBrain Imaging Data Structure
Brain Imaging Data Structure
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Linked Sensor Data 101 (FIS2011)
Linked Sensor Data 101 (FIS2011)Linked Sensor Data 101 (FIS2011)
Linked Sensor Data 101 (FIS2011)
 
Towards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imagingTowards automated phenotypic cell profiling with high-content imaging
Towards automated phenotypic cell profiling with high-content imaging
 
Devnology Back to School: Empirical Evidence on Modeling in Software Development
Devnology Back to School: Empirical Evidence on Modeling in Software DevelopmentDevnology Back to School: Empirical Evidence on Modeling in Software Development
Devnology Back to School: Empirical Evidence on Modeling in Software Development
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
Performing Large Scale Repeatable Software Engineering Studies
Performing Large Scale Repeatable Software Engineering StudiesPerforming Large Scale Repeatable Software Engineering Studies
Performing Large Scale Repeatable Software Engineering Studies
 
31 34
31 3431 34
31 34
 
Live memory analysis tools and techniques in linux environment tech foring
Live memory analysis tools and techniques in linux environment tech foringLive memory analysis tools and techniques in linux environment tech foring
Live memory analysis tools and techniques in linux environment tech foring
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaBEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
 
Hattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsHattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in Materials
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible Neuroscince
 
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
 

More from PlanetData Network of Excellence

A Contextualized Knowledge Repository for Open Data about Trentino
A Contextualized Knowledge Repository for Open Data about TrentinoA Contextualized Knowledge Repository for Open Data about Trentino
A Contextualized Knowledge Repository for Open Data about TrentinoPlanetData Network of Excellence
 
On Leveraging Crowdsourcing Techniques for Schema Matching Networks
On Leveraging Crowdsourcing Techniques for Schema Matching NetworksOn Leveraging Crowdsourcing Techniques for Schema Matching Networks
On Leveraging Crowdsourcing Techniques for Schema Matching NetworksPlanetData Network of Excellence
 
Towards Enabling Probabilistic Databases for Participatory Sensing
Towards Enabling Probabilistic Databases for Participatory SensingTowards Enabling Probabilistic Databases for Participatory Sensing
Towards Enabling Probabilistic Databases for Participatory SensingPlanetData Network of Excellence
 
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstream
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstreamDemo: tablet-based visualisation of transport data in Madrid using SPARQLstream
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstreamPlanetData Network of Excellence
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingPlanetData Network of Excellence
 
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...PlanetData Network of Excellence
 
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatch
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatchLinking Smart Cities Datasets with Human Computation: the case of UrbanMatch
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatchPlanetData Network of Excellence
 
SciQL, Bridging the Gap between Science and Relational DBMS
SciQL, Bridging the Gap between Science and Relational DBMSSciQL, Bridging the Gap between Science and Relational DBMS
SciQL, Bridging the Gap between Science and Relational DBMSPlanetData Network of Excellence
 
Scalable Nonmonotonic Reasoning over RDF Data Using MapReduce
Scalable Nonmonotonic Reasoning over RDF Data Using MapReduceScalable Nonmonotonic Reasoning over RDF Data Using MapReduce
Scalable Nonmonotonic Reasoning over RDF Data Using MapReducePlanetData Network of Excellence
 
Evolution of Workflow Provenance Information in the Presence of Custom Infere...
Evolution of Workflow Provenance Information in the Presence of Custom Infere...Evolution of Workflow Provenance Information in the Presence of Custom Infere...
Evolution of Workflow Provenance Information in the Presence of Custom Infere...PlanetData Network of Excellence
 
Towards Parallel Nonmonotonic Reasoning with Billions of Facts
Towards Parallel Nonmonotonic Reasoning with Billions of FactsTowards Parallel Nonmonotonic Reasoning with Billions of Facts
Towards Parallel Nonmonotonic Reasoning with Billions of FactsPlanetData Network of Excellence
 

More from PlanetData Network of Excellence (20)

Dl2014 slides
Dl2014 slidesDl2014 slides
Dl2014 slides
 
A Contextualized Knowledge Repository for Open Data about Trentino
A Contextualized Knowledge Repository for Open Data about TrentinoA Contextualized Knowledge Repository for Open Data about Trentino
A Contextualized Knowledge Repository for Open Data about Trentino
 
On Leveraging Crowdsourcing Techniques for Schema Matching Networks
On Leveraging Crowdsourcing Techniques for Schema Matching NetworksOn Leveraging Crowdsourcing Techniques for Schema Matching Networks
On Leveraging Crowdsourcing Techniques for Schema Matching Networks
 
Towards Enabling Probabilistic Databases for Participatory Sensing
Towards Enabling Probabilistic Databases for Participatory SensingTowards Enabling Probabilistic Databases for Participatory Sensing
Towards Enabling Probabilistic Databases for Participatory Sensing
 
Privacy-Preserving Schema Reuse
Privacy-Preserving Schema ReusePrivacy-Preserving Schema Reuse
Privacy-Preserving Schema Reuse
 
Pay-as-you-go Reconciliation in Schema Matching Networks
Pay-as-you-go Reconciliation in Schema Matching NetworksPay-as-you-go Reconciliation in Schema Matching Networks
Pay-as-you-go Reconciliation in Schema Matching Networks
 
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstream
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstreamDemo: tablet-based visualisation of transport data in Madrid using SPARQLstream
Demo: tablet-based visualisation of transport data in Madrid using SPARQLstream
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream Processing
 
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...
Urbanopoly: Collection and Quality Assessment of Geo-spatial Linked Data via ...
 
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatch
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatchLinking Smart Cities Datasets with Human Computation: the case of UrbanMatch
Linking Smart Cities Datasets with Human Computation: the case of UrbanMatch
 
SciQL, Bridging the Gap between Science and Relational DBMS
SciQL, Bridging the Gap between Science and Relational DBMSSciQL, Bridging the Gap between Science and Relational DBMS
SciQL, Bridging the Gap between Science and Relational DBMS
 
CLODA: A Crowdsourced Linked Open Data Architecture
CLODA: A Crowdsourced Linked Open Data ArchitectureCLODA: A Crowdsourced Linked Open Data Architecture
CLODA: A Crowdsourced Linked Open Data Architecture
 
Scalable Nonmonotonic Reasoning over RDF Data Using MapReduce
Scalable Nonmonotonic Reasoning over RDF Data Using MapReduceScalable Nonmonotonic Reasoning over RDF Data Using MapReduce
Scalable Nonmonotonic Reasoning over RDF Data Using MapReduce
 
Data and Knowledge Evolution
Data and Knowledge Evolution  Data and Knowledge Evolution
Data and Knowledge Evolution
 
Evolution of Workflow Provenance Information in the Presence of Custom Infere...
Evolution of Workflow Provenance Information in the Presence of Custom Infere...Evolution of Workflow Provenance Information in the Presence of Custom Infere...
Evolution of Workflow Provenance Information in the Presence of Custom Infere...
 
Access Control for RDF graphs using Abstract Models
Access Control for RDF graphs using Abstract ModelsAccess Control for RDF graphs using Abstract Models
Access Control for RDF graphs using Abstract Models
 
Arrays in Databases, the next frontier?
Arrays in Databases, the next frontier?Arrays in Databases, the next frontier?
Arrays in Databases, the next frontier?
 
Abstract Access Control Model for Dynamic RDF Datasets
Abstract Access Control Model for Dynamic RDF DatasetsAbstract Access Control Model for Dynamic RDF Datasets
Abstract Access Control Model for Dynamic RDF Datasets
 
Towards Parallel Nonmonotonic Reasoning with Billions of Facts
Towards Parallel Nonmonotonic Reasoning with Billions of FactsTowards Parallel Nonmonotonic Reasoning with Billions of Facts
Towards Parallel Nonmonotonic Reasoning with Billions of Facts
 
Heuristic based Query Optimisation for SPARQL
Heuristic based Query Optimisation for SPARQLHeuristic based Query Optimisation for SPARQL
Heuristic based Query Optimisation for SPARQL
 

Recently uploaded

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and Management in High-Throughput Screening Experiments

  • 1. Automation in Cytomics: A Modern RDBMS Based Platform for Image Analysis and Management in High- Throughput Screening Experiments HIS 2012 : The 1st. International Conference on Health Information Science Enrique Larios, Kuan Yan, Fons J. Verbeek (LIACS, Leiden University, The Netherlands) Ying Zhang, Fabian Groffen (CWI, Amsterdam, The Netherlands) Zi Di, Sylvia LeDévédec (Department of Toxicology, Leiden University, The Netherlands) Centrum Wiskunde & Informatica Leiden University. The university to discover.
  • 2. Introduction §  The current approach in Time Lapse Static Sequence cytomics is to conduct Images Images High Throughput Screening (HTS) experiments so that cells 2D 2D + T can be tested under different experimental conditions. §  In HTS experiments, as 3D 3D + T more experiments are lined up, the amount of data and Cytomics computation needed to analyze these increases rapidly. 2 Leiden University. The university to discover.
  • 3. Workflow in HTS experiments Experiment Plate design planning Scientist Scientist 3 Leiden University. The university to discover.
  • 4. Workflow in HTS experiments HTS process Storage Setting up the tiff files microscope BD Pathway ics & ids files tiff files Nikon 1 nd2 files ics & ids Scientist files tiff files File Server Nikon 2 nd2 files tiff files nd2 files Nikon 3 4 Leiden University. The university to discover.
  • 5. Workflow in HTS experiments Image Analysis Cell masks and motion trajectories Bioinformaticians High-throughput image analysis provides an automated quantification of dynamic cell behavior in both cellular level and structural level. Data Analysis Scientist By employing pattern recognition theorem, the system provides objective Data Map statistical conclusions to support biological hypothesizes. 5 Leiden University. The university to discover.
  • 6. Problems identified Software Component Duration of Data tools used s used in the (images and are not the experiments metadata) is suitable for experiment can take not linked the work are not months. properly. performed. integrated. There is no platform that can facilitate Scientists to learn from the experience. Lack of a Knowledge Discovery System. 6 Leiden University. The university to discover.
  • 7. Objectives Develop an integrated platform to automate data Design a management and database to image analysis of store almost all cytomic HTS data produced experiments. Establish an and used in automated the HTS workflow experiments. system of the HTS experiments. 7 Leiden University. The university to discover.
  • 8. Workflow of the HTS System 8 Leiden University. The university to discover.
  • 9. Which data should be stored in the database? Experiment details Users Plates & Wells HTS Database Results of Data Results of Image Analysis Analysis Raw images 9 Leiden University. The university to discover.
  • 10. Description of the System Architecture GUI layer HTS Analysis GUI Plate Design Image Analysis Pattern recognition tools API API API Web Services layer Glassfish - IIS Data storage / Scientific Super Processin g layer Computer 10 Leiden University. The university to discover.
  • 11. }  + easy to add/modify a record }  + only need to read in relevant data }  - might read in unnecessary }  - tuple writes require multiple data. accesses. }  Suitable for read-mostly, read-intensive, large data repositories. }  MonetDB is a open-source database system for high- performance applications in data mining, OLAP, GIS, XML Query, text and multimedia retrieval. MonetDB often achieves a 10-fold raw speed improvement for SQL and XQuery over competitor RDBMSs. by Peter Boncz (CWI) 11 Leiden University. The university to discover.
  • 12. ROW STORAGE COLUMN STORAGE STRIPE STRIPE by Peter Boncz (CWI) 12 Leiden University. The university to discover.
  • 13. HTS System Database Schema 13 Leiden University. The university to discover.
  • 14. How data is organized in the schema? Users Experiment details 14 Leiden University. The university to discover.
  • 15. How data is organized in the schema? Plates & Wells 15 Leiden University. The university to discover.
  • 16. How data is organized in the schema? Raw images 16 Leiden University. The university to discover.
  • 17. How data is organized in the schema? Results of Image Analysis Results of Data Analysis 17 Leiden University. The university to discover.
  • 18. How the platform works? Authentication Decision New idea Web User making interface (GUI) Users User Roles System §  Audit, maintenance of Administrator users, roles, conditions. §  Create Projects, Administrator Experiments, Plates, Upload the images from the microscope, and Plate layout design (GUI) perform data and image §  analysis. images from Upload the §  Every user need to log in in the Expert User platform and is administrator of their the microscope, and perform data and image own Projects-Experiments. §  analysis.data and image Perform §  A user can also grant to other users a Analyst User analysis and link the specific role (Administrator, Expert results to the experiment. User or Analyst user) and create a collaborative environment. 18 Leiden University. The university to discover.
  • 19. How the platform works? Web User interface (GUI) Administration option: •  Create / Edit / Delete users •  Assign Roles to a user 19 Leiden University. The university to discover.
  • 20. How the platform works? Web User interface (GUI) Project option: •  Create, Edit, Delete Projects •  Visualize Project’s metadata 20 Leiden University. The university to discover.
  • 21. How the platform works? Web User interface (GUI) Experiments option: •  Create, Edit, Delete Experiments •  Visualize Experiment’s metadata 21 Leiden University. The university to discover.
  • 22. How the platform works? Web User interface (GUI) Conditions option: •  Create, Edit, Delete, Import Coating parameters, Cell line tissues, Compounds, siRNA, and Antibodies/reagents. 22 Leiden University. The university to discover.
  • 23. How the platform works? Web User interface (GUI) Plates option: •  Create, Edit, Delete Plates •  Visualize Plate’s metadata. 23 Leiden University. The university to discover.
  • 24. How the platform works? Web User interface (GUI) Reports option: •  Perform custom queries through different datasets. •  Visualize predefined reports about Projects/ Experiments/ Plates/ Well metadata. 24 Leiden University. The university to discover.
  • 25. How the platform works? Web User interface (GUI) Analysis option: •  Invoke the Data and Image Analysis APIs . •  Visualize the results of the data and image analysis. 25 Leiden University. The university to discover.
  • 26. How the platform works? Steps in the new Workflow System §  Create a Project §  Create an Experiment §  Design the layout of a culture plate (4x6 wells, 6x8 wells , 8x12 wells, etc.). §  Assign the experimental conditions applied to the wells (drag and drop). §  Allow access to your project to other users assigning them a specific Wet lab experiment role. using the plate design Time-Lapse Image Sequence / HTS Static Images 26 Leiden University. The university to discover.
  • 27. How the platform works? Upload HTS Images HTS Time-Lapse §  The files generated by the Image Sequence / microscope have a standard Static Images named convention. §  Through the GUI, the images are uploaded to the platform. Raw Images §  The platform links the imported images to the experiment and the q  2D (XY): [1] Frame [1] Image [1..n] Channels plate designed previously. §  The platform also reads from the q  2D+T (XY+T): [1] Video [1..n] header of the files information Frames [1] Image [1..n] Channels associated to the microscope q  3D (XYZ): [1] Frame [1..n] Sections settings. [1] Image [1..n] Channels §  According to the microscope used, q  3D+T (XYZ+T): [1] Video [1..n] the image’s metadata has a Frame [1..n] Sections [1] Image particular structure that is also [1..n] Channel stored in the database. 27 Leiden University. The university to discover.
  • 28. How the platform works? Image Analysis Images uploaded §  Through the GUI it is possible to invoke the API for the image analysis process. §  As a result of the image analysis, auxiliary images are generated: binary masks or trajectories. §  These auxiliary images are linked to the plates – wells and raw images in the GUI. Auxiliary images Binary mask Trajectories 28 Leiden University. The university to discover.
  • 29. How the platform works? Data Analysis Images Analysis §  Measurements extracted from the image analysis are further analyzed using Patter recognition tools. §  Through the GUI it is possible to invoke the Binary mask Trajectories API for the data analysis process. §  As a result, it is generated CSV files which are stored in the database in order to have later graphical representations. Example Results Cell migration analysis Structure dynamic analysis 29 Leiden University. The university to discover.
  • 30. Conclusions }  Using this platform for image analysis and management in HTS it is possible to avoid typical man-made errors in the experiments. }  Using this system the time invested in post experiment analysis has been reduced considerably. Now takes less than a week to accomplish the data analysis that previously easily took more than a month with commercial software, or a year by manual observation. }  The platform allows end-users to perform high-profile cytomics with a minimum level of a prior experience on image analysis and machine learning. }  The system uses web services, therefore, the framework is very flexible as it allows the connection to other web services. }  The platform can eventually evolve into a sophisticated interdisciplinary platform for cytomics. }  Having the HTS information comprehensively organized in a sophisticated and scalable database is a fertile ground for knowledge discovery. 30 Leiden University. The university to discover.
  • 31. Questions ? Sponsors: 31 Leiden University. The university to discover.