SlideShare une entreprise Scribd logo
1  sur  17
Télécharger pour lire hors ligne
High Throughput, High Content 
Screening ‐ Automa6ng the Pipeline 
            Rajarshi Guha, Ph.D. 
    NIH Center for Transla:onal Therapeu:cs 

          San Francisco, January 2010 
Merging Screening Technologies 

High throughput screening       High content screening 
 •    Lead iden:fica:on          •    Phenotypic profiling 
 •    Single (few) read outs    •    Mul:ple parameters 
 •    High‐throughput           •    Moderate throughput 
 •    Moderate data volumes     •    Very large data volumes 


 •  We’d like to combine the technologies, to obtain rich 
    high‐resolu:on data at high speed 
 •  Is this feasible? What are the trade‐offs? 
Merging Screening Technologies 
•  A simple solu:on is to run a HTS & HCS as 
   separate, primary & secondary screens 
•  Alterna:vely – Wells to Cells 
  –  Integrate HTS & HCS in a single screen using a 
     combined plaYorm for robo:cs & real :me 
     automated HTS analy:cs 
  –  Selec%ve imaging of interes%ng wells 
Wells to Cells Workflow 
                                                                           Acquisition Client
                                                                                                                                                                   •  Sequen:al qHTS using laser 
      HTS
      Laser Scanning Cytometry
                                                                                                                                          Selective HCS
                                                                                                                                                     Microscopy
                                                                                                                                                                      scanning cytometry followed 
                      Raw data
                     Population Definition                                                                                   Population Definition
                                                                                                                                                     Images           by high‐res microscopy 
                                                                                                                                                                   •  Unit of work is a plate series  
                       Object segmentation                                                                             Object segmentation
                       Parameters selection                                               Selected                 Parameters selection
                                                                                          wells
                       Thresholds definition                                                                          Thresholds definition


                      Population distribution                                                               Objects characterization
                                                                                                Morphological properties, localization

                     Response Curve Calculation

                       Normalization
                       Correction
                       Fitting                                                    Decision
                                                                                                        Response Curve Calculation

                                                                                                                                  Normalization
                                                                                                                                      Correction
                                                                                                                                           Fitting
                                                                                                                                                                   •  The same aliquot is analyzed 
                                                                                                                                                                      by both techniques 
                       Curve classification                                       Analytics                                  Curve classification

                      Curve class, AC50, Efficacy                                                        Curve class, AC50, Efficacy


                      Active                             Inactive                                                             0
                0                                   0
                                                                                                             Activity (%)
Activity (%)




                                   Activity (%)




                                                                                                                            - 25
               -25                                 -25                           SAR




                                                                                                                                                                   •  A message based system 
                                                                                                                            - 50
               -50                                 -50
                               b                                                          HCS                               - 75
               -75                                 -75                   HTS                                            - 100
         -100                 a                   -100                                                                      - 9 - 8 - 7 - 6 - 5 -4
            -9 -8 -7 -6 -5 -4                        -9 -8 -7 -6 -5 -4
               Log[Compound], M                   Log[Compound], M                                                          Log[Compound], M
                                                                               qHTS Database




                                                                                                                                                                   •  The key is deciding which 
                                                                                 Confirmation




                                                                                                                                                                      wells go through the 
                                                                                                                                                                      workflow 
                                                                         Integrated Chemical
                                                                           Genomics Client
Informa:cs Pla<orm 




                                          InCell Layout  
                                              File 


•  Advanced correc:on and 
   normaliza:on methods 
•  Sophis:cated curve fi]ng 
   algorithm 
•  Good performance, allows 
   paralleliza:on of the en:re 
   workflow 
Why Messaging? 
•  A messaging architecture allows for significant 
   flexibility 
  –  Persistent, can be kept for process tracking, 
     repor:ng 
  –  Asynchronous, allows individual components of 
     the workflow to proceed at their own pace 
  –  Modular, new components can be introduced at 
     any :me without redesigning the whole workflow 
•  We employ Oracle AQ, but any message 
   queue can be employed 
qHTS & Curve Classes 




                                             Inac%ve 
•  Heuris:c assessment of the significance 
   of a concentra:on response curve 
•  Prior valida:on screens 
   allow us to decide which 




                                             Inconclusive 
   types of curves should 
   be selected 




                                             Ac%ve 
Well Selec:on Criteria 
•  Generally, pre‐determined (from valida:on 
   assays) 
•  Selec:on criteria implemented as Java code 
  –  Easy to adapt for different assays 
  –  Currently only makes use of the :tra:on curve 
     parameters  
  –  Could easily involve  
     •  Chemical structure 
     •  Enrichments 
     •  Predic:ve models 
Well to Cells Assays  
•  Cell cycle, cell transloca:on, DNA 
   repreplica:on 
•  All assays run against LOPAC1280  
•  Consistency between cytometry & microscopy 
   is measured by the R2 between log AC50’s 
  –  Cell cycle, 0.94 – 0.96 
  –  Cell transloca:on, 0.66 – 0.94 
  –  DNA rereplica:on, s:ll in progress  
Cell Transloca:on Example Hits 
Data Access & Browsing 
•  In development 
•  An integrated tool to  
   manage and disseminate  
   data relevant to chemical  
   genomics  
•  A consistent/simple interface to register/
   import, browse, search, and annotate data 
•  An effec:ve tool for confirma:on of HTS and/
   or HCS data 
Handling Mul:ple Pla<orms 
•  Current examples employ InCell hardware 
•  We also use Molecular Devices hardware 
•  As a result we have two orthogonal image 
   stores / databases 
•  Need to integrate them 
  –  Support seamless data browsing  across mul:ple 
     screens irrespec:ve of imaging plaYorm used 
  –  Support analy:cs external to vendor code 
Image Stores & REST 
•  We use the file‐system based image store 
   op:on for MetaXpress 
•  Allows us to repurpose it to store InCell 
   images 
•  Custom Python code to load InCell images into 
   the store and meta‐data into an Oracle DB 
A Unified Interface 
•  A client sees a single, simple interface to 
   screening image data 
       h;p://host/rest/protocol/plate/well/image 

•  Transparently extract  
   image data via the  
   MetaXpress database  
   or via custom code 
•  Currently the interface address image serving 
•  Unified metadata interface in the works 
Trade‐offs & Opportuni:es 
•  Automa:on reduces the ability to handle 
   unforeseen errors 
  –  Dispense errors and other plate problems 
  –  Well selec:on based on curve classes may need to 
     be modified on the fly 
•  Well selec:on does not consider SAR 
  –  Wells are selected independently of each other 
  –  If we could model SAR on the fly (or from 
     valida:on screens), we’d select mul:ple wells, to 
     obtain posi:ve and nega6ve results 
Conclusions 
•  Automated mul:‐stage screening is a leap 
   forward 
  –  Saves money and :me 
  –  Requires good analy:cs to be robust to on‐the‐fly 
     errors 
•  Integra:on at all layers (data / image store, 
   data types) is key to making sense out of the 
   data 
•  Would be nice to have clean vendor API’s! 
Acknowledgments 
•    Doug Auld 
•    Jim Inglese 
•    Ronald Johnson 
•    Sam Michael 
•    Trung Nguyen 
•    Steve Titus 
•    Jennifer Wichterman 

Contenu connexe

Tendances

DRUG DESIGN AND DISCOVERY
DRUG DESIGN AND DISCOVERY DRUG DESIGN AND DISCOVERY
DRUG DESIGN AND DISCOVERY Rahul B S
 
Immunoassay methods and their application in pharmaceutical analysis
Immunoassay methods and their application in pharmaceutical analysisImmunoassay methods and their application in pharmaceutical analysis
Immunoassay methods and their application in pharmaceutical analysisSibasishDey1
 
Instrumentation and application of LC-MS/MS in bioanalysis
Instrumentation and application of LC-MS/MS in bioanalysisInstrumentation and application of LC-MS/MS in bioanalysis
Instrumentation and application of LC-MS/MS in bioanalysisDr. Amit Patel
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screeningJeremy Ogbadu
 
ISOELECTRIC FOCUSING PPT - SLIDE SHARE
ISOELECTRIC FOCUSING PPT - SLIDE SHAREISOELECTRIC FOCUSING PPT - SLIDE SHARE
ISOELECTRIC FOCUSING PPT - SLIDE SHAREAditi Chaturvedi
 
High throughput screening
High throughput screening High throughput screening
High throughput screening RewariBhavya
 
Microsomal stability assay
Microsomal stability assayMicrosomal stability assay
Microsomal stability assaypavanireddy86
 
Computer aided drug design(CADD)
Computer aided drug design(CADD)Computer aided drug design(CADD)
Computer aided drug design(CADD)Sameh Abdel-ghany
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methodsratanvishwas
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screeningManish Kumar
 
Capillary Electrophoresis
Capillary ElectrophoresisCapillary Electrophoresis
Capillary ElectrophoresisSantoshi10
 
Cell culture technique
Cell culture techniqueCell culture technique
Cell culture techniqueirtizaashaq
 

Tendances (20)

DRUG DESIGN AND DISCOVERY
DRUG DESIGN AND DISCOVERY DRUG DESIGN AND DISCOVERY
DRUG DESIGN AND DISCOVERY
 
Immunoassay methods and their application in pharmaceutical analysis
Immunoassay methods and their application in pharmaceutical analysisImmunoassay methods and their application in pharmaceutical analysis
Immunoassay methods and their application in pharmaceutical analysis
 
Instrumentation and application of LC-MS/MS in bioanalysis
Instrumentation and application of LC-MS/MS in bioanalysisInstrumentation and application of LC-MS/MS in bioanalysis
Instrumentation and application of LC-MS/MS in bioanalysis
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screening
 
Toxicological screening
Toxicological screeningToxicological screening
Toxicological screening
 
ISOELECTRIC FOCUSING PPT - SLIDE SHARE
ISOELECTRIC FOCUSING PPT - SLIDE SHAREISOELECTRIC FOCUSING PPT - SLIDE SHARE
ISOELECTRIC FOCUSING PPT - SLIDE SHARE
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
docking
docking docking
docking
 
Biological databases
Biological databasesBiological databases
Biological databases
 
High throughput screening
High throughput screening High throughput screening
High throughput screening
 
HTS
HTSHTS
HTS
 
Microsomal stability assay
Microsomal stability assayMicrosomal stability assay
Microsomal stability assay
 
Computer aided drug design(CADD)
Computer aided drug design(CADD)Computer aided drug design(CADD)
Computer aided drug design(CADD)
 
Lead identification
Lead identification Lead identification
Lead identification
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 
Radio Immunoassay Notes
Radio Immunoassay NotesRadio Immunoassay Notes
Radio Immunoassay Notes
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screening
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
 
Capillary Electrophoresis
Capillary ElectrophoresisCapillary Electrophoresis
Capillary Electrophoresis
 
Cell culture technique
Cell culture techniqueCell culture technique
Cell culture technique
 

En vedette

HIGH THROUGHPUT SCREENING Technology
HIGH THROUGHPUT SCREENING  TechnologyHIGH THROUGHPUT SCREENING  Technology
HIGH THROUGHPUT SCREENING TechnologyUniversity Of Swabi
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screeningajivengan
 
Combinatorial chemistry
Combinatorial chemistry Combinatorial chemistry
Combinatorial chemistry Naresh Juttu
 
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...HCS Pharma
 
High-Content Analysis & Phenotypic Screening Conference 2016
High-Content Analysis & Phenotypic Screening Conference 2016High-Content Analysis & Phenotypic Screening Conference 2016
High-Content Analysis & Phenotypic Screening Conference 2016Jaime Hodges
 
A fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learningA fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learningPeng Cheng
 
Improving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin WilliamsImproving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin WilliamsQA or the Highway
 
Reyes and Shader Pipeline
Reyes and Shader PipelineReyes and Shader Pipeline
Reyes and Shader PipelineShuen-Huei Guan
 
Pipeline hazard
Pipeline hazardPipeline hazard
Pipeline hazardAJAL A J
 
Instruction pipeline: Computer Architecture
Instruction pipeline: Computer ArchitectureInstruction pipeline: Computer Architecture
Instruction pipeline: Computer ArchitectureInteX Research Lab
 
Optimizing the Graphics Pipeline with Compute, GDC 2016
Optimizing the Graphics Pipeline with Compute, GDC 2016Optimizing the Graphics Pipeline with Compute, GDC 2016
Optimizing the Graphics Pipeline with Compute, GDC 2016Graham Wihlidal
 
Computer architecture
Computer architecture Computer architecture
Computer architecture Ashish Kumar
 
Combinatorial chemistry
Combinatorial chemistryCombinatorial chemistry
Combinatorial chemistryBhaskar_Borkar
 

En vedette (20)

HIGH THROUGHPUT SCREENING Technology
HIGH THROUGHPUT SCREENING  TechnologyHIGH THROUGHPUT SCREENING  Technology
HIGH THROUGHPUT SCREENING Technology
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screening
 
Combinatorial chemistry
Combinatorial chemistry Combinatorial chemistry
Combinatorial chemistry
 
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...
Use of Automated High Content Analysis Applied To Assessment Of Primary DNA D...
 
High-Content Analysis & Phenotypic Screening Conference 2016
High-Content Analysis & Phenotypic Screening Conference 2016High-Content Analysis & Phenotypic Screening Conference 2016
High-Content Analysis & Phenotypic Screening Conference 2016
 
A fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learningA fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learning
 
Improving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin WilliamsImproving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin Williams
 
Reyes and Shader Pipeline
Reyes and Shader PipelineReyes and Shader Pipeline
Reyes and Shader Pipeline
 
Aca2 06 new
Aca2 06 newAca2 06 new
Aca2 06 new
 
pipelining
pipeliningpipelining
pipelining
 
Pipeline hazard
Pipeline hazardPipeline hazard
Pipeline hazard
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Instruction pipeline: Computer Architecture
Instruction pipeline: Computer ArchitectureInstruction pipeline: Computer Architecture
Instruction pipeline: Computer Architecture
 
Capsules -Pharmaceutics
Capsules -PharmaceuticsCapsules -Pharmaceutics
Capsules -Pharmaceutics
 
Drug delivery
Drug deliveryDrug delivery
Drug delivery
 
Optimizing the Graphics Pipeline with Compute, GDC 2016
Optimizing the Graphics Pipeline with Compute, GDC 2016Optimizing the Graphics Pipeline with Compute, GDC 2016
Optimizing the Graphics Pipeline with Compute, GDC 2016
 
Computer architecture
Computer architecture Computer architecture
Computer architecture
 
Pharmacogenomics
PharmacogenomicsPharmacogenomics
Pharmacogenomics
 
Combinatorial chemistry
Combinatorial chemistryCombinatorial chemistry
Combinatorial chemistry
 
Capsules
CapsulesCapsules
Capsules
 

Plus de Rajarshi Guha

Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomeRajarshi Guha
 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in contextRajarshi Guha
 
Pharos – A Torch to Use in Your Journey In the Dark Genome
Pharos – A Torch to Use in Your Journey In the Dark GenomePharos – A Torch to Use in Your Journey In the Dark Genome
Pharos – A Torch to Use in Your Journey In the Dark GenomeRajarshi Guha
 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMCRajarshi Guha
 
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformEnhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformRajarshi Guha
 
What can your library do for you?
What can your library do for you?What can your library do for you?
What can your library do for you?Rajarshi Guha
 
So I have an SD File … What do I do next?
So I have an SD File … What do I do next?So I have an SD File … What do I do next?
So I have an SD File … What do I do next?Rajarshi Guha
 
Characterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsCharacterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsRajarshi Guha
 
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATS
From Data to Action: Bridging Chemistry and Biology with Informatics at NCATSFrom Data to Action: Bridging Chemistry and Biology with Informatics at NCATS
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATSRajarshi Guha
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & RRajarshi Guha
 
Fingerprinting Chemical Structures
Fingerprinting Chemical StructuresFingerprinting Chemical Structures
Fingerprinting Chemical StructuresRajarshi Guha
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Rajarshi Guha
 
When the whole is better than the parts
When the whole is better than the partsWhen the whole is better than the parts
When the whole is better than the partsRajarshi Guha
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Rajarshi Guha
 
Pushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesPushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesRajarshi Guha
 
Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Rajarshi Guha
 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research DatabaseRajarshi Guha
 
Cloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsCloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsRajarshi Guha
 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleRajarshi Guha
 
Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Rajarshi Guha
 

Plus de Rajarshi Guha (20)

Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark Genome
 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in context
 
Pharos – A Torch to Use in Your Journey In the Dark Genome
Pharos – A Torch to Use in Your Journey In the Dark GenomePharos – A Torch to Use in Your Journey In the Dark Genome
Pharos – A Torch to Use in Your Journey In the Dark Genome
 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMC
 
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformEnhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
 
What can your library do for you?
What can your library do for you?What can your library do for you?
What can your library do for you?
 
So I have an SD File … What do I do next?
So I have an SD File … What do I do next?So I have an SD File … What do I do next?
So I have an SD File … What do I do next?
 
Characterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsCharacterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network Models
 
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATS
From Data to Action: Bridging Chemistry and Biology with Informatics at NCATSFrom Data to Action: Bridging Chemistry and Biology with Informatics at NCATS
From Data to Action : Bridging Chemistry and Biology with Informatics at NCATS
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
 
Fingerprinting Chemical Structures
Fingerprinting Chemical StructuresFingerprinting Chemical Structures
Fingerprinting Chemical Structures
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
 
When the whole is better than the parts
When the whole is better than the partsWhen the whole is better than the parts
When the whole is better than the parts
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
 
Pushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesPushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the Pipes
 
Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...
 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research Database
 
Cloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsCloudy with a Touch of Cheminformatics
Cloudy with a Touch of Cheminformatics
 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & Reproducible
 
Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?
 

High Throughput, High Content Screening - Automating the Pipeline

  • 1. High Throughput, High Content  Screening ‐ Automa6ng the Pipeline  Rajarshi Guha, Ph.D.  NIH Center for Transla:onal Therapeu:cs  San Francisco, January 2010 
  • 2. Merging Screening Technologies  High throughput screening  High content screening  •  Lead iden:fica:on  •  Phenotypic profiling  •  Single (few) read outs  •  Mul:ple parameters  •  High‐throughput  •  Moderate throughput  •  Moderate data volumes  •  Very large data volumes  •  We’d like to combine the technologies, to obtain rich  high‐resolu:on data at high speed  •  Is this feasible? What are the trade‐offs? 
  • 3. Merging Screening Technologies  •  A simple solu:on is to run a HTS & HCS as  separate, primary & secondary screens  •  Alterna:vely – Wells to Cells  –  Integrate HTS & HCS in a single screen using a  combined plaYorm for robo:cs & real :me  automated HTS analy:cs  –  Selec%ve imaging of interes%ng wells 
  • 4. Wells to Cells Workflow  Acquisition Client •  Sequen:al qHTS using laser  HTS Laser Scanning Cytometry Selective HCS Microscopy scanning cytometry followed  Raw data Population Definition Population Definition Images by high‐res microscopy  •  Unit of work is a plate series   Object segmentation Object segmentation Parameters selection Selected Parameters selection wells Thresholds definition Thresholds definition Population distribution Objects characterization Morphological properties, localization Response Curve Calculation Normalization Correction Fitting Decision Response Curve Calculation Normalization Correction Fitting •  The same aliquot is analyzed  by both techniques  Curve classification Analytics Curve classification Curve class, AC50, Efficacy Curve class, AC50, Efficacy Active Inactive 0 0 0 Activity (%) Activity (%) Activity (%) - 25 -25 -25 SAR •  A message based system  - 50 -50 -50 b HCS - 75 -75 -75 HTS - 100 -100 a -100 - 9 - 8 - 7 - 6 - 5 -4 -9 -8 -7 -6 -5 -4 -9 -8 -7 -6 -5 -4 Log[Compound], M Log[Compound], M Log[Compound], M qHTS Database •  The key is deciding which  Confirmation wells go through the  workflow  Integrated Chemical Genomics Client
  • 5. Informa:cs Pla<orm  InCell Layout   File  •  Advanced correc:on and  normaliza:on methods  •  Sophis:cated curve fi]ng  algorithm  •  Good performance, allows  paralleliza:on of the en:re  workflow 
  • 6. Why Messaging?  •  A messaging architecture allows for significant  flexibility  –  Persistent, can be kept for process tracking,  repor:ng  –  Asynchronous, allows individual components of  the workflow to proceed at their own pace  –  Modular, new components can be introduced at  any :me without redesigning the whole workflow  •  We employ Oracle AQ, but any message  queue can be employed 
  • 7. qHTS & Curve Classes  Inac%ve  •  Heuris:c assessment of the significance  of a concentra:on response curve  •  Prior valida:on screens  allow us to decide which  Inconclusive  types of curves should  be selected  Ac%ve 
  • 8. Well Selec:on Criteria  •  Generally, pre‐determined (from valida:on  assays)  •  Selec:on criteria implemented as Java code  –  Easy to adapt for different assays  –  Currently only makes use of the :tra:on curve  parameters   –  Could easily involve   •  Chemical structure  •  Enrichments  •  Predic:ve models 
  • 9. Well to Cells Assays   •  Cell cycle, cell transloca:on, DNA  repreplica:on  •  All assays run against LOPAC1280   •  Consistency between cytometry & microscopy  is measured by the R2 between log AC50’s  –  Cell cycle, 0.94 – 0.96  –  Cell transloca:on, 0.66 – 0.94  –  DNA rereplica:on, s:ll in progress  
  • 11. Data Access & Browsing  •  In development  •  An integrated tool to   manage and disseminate   data relevant to chemical   genomics   •  A consistent/simple interface to register/ import, browse, search, and annotate data  •  An effec:ve tool for confirma:on of HTS and/ or HCS data 
  • 12. Handling Mul:ple Pla<orms  •  Current examples employ InCell hardware  •  We also use Molecular Devices hardware  •  As a result we have two orthogonal image  stores / databases  •  Need to integrate them  –  Support seamless data browsing  across mul:ple  screens irrespec:ve of imaging plaYorm used  –  Support analy:cs external to vendor code 
  • 13. Image Stores & REST  •  We use the file‐system based image store  op:on for MetaXpress  •  Allows us to repurpose it to store InCell  images  •  Custom Python code to load InCell images into  the store and meta‐data into an Oracle DB 
  • 14. A Unified Interface  •  A client sees a single, simple interface to  screening image data  h;p://host/rest/protocol/plate/well/image  •  Transparently extract   image data via the   MetaXpress database   or via custom code  •  Currently the interface address image serving  •  Unified metadata interface in the works 
  • 15. Trade‐offs & Opportuni:es  •  Automa:on reduces the ability to handle  unforeseen errors  –  Dispense errors and other plate problems  –  Well selec:on based on curve classes may need to  be modified on the fly  •  Well selec:on does not consider SAR  –  Wells are selected independently of each other  –  If we could model SAR on the fly (or from  valida:on screens), we’d select mul:ple wells, to  obtain posi:ve and nega6ve results 
  • 16. Conclusions  •  Automated mul:‐stage screening is a leap  forward  –  Saves money and :me  –  Requires good analy:cs to be robust to on‐the‐fly  errors  •  Integra:on at all layers (data / image store,  data types) is key to making sense out of the  data  •  Would be nice to have clean vendor API’s! 
  • 17. Acknowledgments  •  Doug Auld  •  Jim Inglese  •  Ronald Johnson  •  Sam Michael  •  Trung Nguyen  •  Steve Titus  •  Jennifer Wichterman