SlideShare une entreprise Scribd logo
1  sur  29
Dual-Event Machine Learning Models to Accelerate Drug
                      Discovery



Sean Ekins1,2*, Robert C. Reynolds3,4*, Hiyun Kim5, Mi-Sun Koo5, Marilyn
Ekonomidis5, Meliza Talaue5, Steve D. Paget5, Lisa K. Woolhiser6, Anne J.
Lenaerts6, Barry A. Bunin1, Nancy Connell5 and Joel S. Freundlich5,7*
1CollaborativeDrug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame, CA 94010, USA.
2Collaborationsin Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC 27526, USA.
3Southern Research Institute, 2000 Ninth Avenue South, Birmingham, AL 35205, USA.
4Current address: University of Alabama at Birmingham, College of Arts and Sciences , Department of Chemistry, 1530 3 rd

Avenue South, Birmingham, Alabama 35294-1240, USA.
5Department of Medicine, Center for Emerging and Reemerging Pathogens, UMDNJ – New Jersey Medical School, 185 South

Orange Avenue Newark, NJ 07103, USA.
6Department of Microbiology, Immunology and Pathology, Colorado State University, 200 West Lake Street, CO 80523, USA.
7Department of Pharmacology & Physiology, UMDNJ – New Jersey Medical School, 185 South Orange Avenue Newark, NJ

07103, USA.
                                                             .
TB facts


   Tuberculosis kills 1.6-1.7m/yr (~1 every 8 seconds)
   1/3rd of worlds population infected!!!!

   Multi drug resistance in 4.3% of cases
   Extensively drug resistant increasing incidence
   one new drug (bedaquiline) in 40 yrs        streptomycin (1943)
                                                  para-aminosalicyclic acid (1949)
                                                  isoniazid (1952) (Bayer, Roche, Squibb)
                                                  pyrazinamide (1954)
                                                  cycloserine (1955)
                                                  ethambutol (1962)
                                                  rifampicin (1967)
   Drug-drug interactions and Co-morbidity with HIV

   Collaboration between groups is rare
   These groups may work on existing or new targets
   Use of computational methods with TB is rare
~ 20 public datasets for TB
Including Novartis data on TB hits
>300,000 cpds

Patents, Papers Annotated by CDD

Open to browse by anyone

 http://www.collaborativedrug.
 com/register
Phenotypic screening HTS Hit rates




                                  SRI papers




           Usually less than 1%
Bayesian Model Construction: Mtb Whole-Cell HTS




• Learning from 3,779 compounds from an NIAID library
       - active: MIC < 5 M
       - inactive: MIC ≥ 5 M
Bayesian machine learning

Bayesian classification is a simple probabilistic classification model. It is based on
Bayes’ theorem




h is the hypothesis or model
d is the observed data
p(h) is the prior belief (probability of hypothesis h before observing any data)
p(d) is the data evidence (marginal probability of the data)
p(d|h) is the likelihood (probability of data d if hypothesis h is true)
p(h|d) is the posterior probability (probability of hypothesis h being true given the
observed data d)

A weight is calculated for each feature using a Laplacian-adjusted probability
estimate to account for the different sampling frequencies of different features.

The weights are summed to provide a probability estimate

Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
Novel Bayesian Models for Mtb Whole-Cell Efficacy


  SRI MLSMR 220K single point model
        active: ≥90% inhibition @ 10 M; inactive <90% inhibition @ 10 M

  SRI MLSMR 2.5K dose reponse model
        active: IC50 ≤ 5 M; inactive: IC50 > 5 M

   Model Building and Validation
 • Laplacian-corrected Bayesian classifier models (Accelrys Discovery Studio)

 • Molecular function class fingerprints of maximum diameter 6 (FCFP_6)

 • Simple molecular descriptors chosen including AlogP, molecular weight,
 # rotatable bonds, # rings, # hydrogen bond acceptors, # hydrogen bond
 donors, and polar surface area

 • Validated w/ leave-one-out cross-validation & leave-50%-out cross-validation
Ekins, S. et al., Mol. Biosyst. 2010, 6, 840-51; Ekins, S. et al., Mol. Biosyst. 2010, 6, 2316-2324.
Bayesian Classification TB Models

     We can use the public data for machine learning model building
     Using Discovery Studio Bayesian model
     Leave out 50% x 100




     Dateset                        Internal
   (number of        External         ROC
   molecules)       ROC Score        Score         Concordance         Specificity    Sensitivity
     MLSMR
 All single point
      screen
 (N = 220463)        0.86    0      0.86    0       78.56   1.86      78.59   1.94    77.13     2.26
    MLSMR
dose response set
   (N = 2273)       0.73    0.01   0.75    0.01     66.85   4.06      67.21   7.05    65.47     7.96

                                                  Ekins et al., Mol BioSyst, 6: 840-851, 2010
Bayesian Classification Models for TB

      Laplacian-corrected Bayesian classifier models were generated using FCFP-6 and
      simple descriptors. 2 models 220,000 and >2000 compounds
                         active compounds with MIC < 5uM


Good




Bad




                                              Ekins et al., Mol BioSyst, 6: 840-851, 2010
Bayesian Classification Dose response

Good




Bad




                      Ekins et al., Mol BioSyst, 6: 840-851, 2010
Additional test sets

 1702 hits in >100K cpds          34 hits in 248 cpds              21 hits in 2108 cpds

  100K library                    Novartis Data                   FDA drugs




Suggests models can predict data from the same and independent labs
Enrichments 4-10 fold
Initial enrichment – enables screening few compounds to find actives
Ekins et al., Mol BioSyst, 6: 840-851, 2010   Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011.
Testing to date has been retrospective

      Can we use our models to select compounds and influence
      design?

      Prospective prediction

      Do it enough times to show robustness




 Testing prospectively
Bayesian Machine Learning Models – testing

Ranked Asinex 25K library with MLSMR dose response model –

Bayesian score range -28.4 – 15.3

99 compounds screened (Bayesian score 9.4 – 15.3).

12 cpds were identified with IC90 < 30 ug/mL

~12% hit rate



Most active SYN 22269076

Pyrazolo[1,5-a]pyrimidine
                             Bayesian
                                         14.9         10.6   9.8
IC50 1.1ug/ml (3.2uM)           Score
Some follow up compounds for the Asinex hit
Principal component analysis (PCA) of all SRI data sets to
illustrate overlap of chemistry space using the datasets

from this study (red TAACF-CB2, green = MLSMR, black =
kinase dataset), 3PCs explain 72% of the variance.
Dual-Event models
High-throughput    Mtb screening                        Bayesian Machine Learning Mtb Model
  phenotypic      molecule database
 Mtb screening

                            S                        Descriptors + Bioactivity (+Cytotoxicity)

                       N
                       H
                                N




                                      Molecule Database
                                  (e.g. GSK malaria actives)
                           virtually scored using Bayesian Models



                                Top scoring molecules assayed for                  New bioactivity data
                                      Mtb growth inhibition                        may enhance models




                                                                           S

                                        Identify in vitro hits         N
                                                                       H
                                                                               N



                                    Increased hit/lead discovery efficiency
Dual-Event models


Become more stringent in what we call an ACTIVE


IC90 < 10 ug/ml (CB2) or <10uM (MLSMR) and a selectivity index (SI)
greater than ten.

SI was calculated as SI = CC50/IC90 where CC50 is the concentration that
resulted in 50% inhibition of Vero cells (CC50).
Bayesian Classification TB Models

    Single pt ROC XV AUC            = 0.88
    Dose resp                       = 0.78
    Dose resp + cyto                = 0.86

     Dateset          External        Internal
   (number of          ROC              ROC
   molecules)          Score           Score        Concordance    Specificity    Sensitivity
      MLSMR
  All single point
       screen
  (N = 220463)        0.86    0       0.86    0     78.56   1.86   78.59   1.94   77.13   2.26
    MLSMR
dose response set
   (N = 2273)        0.73    0.01    0.75    0.01   66.85   4.06   67.21   7.05   65.47   7.96
NEW Dose resp and
 cytotoxicity (N =
       2273)         0.82    0.02    0.84    0.02   82.61   4.68   83.91   5.48   65.99   7.47


                                                      Ekins et al., PLOSONE, in press 2013
A new dataset to use as a test set for models
Bayesian Machine Learning Models – blind testing




           Dual event model shows increased enrichment
                             Ekins et al.,Chem Biol 20, 370–378, 2013
Prospective prediction of antimalarial compounds vs Mtb




           1. Virtually screen 13,533-member GSK antimalarial hit library
           2. Model = SRI TAACF-CB dose response + cytotoxicity model
           3. Top 46 commercially available compounds visually inspected
           4. 7 compounds chosen for Mtb testing based on
                    - drug-likeness
                    - chemotype diversity

      Dateset
                         External   Internal ROC
(number of molecules)   ROC Score       Score      Concordance    Specificity   Sensitivity

TAACF-CB2 IC90 and
                          0.64       0.59 ± 0.01    0.63 ± 0.02   55.74 ±1.31   61.61 ± 8.96
 cytotoxicity (1783)
Prospective prediction of antimalarial compounds vs Mtb




                    7 tested, 5 active (70% hit rate)

                                  Ekins et al.,Chem Biol 20, 370–378, 2013
Bayesian Model Follow-up: Do we have a lead?
                                               • BAS00521003/ TCMDC-125802 reported to be a
                                               P. falciparum lactate dehydrogenase inhibitor
                                               • Only one report (that we were unaware of when
                                               picking the compound) of antitubercular activity
                                               from 1969
                                                 - solid agar MIC = 1 g/mL (“wild strain”)
        MIC of 0.0625 ug/mL                      - “no activity” in mouse model up to 400 mg/kg
                                                 - however, activity was solely judged by
                                                   extension of survival!




                                                           SRI MLSMR 220K library contains:
                                                           107 hits with this substructure
                                                            - 3 nitrofuryl hydrazones
                                                            - 10 furyl hydrazones
                                                            - 19 nitrophenyl hydrazones
Bruhin, H. et al., J. Pharm. Pharmac. 1969, 21, 423-433.
Maddry et al., Tuberculosis 2009, 89, 354.
                                                           32 inactives with this substructure
Efficacy Profiling of TCMDC-125802

                          • 64X MIC affords 6 logs of kill
                          • Resistance and/or drug
                            instability beyond 14 d

                            Vero cells : CC50 = 4.0
                             g/mL

                            Selectivity Index SI =
                            CC50/MICMtb = 16 – 64




                  Ekins et al.,Chem Biol 20, 370–378, 2013
In vivo Evaluation of TCMDC-125802
Goal: Evaluate the in vivo safety and efficacy of JSF-2019 in mouse
models of TB infection
     Step #2: 7-day Maximum Tolerated Dose study in mice
       - formulated in 0.5% methyl cellulose
       - single dose p.o. @ 30, 100, and 300 mg/kg in B6D2F1 mice
       - no overt toxicity
     Step #3: evaluation in GKO mouse model of TB infection
       - Five 12 week-old female C57BL/6 mice infected with Mtb Erdman via
        low-dose aerosol exposure

      - Days 16 – 23 : dosed w/ 300 mg/kg JSF-2019 p.o. OR 25 mg/kg INH
      OR untreated

      - Sacrificed day 24 and lung and spleen homogenates were cultured

     - no difference in lungs and spleens vs. control


                                                        Lisa Woolhiser and Anne Lenaerts (CSU)
Why screen cpds?




Ballel et al., Fueling Open-Source drug discovery: 177 small-   http://goo.gl/UujRX
molecule leads against tuberculosis ChemMedChem 2013.

GSK screened 2M compounds – 3 yrs ago
Bayesian predictions for 14,000 cpds exposed 11 / 15 (73%)
correct when paper was published
Further prospective validation example
Conclusions

>38,000 molecules screened through Bayesian models

106 molecules were tested in vitro

17 actives were identified (22.5 % hit rate)


Identified several novel potent lead series with good cytotoxicity & selectivity
Some series have been missed in SRI screening data

Took a non toxic molecule quickly in vivo – Have made analogs in attempt to
overcome in vivo efficacy failure

All Bayesian models shared with Abbott and Merck in TB Accelerator project

All Bayesian models are freely available to researchers




                                               Ekins et al.,Chem Biol 20, 370–378, 2013
Acknowledgments

                               Joel Freundlich Lab



   The project described was supported by Award Number R43 LM011152-01
    “Biocomputation across distributed private datasets to enhance drug
    discovery” from the National Library of Medicine (PI: S. Ekins)

   Accelrys

   The CDD TB has been developed thanks to funding from the Bill and
    Melinda Gates Foundation (Grant#49852 “Collaborative drug discovery for
    TB through a novel database of SAR data optimized to promote data
    archiving and sharing”)

   Allen Casey (IDRI)
You can find me @...                                                CDD Booth 205
PAPER ID: 13433
PAPER TITLE: “Dispensing processes profoundly impact biological assays and computational and statistical
analyses”
April 8th 8.35am Room 349

PAPER ID: 14750
PAPER TITLE: “Enhancing High Throughput Screening For Mycobacterium tuberculosis Drug Discovery
Using Bayesian Models”
April 9th 1.30pm Room 353
PAPER ID: 21524

PAPER TITLE: “Navigating between patents, papers, abstracts and databases using public sources and
tools”
April 9th 3.50pm Room 350
PAPER ID: 13358

PAPER TITLE: “TB Mobile: Appifying Data on Anti-tuberculosis Molecule Targets”
April 10th 8.30am Room 357

PAPER ID: 13382
PAPER TITLE: “Challenges and recommendations for obtaining chemical structures of industry-provided
repurposing candidates”
April 10th 10.20am Room 350

PAPER ID: 13438
PAPER TITLE: “Dual-event machine learning models to accelerate drug discovery”
April 10th 3.05 pm Room 350

Contenu connexe

Tendances

IOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacy
IOSRPHR(www.iosrphr.org) IOSR Journal of PharmacyIOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacy
IOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacyiosrphr_editor
 
A miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformA miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformQing Chen
 
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...drboon
 
acs talk open source drug discovery
acs talk open source drug discoveryacs talk open source drug discovery
acs talk open source drug discoverySean Ekins
 
Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Sage Base
 
Fragments for drug discovery and chemical biology
Fragments for drug discovery and chemical biologyFragments for drug discovery and chemical biology
Fragments for drug discovery and chemical biologywarwick_amr
 
Xie et al 2016 risk analysis
Xie et al 2016 risk analysisXie et al 2016 risk analysis
Xie et al 2016 risk analysisMaria Isabel
 
CYP121 Drug Discovery (M. tuberculosis)
CYP121 Drug Discovery (M. tuberculosis)CYP121 Drug Discovery (M. tuberculosis)
CYP121 Drug Discovery (M. tuberculosis)Anthony Coyne
 
As munich 2013 4.0
As munich 2013 4.0As munich 2013 4.0
As munich 2013 4.0mmoreno2013
 
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paper
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paperCytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paper
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paperAndrea Ujvari
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Rajarshi Guha
 
Bioinformatica t9-t10-biocheminformatics
Bioinformatica t9-t10-biocheminformaticsBioinformatica t9-t10-biocheminformatics
Bioinformatica t9-t10-biocheminformaticsProf. Wim Van Criekinge
 
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...Erin Vaughan
 

Tendances (17)

IOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacy
IOSRPHR(www.iosrphr.org) IOSR Journal of PharmacyIOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacy
IOSRPHR(www.iosrphr.org) IOSR Journal of Pharmacy
 
A miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformA miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platform
 
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...
Potential of Benzyl Adenine, Naphthalene Acetic Acid and Sucrose Concentratio...
 
interchangeability
interchangeabilityinterchangeability
interchangeability
 
acs talk open source drug discovery
acs talk open source drug discoveryacs talk open source drug discovery
acs talk open source drug discovery
 
Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20Stephen Friend MIT 2011-10-20
Stephen Friend MIT 2011-10-20
 
Fragments for drug discovery and chemical biology
Fragments for drug discovery and chemical biologyFragments for drug discovery and chemical biology
Fragments for drug discovery and chemical biology
 
Xie et al 2016 risk analysis
Xie et al 2016 risk analysisXie et al 2016 risk analysis
Xie et al 2016 risk analysis
 
Dispensing Processes Profoundly Impact Biological Assays and Computational an...
Dispensing Processes Profoundly Impact Biological Assays and Computational an...Dispensing Processes Profoundly Impact Biological Assays and Computational an...
Dispensing Processes Profoundly Impact Biological Assays and Computational an...
 
CYP121 Drug Discovery (M. tuberculosis)
CYP121 Drug Discovery (M. tuberculosis)CYP121 Drug Discovery (M. tuberculosis)
CYP121 Drug Discovery (M. tuberculosis)
 
As munich 2013 4.0
As munich 2013 4.0As munich 2013 4.0
As munich 2013 4.0
 
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paper
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paperCytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paper
Cytoscan_Copy_Number_Confirmation_with_SYBR_Green_qPCR_white_paper
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
 
my paper 2015
my paper 2015my paper 2015
my paper 2015
 
Bioinformatica t9-t10-biocheminformatics
Bioinformatica t9-t10-biocheminformaticsBioinformatica t9-t10-biocheminformatics
Bioinformatica t9-t10-biocheminformatics
 
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...
Susceptibility profiles of Enterobacteriaceae to temocillin, piperacillin-taz...
 
Kwon et al jbs 2011
Kwon et al jbs 2011Kwon et al jbs 2011
Kwon et al jbs 2011
 

Similaire à dual-event machine learning models to accelerate drug discovery

Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...
Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...
Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...Sean Ekins
 
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...Sean Ekins
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoverySean Ekins
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Sean Ekins
 
Acs combining cheminformatics methods and pathway analysis to identify molecu...
Acs combining cheminformatics methods and pathway analysis to identify molecu...Acs combining cheminformatics methods and pathway analysis to identify molecu...
Acs combining cheminformatics methods and pathway analysis to identify molecu...Sean Ekins
 
Avoid overfitting in precision medicine: How to use cross-validation to relia...
Avoid overfitting in precision medicine: How to use cross-validation to relia...Avoid overfitting in precision medicine: How to use cross-validation to relia...
Avoid overfitting in precision medicine: How to use cross-validation to relia...Nicole Krämer
 
Quantitative mass spec imaging
Quantitative mass spec imagingQuantitative mass spec imaging
Quantitative mass spec imagingSteve Cepa
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16Sean Ekins
 
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental DataANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Datao_devinyak
 
AsedaSciences SLAS2017 poster presentation
AsedaSciences SLAS2017 poster presentationAsedaSciences SLAS2017 poster presentation
AsedaSciences SLAS2017 poster presentationAndrew Bieberich
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Sean Ekins
 
BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011Sean Ekins
 
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big Data
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big DataCIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big Data
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big DataDaniel Russo
 
Analytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuAnalytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuQian Liu, phD
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNAUlises Urzua
 
Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Gerald Lushington
 

Similaire à dual-event machine learning models to accelerate drug discovery (20)

Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...
Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...
Enhancing high throughput screeing for mycobacterium tuberculosis drug discov...
 
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug Discovery
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery
 
Acs combining cheminformatics methods and pathway analysis to identify molecu...
Acs combining cheminformatics methods and pathway analysis to identify molecu...Acs combining cheminformatics methods and pathway analysis to identify molecu...
Acs combining cheminformatics methods and pathway analysis to identify molecu...
 
Avoid overfitting in precision medicine: How to use cross-validation to relia...
Avoid overfitting in precision medicine: How to use cross-validation to relia...Avoid overfitting in precision medicine: How to use cross-validation to relia...
Avoid overfitting in precision medicine: How to use cross-validation to relia...
 
Quantitative mass spec imaging
Quantitative mass spec imagingQuantitative mass spec imaging
Quantitative mass spec imaging
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
Gdt 2-126
Gdt 2-126Gdt 2-126
Gdt 2-126
 
Gdt 2-126 (1)
Gdt 2-126 (1)Gdt 2-126 (1)
Gdt 2-126 (1)
 
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental DataANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
ANTICANCER THIAZOLIDINONES DESIGN: Mining of 60-Cell Lines Experimental Data
 
AsedaSciences SLAS2017 poster presentation
AsedaSciences SLAS2017 poster presentationAsedaSciences SLAS2017 poster presentation
AsedaSciences SLAS2017 poster presentation
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011BioIT Drug induced liver injury talk 2011
BioIT Drug induced liver injury talk 2011
 
poster
posterposter
poster
 
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big Data
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big DataCIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big Data
CIIProCluster: Developing Read-Across Predictive Toxicity Models Using Big Data
 
Analytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liuAnalytical chemistry 2013 qian liu
Analytical chemistry 2013 qian liu
 
Genomica - Microarreglos de DNA
Genomica - Microarreglos de DNAGenomica - Microarreglos de DNA
Genomica - Microarreglos de DNA
 
Sam Case Study
Sam Case StudySam Case Study
Sam Case Study
 
Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...
 

Plus de Sean Ekins

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptxSean Ekins
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Sean Ekins
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...Sean Ekins
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Sean Ekins
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseSean Ekins
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Sean Ekins
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueSean Ekins
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchSean Ekins
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2Sean Ekins
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3 Sean Ekins
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2 Sean Ekins
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1 Sean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...Sean Ekins
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b igSean Ekins
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - Sean Ekins
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016Sean Ekins
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientistsSean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsSean Ekins
 

Plus de Sean Ekins (20)

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptx
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas Disease
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issue
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b ig
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models -
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientists
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
 

Dernier

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 

Dernier (20)

Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 

dual-event machine learning models to accelerate drug discovery

  • 1. Dual-Event Machine Learning Models to Accelerate Drug Discovery Sean Ekins1,2*, Robert C. Reynolds3,4*, Hiyun Kim5, Mi-Sun Koo5, Marilyn Ekonomidis5, Meliza Talaue5, Steve D. Paget5, Lisa K. Woolhiser6, Anne J. Lenaerts6, Barry A. Bunin1, Nancy Connell5 and Joel S. Freundlich5,7* 1CollaborativeDrug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame, CA 94010, USA. 2Collaborationsin Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC 27526, USA. 3Southern Research Institute, 2000 Ninth Avenue South, Birmingham, AL 35205, USA. 4Current address: University of Alabama at Birmingham, College of Arts and Sciences , Department of Chemistry, 1530 3 rd Avenue South, Birmingham, Alabama 35294-1240, USA. 5Department of Medicine, Center for Emerging and Reemerging Pathogens, UMDNJ – New Jersey Medical School, 185 South Orange Avenue Newark, NJ 07103, USA. 6Department of Microbiology, Immunology and Pathology, Colorado State University, 200 West Lake Street, CO 80523, USA. 7Department of Pharmacology & Physiology, UMDNJ – New Jersey Medical School, 185 South Orange Avenue Newark, NJ 07103, USA. .
  • 2. TB facts  Tuberculosis kills 1.6-1.7m/yr (~1 every 8 seconds)  1/3rd of worlds population infected!!!!  Multi drug resistance in 4.3% of cases  Extensively drug resistant increasing incidence  one new drug (bedaquiline) in 40 yrs streptomycin (1943) para-aminosalicyclic acid (1949) isoniazid (1952) (Bayer, Roche, Squibb) pyrazinamide (1954) cycloserine (1955) ethambutol (1962) rifampicin (1967)  Drug-drug interactions and Co-morbidity with HIV  Collaboration between groups is rare  These groups may work on existing or new targets  Use of computational methods with TB is rare
  • 3. ~ 20 public datasets for TB Including Novartis data on TB hits >300,000 cpds Patents, Papers Annotated by CDD Open to browse by anyone http://www.collaborativedrug. com/register
  • 4. Phenotypic screening HTS Hit rates SRI papers Usually less than 1%
  • 5. Bayesian Model Construction: Mtb Whole-Cell HTS • Learning from 3,779 compounds from an NIAID library - active: MIC < 5 M - inactive: MIC ≥ 5 M
  • 6. Bayesian machine learning Bayesian classification is a simple probabilistic classification model. It is based on Bayes’ theorem h is the hypothesis or model d is the observed data p(h) is the prior belief (probability of hypothesis h before observing any data) p(d) is the data evidence (marginal probability of the data) p(d|h) is the likelihood (probability of data d if hypothesis h is true) p(h|d) is the posterior probability (probability of hypothesis h being true given the observed data d) A weight is calculated for each feature using a Laplacian-adjusted probability estimate to account for the different sampling frequencies of different features. The weights are summed to provide a probability estimate Ekins, Williams and Xu, Drug Metab Dispos 38: 2302-2308, 2010
  • 7. Novel Bayesian Models for Mtb Whole-Cell Efficacy SRI MLSMR 220K single point model active: ≥90% inhibition @ 10 M; inactive <90% inhibition @ 10 M SRI MLSMR 2.5K dose reponse model active: IC50 ≤ 5 M; inactive: IC50 > 5 M Model Building and Validation • Laplacian-corrected Bayesian classifier models (Accelrys Discovery Studio) • Molecular function class fingerprints of maximum diameter 6 (FCFP_6) • Simple molecular descriptors chosen including AlogP, molecular weight, # rotatable bonds, # rings, # hydrogen bond acceptors, # hydrogen bond donors, and polar surface area • Validated w/ leave-one-out cross-validation & leave-50%-out cross-validation Ekins, S. et al., Mol. Biosyst. 2010, 6, 840-51; Ekins, S. et al., Mol. Biosyst. 2010, 6, 2316-2324.
  • 8. Bayesian Classification TB Models We can use the public data for machine learning model building Using Discovery Studio Bayesian model Leave out 50% x 100 Dateset Internal (number of External ROC molecules) ROC Score Score Concordance Specificity Sensitivity MLSMR All single point screen (N = 220463) 0.86 0 0.86 0 78.56 1.86 78.59 1.94 77.13 2.26 MLSMR dose response set (N = 2273) 0.73 0.01 0.75 0.01 66.85 4.06 67.21 7.05 65.47 7.96 Ekins et al., Mol BioSyst, 6: 840-851, 2010
  • 9. Bayesian Classification Models for TB Laplacian-corrected Bayesian classifier models were generated using FCFP-6 and simple descriptors. 2 models 220,000 and >2000 compounds active compounds with MIC < 5uM Good Bad Ekins et al., Mol BioSyst, 6: 840-851, 2010
  • 10. Bayesian Classification Dose response Good Bad Ekins et al., Mol BioSyst, 6: 840-851, 2010
  • 11. Additional test sets 1702 hits in >100K cpds 34 hits in 248 cpds 21 hits in 2108 cpds 100K library Novartis Data FDA drugs Suggests models can predict data from the same and independent labs Enrichments 4-10 fold Initial enrichment – enables screening few compounds to find actives Ekins et al., Mol BioSyst, 6: 840-851, 2010 Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011.
  • 12. Testing to date has been retrospective Can we use our models to select compounds and influence design? Prospective prediction Do it enough times to show robustness Testing prospectively
  • 13. Bayesian Machine Learning Models – testing Ranked Asinex 25K library with MLSMR dose response model – Bayesian score range -28.4 – 15.3 99 compounds screened (Bayesian score 9.4 – 15.3). 12 cpds were identified with IC90 < 30 ug/mL ~12% hit rate Most active SYN 22269076 Pyrazolo[1,5-a]pyrimidine Bayesian 14.9 10.6 9.8 IC50 1.1ug/ml (3.2uM) Score
  • 14. Some follow up compounds for the Asinex hit
  • 15. Principal component analysis (PCA) of all SRI data sets to illustrate overlap of chemistry space using the datasets from this study (red TAACF-CB2, green = MLSMR, black = kinase dataset), 3PCs explain 72% of the variance.
  • 16. Dual-Event models High-throughput Mtb screening Bayesian Machine Learning Mtb Model phenotypic molecule database Mtb screening S Descriptors + Bioactivity (+Cytotoxicity) N H N Molecule Database (e.g. GSK malaria actives) virtually scored using Bayesian Models Top scoring molecules assayed for New bioactivity data Mtb growth inhibition may enhance models S Identify in vitro hits N H N Increased hit/lead discovery efficiency
  • 17. Dual-Event models Become more stringent in what we call an ACTIVE IC90 < 10 ug/ml (CB2) or <10uM (MLSMR) and a selectivity index (SI) greater than ten. SI was calculated as SI = CC50/IC90 where CC50 is the concentration that resulted in 50% inhibition of Vero cells (CC50).
  • 18. Bayesian Classification TB Models Single pt ROC XV AUC = 0.88 Dose resp = 0.78 Dose resp + cyto = 0.86 Dateset External Internal (number of ROC ROC molecules) Score Score Concordance Specificity Sensitivity MLSMR All single point screen (N = 220463) 0.86 0 0.86 0 78.56 1.86 78.59 1.94 77.13 2.26 MLSMR dose response set (N = 2273) 0.73 0.01 0.75 0.01 66.85 4.06 67.21 7.05 65.47 7.96 NEW Dose resp and cytotoxicity (N = 2273) 0.82 0.02 0.84 0.02 82.61 4.68 83.91 5.48 65.99 7.47 Ekins et al., PLOSONE, in press 2013
  • 19. A new dataset to use as a test set for models
  • 20. Bayesian Machine Learning Models – blind testing Dual event model shows increased enrichment Ekins et al.,Chem Biol 20, 370–378, 2013
  • 21. Prospective prediction of antimalarial compounds vs Mtb 1. Virtually screen 13,533-member GSK antimalarial hit library 2. Model = SRI TAACF-CB dose response + cytotoxicity model 3. Top 46 commercially available compounds visually inspected 4. 7 compounds chosen for Mtb testing based on - drug-likeness - chemotype diversity Dateset External Internal ROC (number of molecules) ROC Score Score Concordance Specificity Sensitivity TAACF-CB2 IC90 and 0.64 0.59 ± 0.01 0.63 ± 0.02 55.74 ±1.31 61.61 ± 8.96 cytotoxicity (1783)
  • 22. Prospective prediction of antimalarial compounds vs Mtb 7 tested, 5 active (70% hit rate) Ekins et al.,Chem Biol 20, 370–378, 2013
  • 23. Bayesian Model Follow-up: Do we have a lead? • BAS00521003/ TCMDC-125802 reported to be a P. falciparum lactate dehydrogenase inhibitor • Only one report (that we were unaware of when picking the compound) of antitubercular activity from 1969 - solid agar MIC = 1 g/mL (“wild strain”) MIC of 0.0625 ug/mL - “no activity” in mouse model up to 400 mg/kg - however, activity was solely judged by extension of survival! SRI MLSMR 220K library contains: 107 hits with this substructure - 3 nitrofuryl hydrazones - 10 furyl hydrazones - 19 nitrophenyl hydrazones Bruhin, H. et al., J. Pharm. Pharmac. 1969, 21, 423-433. Maddry et al., Tuberculosis 2009, 89, 354. 32 inactives with this substructure
  • 24. Efficacy Profiling of TCMDC-125802 • 64X MIC affords 6 logs of kill • Resistance and/or drug instability beyond 14 d Vero cells : CC50 = 4.0 g/mL Selectivity Index SI = CC50/MICMtb = 16 – 64 Ekins et al.,Chem Biol 20, 370–378, 2013
  • 25. In vivo Evaluation of TCMDC-125802 Goal: Evaluate the in vivo safety and efficacy of JSF-2019 in mouse models of TB infection Step #2: 7-day Maximum Tolerated Dose study in mice - formulated in 0.5% methyl cellulose - single dose p.o. @ 30, 100, and 300 mg/kg in B6D2F1 mice - no overt toxicity Step #3: evaluation in GKO mouse model of TB infection - Five 12 week-old female C57BL/6 mice infected with Mtb Erdman via low-dose aerosol exposure - Days 16 – 23 : dosed w/ 300 mg/kg JSF-2019 p.o. OR 25 mg/kg INH OR untreated - Sacrificed day 24 and lung and spleen homogenates were cultured - no difference in lungs and spleens vs. control Lisa Woolhiser and Anne Lenaerts (CSU)
  • 26. Why screen cpds? Ballel et al., Fueling Open-Source drug discovery: 177 small- http://goo.gl/UujRX molecule leads against tuberculosis ChemMedChem 2013. GSK screened 2M compounds – 3 yrs ago Bayesian predictions for 14,000 cpds exposed 11 / 15 (73%) correct when paper was published Further prospective validation example
  • 27. Conclusions >38,000 molecules screened through Bayesian models 106 molecules were tested in vitro 17 actives were identified (22.5 % hit rate) Identified several novel potent lead series with good cytotoxicity & selectivity Some series have been missed in SRI screening data Took a non toxic molecule quickly in vivo – Have made analogs in attempt to overcome in vivo efficacy failure All Bayesian models shared with Abbott and Merck in TB Accelerator project All Bayesian models are freely available to researchers Ekins et al.,Chem Biol 20, 370–378, 2013
  • 28. Acknowledgments Joel Freundlich Lab  The project described was supported by Award Number R43 LM011152-01 “Biocomputation across distributed private datasets to enhance drug discovery” from the National Library of Medicine (PI: S. Ekins)  Accelrys  The CDD TB has been developed thanks to funding from the Bill and Melinda Gates Foundation (Grant#49852 “Collaborative drug discovery for TB through a novel database of SAR data optimized to promote data archiving and sharing”)  Allen Casey (IDRI)
  • 29. You can find me @... CDD Booth 205 PAPER ID: 13433 PAPER TITLE: “Dispensing processes profoundly impact biological assays and computational and statistical analyses” April 8th 8.35am Room 349 PAPER ID: 14750 PAPER TITLE: “Enhancing High Throughput Screening For Mycobacterium tuberculosis Drug Discovery Using Bayesian Models” April 9th 1.30pm Room 353 PAPER ID: 21524 PAPER TITLE: “Navigating between patents, papers, abstracts and databases using public sources and tools” April 9th 3.50pm Room 350 PAPER ID: 13358 PAPER TITLE: “TB Mobile: Appifying Data on Anti-tuberculosis Molecule Targets” April 10th 8.30am Room 357 PAPER ID: 13382 PAPER TITLE: “Challenges and recommendations for obtaining chemical structures of industry-provided repurposing candidates” April 10th 10.20am Room 350 PAPER ID: 13438 PAPER TITLE: “Dual-event machine learning models to accelerate drug discovery” April 10th 3.05 pm Room 350