Data Quality Issues That Can Impact Drug Discovery

Sean Ekins1, Joe Olechno2, Antony J. Williams3

1 Collaborations in Chemistry, Fuquay Varina, NC.
2 Labcyte Inc, Sunnyvale, CA.
3 Royal Society of Chemistry, Wake Forest, NC.




Disclaimer: SE and AJW have no affiliation with Labcyte and have not been engaged as consultants
“If I have seen further than others, it is by standing upon the shoulders of giants.”
Isaac Newton
Example: GPCR drug discovery

• Clone gene for β-receptor
• Cloning of more genes for other receptors
• Understanding of similarities
• Identify family of GPCRs
• Now 1000s of receptors belong to this family
• Understand how to target receptors - identify structure
• Results in drugs tailored to fit these receptors
• “GPCR and drug discovery” – 661 hits in PubMed since 1997

Pictures: Wikipedia
Where do scientists get chemistry/biology data?

• Databases
• Patents
• Papers
• Your own lab
• Collaborators

Some or all of the above?
What is common to all? – quality issues
From data hoarding to open data




[Image captions: (IMDB); Me; Linked Open Data cloud 2011 (Wikipedia)]
Pharma company data hoarding - to open data
Simple Rules for licensing “open” data

As we see a future of increased database integration the
licensing of the data may be a hurdle that hampers progress
and usability.




  Williams, Wilbanks and Ekins. PLoS Comput Biol 8(9): e1002706, 2012
Could open accessibility = disruption?




1: NIH and other international scientific funding bodies should mandate
…open accessibility for all data generated by publicly funded research


Ekins, Waller, Bradley, Clark and Williams. DDT In press, 2012
Data can be found – but what about quality?
Why is drug structure quality important?

• More groups doing in silico repositioning
• Target-based or ligand-based
• Network and systems biology

• They are all integrating or using sets of FDA drugs. If the structures are incorrect, the predictions will be too.

• What is needed is a definitive set of FDA-approved drugs with correct structures and tools for in silico screening

• Also linkage between in vitro data and clinical data
Structure Quality Issues
Database released, and within days hundreds of errors were found in the structures
NPC Browser http://tripod.nih.gov/npc/
Science Translational Medicine 2011

Williams and Ekins, DDT, 16: 747-750 (2011)
Which is Neomycin?




Describes errors in
other types of
databases too!!


Williams, Ekins and Tkachenko Drug Disc Today 17: 685-701 (2012)
Data Errors in the NPC Browser: Analysis of Steroids

Substructure    # of Hits   # of Correct Hits   No stereochemistry   Incomplete stereochemistry   Complete but incorrect stereochemistry
Gonane          34          5                   8                    21                           0
Gon-4-ene       55          12                  3                    33                           7
Gon-1,4-diene   60          17                  10                   23                           10

Williams, Ekins and Tkachenko Drug Disc Today 17: 685-701 (2012)
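The counts in the steroid table can be turned into per-substructure accuracy rates with simple arithmetic. The sketch below uses only the numbers shown in the table; the names `STEROID_COUNTS` and `percent_correct` are illustrative and do not come from the cited paper.

```python
# Counts copied from the steroid table (Williams, Ekins and Tkachenko, 2012).
# Tuple order: (hits, correct, no stereo, incomplete stereo, complete but incorrect stereo)
STEROID_COUNTS = {
    "Gonane": (34, 5, 8, 21, 0),
    "Gon-4-ene": (55, 12, 3, 33, 7),
    "Gon-1,4-diene": (60, 17, 10, 23, 10),
}


def percent_correct(hits, correct):
    """Return the percentage of substructure hits with a correct structure."""
    return 100.0 * correct / hits


for name, (hits, correct, none, incomplete, wrong) in STEROID_COUNTS.items():
    # Sanity check: the four outcome columns should add up to the hit count.
    assert correct + none + incomplete + wrong == hits
    print(f"{name}: {percent_correct(hits, correct):.1f}% correct")
```

On these counts, fewer than a third of the hits in any of the three substructure classes carry a fully correct structure (roughly 14.7%, 21.8% and 28.3%).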
DDT editorial Dec 2011




http://goo.gl/dIqhU
It's not just structure quality we need to worry about

January: Joe Olechno saw the editorial and commented on the blog
How do you move a liquid?



  Low throughput




  High throughput




Images courtesy of Bing
Plastic leaching




McDonald et al., Science 2008, 322, 917.   Belaiche et al., Clin Chem 2009, 55, 1883-1884
Moving liquids with sound




Images courtesy of Labcyte Inc.
 http://goo.gl/K0Fjz
Tips vs. Acoustic analysis




Data appears randomly scattered. 24% had IC50 values > 3 fold weaker using tip-based dispensing. 8% produced no value using tip-based dispensing.
No analysis of molecule properties.
Spicer et al., In Drug Discovery Technology: Boston, 2005.

~40 12-point IC50 values. Compounds more active when using acoustic dispensing. Correlation in data is poor, with many compounds showing >10 fold shift in potency depending on dispensing method.
No analysis of molecule properties.
Wingfield et al., American Drug Discovery 2008, 3, 24-30.
Tips vs. Acoustic analysis – large scale




Inhibition of tyrosine kinases at 10 µM for ~10,000 compounds.
False positives from acoustic transfer (as measured by subsequent IC50 analyses) = 19% of hits.

False positives from tip-based transfers = 55% of all hits.

60 more compounds were identified as active with acoustic transfer.

No analysis of molecule properties.

             Wingfield, J. Drug Discovery 2012: Manchester, UK, 2012
Problems with biological data: how you dispense matters


Few structures and corresponding data are public

Using data from 2 AstraZeneca patents –

Tyrosine kinase EphB4 pharmacophores (Accelrys Discovery
Studio) were developed using data for 14 compounds

IC50 determined using different dispensing methods

Analyzed correlation with simple descriptors (SAS JMP)

Calculated LogP correlation with log IC50 data for acoustic
dispensing (r2 = 0.34, p < 0.05, N = 14)

 Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
 Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
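The logP correlation mentioned above can be sketched in a few lines. The slides used SAS JMP; this is a standard-library stand-in and not the authors' workflow. The compound values are hypothetical placeholders, not the patent data, and the function names are illustrative.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 3:
        raise ValueError("need two sequences of equal length with at least 3 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def t_statistic(r, n):
    """t statistic for testing r != 0, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# HYPOTHETICAL placeholder values for illustration only (not the patent data).
logp = [2.1, 2.8, 3.0, 3.5, 4.2, 4.4]
ic50_um = [1.2, 0.90, 0.35, 0.40, 0.05, 0.11]
log_ic50 = [math.log10(v) for v in ic50_um]

r = pearson_r(logp, log_ic50)
print(f"r2 = {r * r:.2f}, t = {t_statistic(r, len(logp)):.2f}")
```

As a check on the reported figures, `t_statistic(math.sqrt(0.34), 14)` is about 2.49, above the two-tailed 5% critical value of roughly 2.18 for 12 degrees of freedom, which is consistent with the stated p < 0.05.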
Examples of IC50 values produced via acoustic transfer with direct dilution vs. those generated with tip-based transfer and serial dilutions

Acoustically derived IC50 values were 1.5 to 276.5-fold lower than those for tip-based dispensing
Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
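The fold difference quoted above is the ratio of the tip-based IC50 to the acoustic IC50 for the same compound. A minimal sketch follows; the compound names and values are hypothetical and are not taken from the patents.

```python
def fold_shift(ic50_tip, ic50_acoustic):
    """How many times lower the acoustic IC50 is than the tip-based IC50."""
    if ic50_tip <= 0 or ic50_acoustic <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_tip / ic50_acoustic


# HYPOTHETICAL paired measurements in micromolar: compound -> (tip-based, acoustic)
paired_ic50 = {
    "cmpd_A": (0.60, 0.40),
    "cmpd_B": (5.0, 0.25),
    "cmpd_C": (12.0, 0.10),
}

shifts = {name: fold_shift(tip, ac) for name, (tip, ac) in paired_ic50.items()}
for name, shift in sorted(shifts.items(), key=lambda kv: kv[1]):
    print(f"{name}: {shift:.1f}-fold lower with acoustic dispensing")
```

A ratio above 1 means the compound looks more potent with acoustic dispensing; comparing on a log scale would be a reasonable alternative when the shifts span several orders of magnitude.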
Tyrosine kinase EphB4 Pharmacophores

[Figure panels: Acoustic; Disposable tip. Each model shows most potent molecule mapping.]
Cyan = hydrophobic
Green = hydrogen bond acceptor
Purple = hydrogen bond donor

Process                           Hydrophobic features (HPF)   Hydrogen bond acceptor (HBA)   Hydrogen bond donor (HBD)   Observed vs. predicted IC50 (r)
Acoustic mediated process         2                            1                              1                           0.92
Disposable tip mediated process   0                            2                              1                           0.80

Ekins, Olechno and Williams, Submitted 2012
Test set evaluation of pharmacophores

• An additional 12 compounds from AstraZeneca
       Barlaam, B. C.; Ducray, R., WO 2008/132505 A1, 2008

• 10 of these compounds had data for tip-based dispensing
  and 2 for acoustic dispensing

• Calculated LogP and logD showed low but statistically
  significant correlations with tip-based dispensing (r2 = 0.39,
  p < 0.05 and r2 = 0.24, p < 0.05, respectively; N = 36)

• Used as a test set for pharmacophores

• The two compounds analyzed with acoustic liquid
  handling were predicted in the top 3 using the acoustic
  pharmacophore

• The tip-based pharmacophore failed to rank the retrieved
  compounds correctly
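The "predicted in the top 3" check can be pictured as a simple ranking step. The sketch below assumes compounds are ranked by predicted IC50, lowest first; the slides do not state the exact ranking criterion, and all identifiers and values here are hypothetical.

```python
def top_k(predicted_ic50, k=3):
    """Return the k compound names with the lowest predicted IC50 (most potent first)."""
    ranked = sorted(predicted_ic50, key=predicted_ic50.get)
    return ranked[:k]


# HYPOTHETICAL predicted IC50 values (micromolar) for a 12-compound test set.
values = [4.1, 0.08, 2.5, 0.9, 0.02, 7.3, 1.6, 0.3, 5.0, 3.2, 0.6, 9.9]
predicted = {f"test_{i:02d}": v for i, v in enumerate(values, start=1)}

# HYPOTHETICAL: the two compounds assumed to have acoustic measurements.
acoustic_measured = {"test_02", "test_05"}

best = top_k(predicted, k=3)
print(best, acoustic_measured.issubset(best))
```

With only two acoustically measured compounds in the test set, a check like this is a weak form of validation, which fits the small-dataset caveat raised later in the deck.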
Automated receptor-ligand pharmacophore generation method

Pharmacophores for the tyrosine kinase EphB4 generated from crystal structures in the Protein Data Bank (PDB) using Discovery Studio version 3.5.5

Cyan = hydrophobic
Green = hydrogen bond acceptor
Purple = hydrogen bond donor
Grey = excluded volumes
Each model shows most potent molecule mapping

Bioorg Med Chem Lett 2010, 20, 6242-6245.
Bioorg Med Chem Lett 2008, 18, 5717-5721.
Bioorg Med Chem Lett 2008, 18, 2776-2780.
Bioorg Med Chem Lett 2011, 21, 2207-2211.
Summary

• In the absence of structural data, pharmacophores and other
  computational and statistical models are used to guide medicinal
  chemistry in early drug discovery.

• Our findings suggest non tip-based methods could improve HTS results
  and avoid the development of misleading computational models and
  statistical relationships.


• The automated pharmacophores are closer to the pharmacophore generated
  with acoustic data: all have hydrophobic features, which are missing from
  the tip-based model

• Importance of hydrophobicity seen with logP correlation and
  crystal structure interactions

• Public databases should annotate this meta-data alongside biological
  data points, to create larger datasets for comparing different
  computational methods.
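The last point, annotating dispensing metadata alongside each data point, could look something like the record below. This is a hypothetical schema for illustration only; the field names are assumptions and do not reflect any existing database format.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class ActivityRecord:
    """One biological data point plus its dispensing metadata (hypothetical schema)."""
    compound_id: str
    target: str
    ic50_um: float
    dispensing_method: str  # e.g. "acoustic" or "tip-based"
    dilution_scheme: str    # e.g. "direct" or "serial"


def group_by_method(records):
    """Group records by dispensing method so each set can be modelled separately."""
    groups = defaultdict(list)
    for record in records:
        groups[record.dispensing_method].append(record)
    return dict(groups)


# HYPOTHETICAL records for illustration only.
records = [
    ActivityRecord("cmpd_A", "EphB4", 0.40, "acoustic", "direct"),
    ActivityRecord("cmpd_A", "EphB4", 0.60, "tip-based", "serial"),
    ActivityRecord("cmpd_B", "EphB4", 0.25, "acoustic", "direct"),
]

for method, items in group_by_method(records).items():
    print(method, [r.compound_id for r in items])
```

Keeping the method on every record means models can be built per dispensing method, or the method can be used as a covariate, rather than pooling values that may not be comparable.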
The stuff of nightmares?

• How much of the data in databases is generated by tip-based methods?
• How much is erroneous?
• Do we have to start again?
• How does it affect all subsequent science – data mining etc.?
• Does it impact pharma's productivity?
Strengths and Weaknesses

Small dataset size – focused on one compound series

No previous publication describing how data quality can be
impacted by dispensing and how this in turn affects
computational models and downstream decision making.

No previous comparison of pharmacophores generated from acoustic
dispensing and tip-based dispensing.

No previous comparison of pharmacophores generated from in
vitro data with pharmacophores automatically generated from
X-ray crystal conformations of inhibitors.

Severely limited by number of structures in public domain with
data in both systems
Data Quality Issues That Can Impact Drug Discovery

  • 1. Data Quality Issues That Can Impact Drug Discovery. Sean Ekins (1), Joe Olechno (2), Antony J. Williams (3). (1) Collaborations in Chemistry, Fuquay Varina, NC. (2) Labcyte Inc, Sunnyvale, CA. (3) Royal Society of Chemistry, Wake Forest, NC. Disclaimer: SE and AJW have no affiliation with Labcyte and have not been engaged as consultants.
  • 2. “If I have seen further than others, it is by standing upon the shoulders of giants.” Isaac Newton
  • 3. Example: GPCR drug discovery (pictures: Wikipedia)
      • Clone gene for β-receptor
      • Cloning of more genes for other receptors
      • Understanding of similarities
      • Identify the GPCR family
      • Now 1000s of receptors belong to this family
      • Understand how to target receptors: identify structure
      • Results in drugs tailored to fit these receptors
      • “GPCR and drug discovery” – 661 hits in PubMed since 1997
  • 4. Where do scientists get chemistry/biology data?
      • Databases
      • Patents
      • Papers
      • Your own lab
      • Collaborators
      • Some or all of the above?
      • What is common to all? Quality issues
  • 5. From data hoarding to open data. Image labels: (IMDB); Me; Linked Open Data cloud 2011 (Wikipedia)
  • 6. Pharma company data hoarding - to open data
  • 7. Simple Rules for licensing “open” data. As we see a future of increased database integration, the licensing of the data may be a hurdle that hampers progress and usability. Williams, Wilbanks and Ekins. PLoS Comput Biol 8(9): e1002706, 2012
  • 8. Could open accessibility = Disruption? 1: NIH and other international scientific funding bodies should mandate …open accessibility for all data generated by publicly funded research. Ekins, Waller, Bradley, Clark and Williams. DDT In press, 2012
  • 9. Data can be found – but what about quality?
  • 10. Why is drug structure quality important?
      • More groups doing in silico repositioning
      • Target-based or ligand-based
      • Network and systems biology
      • They are all integrating or using sets of FDA drugs. If the structures are incorrect, predictions will be too.
      • What is needed is a definitive set of FDA approved drugs with correct structures and tools for in silico screening
      • Also linkage between in vitro data & clinical data
  • 11. Structure Quality Issues. Database released and within days 100s of errors found in structures. NPC Browser http://tripod.nih.gov/npc/ Science Translational Medicine 2011. Williams and Ekins, DDT, 16: 747-750 (2011)
  • 12. Which is Neomycin? Describes errors in other types of databases too! Williams, Ekins and Tkachenko Drug Disc Today 17: 685-701 (2012)
  • 13. Data Errors in the NPC Browser: Analysis of Steroids. Williams, Ekins and Tkachenko Drug Disc Today 17: 685-701 (2012)
      Substructure  | # of Hits | # of Correct Hits | No stereochemistry | Incomplete stereochemistry | Complete but incorrect stereochemistry
      Gonane        | 34        | 5                 | 8                  | 21                         | 0
      Gon-4-ene     | 55        | 12                | 3                  | 33                         | 7
      Gon-1,4-diene | 60        | 17                | 10                 | 23                         | 10
  • 14. DDT editorial Dec 2011 http://goo.gl/dIqhU
  • 15. It's not just structure quality we need to worry about. Jan: Joe Olechno saw the editorial and commented on the blog
  • 16. How do you move a liquid? Low throughput vs. high throughput. Images courtesy of Bing
  • 17. Plastic leaching. McDonald et al., Science 2008, 322, 917. Belaiche et al., Clin Chem 2009, 55, 1883-1884
  • 18. Moving liquids with sound. Images courtesy of Labcyte Inc. http://goo.gl/K0Fjz
  • 19. Tips vs. Acoustic analysis
      • Spicer et al., In Drug Discovery Technology: Boston, 2005. Data appears randomly scattered. Compounds more active when using acoustic dispensing. Correlation in data is poor, with many compounds showing >10 fold shift in potency depending on dispensing method. No analysis of molecule properties.
      • Wingfield et al., American Drug Discovery 2008, 3, 24-30. ~40 12-point IC50 values. 24% had IC50 values > 3 fold weaker using tip-based dispensing. 8% produced no value using tip-based dispensing. No analysis of molecule properties.
  • 20. Tips vs. Acoustic analysis – large scale. Inhibition of tyrosine kinases at 10 µM for ~10,000 compounds. False positives from acoustic transfer (as measured by subsequent IC50 analyses) = 19% of hits. False positives from tip-based transfers = 55% of all hits. 60 more compounds were identified as active with acoustic transfer. No analysis of molecule properties. Wingfield, J. Drug Discovery 2012: Manchester, UK, 2012
  • 21. Problems with biological data: how you dispense matters
      • Few structures and corresponding data are public
      • Using data from 2 AstraZeneca patents, tyrosine kinase EphB4 pharmacophores (Accelrys Discovery Studio) were developed using data for 14 compounds
      • IC50 determined using different dispensing methods
      • Analyzed correlation with simple descriptors (SAS JMP)
      • Calculated LogP correlation with log IC50 data for acoustic dispensing (r² = 0.34, p < 0.05, N = 14)
      • Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
      • Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
  • 22. Examples of IC50 values produced via acoustic transfer with direct dilution vs. those generated with tip-based transfer and serial dilutions. Acoustically derived IC50 values were 1.5 to 276.5-fold lower than for tip-based dispensing. Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009. Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
  • 23. Tyrosine kinase EphB4 Pharmacophores. Cyan = hydrophobic, green = hydrogen bond acceptor, purple = hydrogen bond donor. Each model shows most potent molecule mapping. Ekins, Olechno and Williams, Submitted 2012
      Process                         | Hydrophobic features (HPF) | Hydrogen bond acceptor (HBA) | Hydrogen bond donor (HBD) | Observed vs. predicted IC50 r
      Acoustic mediated process       | 2                          | 1                            | 1                         | 0.92
      Disposable tip mediated process | 0                          | 2                            | 1                         | 0.80
  • 24. Test set evaluation of pharmacophores
      • An additional 12 compounds from AstraZeneca: Barlaam, B. C.; Ducray, R., WO 2008/132505 A1, 2008
      • 10 of these compounds had data for tip-based dispensing and 2 for acoustic dispensing
      • Calculated LogP and logD showed low but statistically significant correlations with tip-based dispensing (r² = 0.39, p < 0.05 and r² = 0.24, p < 0.05; N = 36)
      • Used as a test set for pharmacophores
      • The two compounds analyzed with acoustic liquid handling were predicted in the top 3 using the acoustic pharmacophore
      • The tip-based pharmacophore failed to rank the retrieved compounds correctly
  • 25. Automated receptor-ligand pharmacophore generation method. Pharmacophores for the tyrosine kinase EphB4 generated from crystal structures in the Protein Data Bank (PDB) using Discovery Studio version 3.5.5. Cyan = hydrophobic, green = hydrogen bond acceptor, purple = hydrogen bond donor, grey = excluded volumes. Each model shows most potent molecule mapping. Bioorg Med Chem Lett 2010, 20, 6242-6245. Bioorg Med Chem Lett 2008, 18, 5717-5721. Bioorg Med Chem Lett 2008, 18, 2776-2780. Bioorg Med Chem Lett 2011, 21, 2207-2211.
  • 26. Summary
      • In the absence of structural data, pharmacophores and other computational and statistical models are used to guide medicinal chemistry in early drug discovery.
      • Our findings suggest non tip-based methods could improve HTS results and avoid the development of misleading computational models and statistical relationships.
      • Automated pharmacophores are closer to the pharmacophore generated with acoustic data: all have hydrophobic features, which are missing from the tip-based model.
      • Importance of hydrophobicity is seen with the logP correlation and crystal structure interactions.
      • Public databases should annotate this meta-data alongside biological data points, to create larger datasets for comparing different computational methods.
  • 27. The stuff of nightmares?
      • How much of the data in databases is generated by tip-based methods?
      • How much is erroneous?
      • Do we have to start again?
      • How does it affect all subsequent science (data mining etc.)?
      • Does it impact pharma's productivity?
  • 28. Strengths and Weaknesses
      • Small dataset size, focused on one compound series.
      • No previous publication describing how data quality can be impacted by dispensing and how this in turn affects computational models and downstream decision making.
      • No comparison of pharmacophores generated from acoustic dispensing and tip-based dispensing.
      • No previous comparison of pharmacophores generated from in vitro data with pharmacophores automatically generated from X-ray crystal conformations of inhibitors.
      • Severely limited by number of structures in public domain with data in both systems.
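
The steroid counts on slide 13 can be tallied with a few lines of code. The following is only a minimal sketch, not anything the authors used: the counts are copied from the slide, while the names STEROID_COUNTS, percent_correct and overall_percent_correct are hypothetical and introduced here for illustration. It checks that the four outcome categories add up to each row's hit count and reports the share of hits with a correct structure.

```python
# Counts transcribed from slide 13 (NPC Browser steroid analysis).
# Tuple order: hits, correct, no stereochemistry,
# incomplete stereochemistry, complete but incorrect stereochemistry.
# All names below are hypothetical, chosen for this illustration only.
STEROID_COUNTS = {
    "Gonane": (34, 5, 8, 21, 0),
    "Gon-4-ene": (55, 12, 3, 33, 7),
    "Gon-1,4-diene": (60, 17, 10, 23, 10),
}


def percent_correct(counts):
    """Return {substructure: percent of hits with a correct structure}."""
    result = {}
    for name, (hits, correct, no_stereo, incomplete, wrong) in counts.items():
        # The four outcome categories should partition the hits.
        if correct + no_stereo + incomplete + wrong != hits:
            raise ValueError(f"row {name!r} does not sum to its hit count")
        result[name] = round(100.0 * correct / hits, 1)
    return result


def overall_percent_correct(counts):
    """Return the percent of correct structures pooled over all rows."""
    total_hits = sum(row[0] for row in counts.values())
    total_correct = sum(row[1] for row in counts.values())
    return round(100.0 * total_correct / total_hits, 1)


if __name__ == "__main__":
    for name, pct in percent_correct(STEROID_COUNTS).items():
        print(f"{name}: {pct}% of hits have a correct structure")
    print(f"Overall: {overall_percent_correct(STEROID_COUNTS)}%")
```

On the slide's numbers, each row sums to its hit count, and fewer than a third of the hits in any substructure class are correct.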