SlideShare une entreprise Scribd logo
1  sur  41
http://tinyurl.com/d6wodsl




  Mining public domain data as a basis
                  for drug repurposing

                    Antony J Williams, Sean Ekins and Valery Tkachenko

                                               ACS Philadelphia August 2012
Drug Repurposing
 Drug repurposing commonly
  means data reexamination also!

 Lots of data mining occurs

 Then more screening which
  creates more data..

 LOTS of public databases used
  to examine repurposing…
A LOT of data coming online
Interlinked on the semantic web
Where do you get your data?
     Databases?
     Patents?
     Papers?
     Your own lab?
     Collaborators?
     All of the above?

   What is likely common to all sources? Data
    Quality issues. There is no perfect database.
Public Domain Databases
 Our databases are a mess…
 Non-curated databases are proliferating errors

 We source and deposit data between databases

 Original sources of errors hard to determine

 Curation is time-consuming and challenging
Availability of libraries of FDA drugs




 Johns Hopkins Clinical Compound library- made compounds available at cost
The FDA Drug Database
The DailyMed Database
Government Databases Should
Come With a Health Warning




               Williams and Ekins, DDT, 16: 747-750 (2011)
What is Neomycin?
Not this…
Data Errors in the NPC Browser: Analysis of Steroids




      Substructure   # of    # of          No           Incomplete       Complete but

                     Hits   Correct   stereochemistry Stereochemistry      incorrect

                             Hits                                       stereochemistry


     Gonane          34       5             8               21                0

     Gon-4-ene       55       12            3               33                7

     Gon-1,4-diene   60       17            10              23                10


Williams, Ekins and Tkachenko
Drug Disc Today 17: 685-701 (2012)
Drug Disambiguation Project
NCATS Discovering “New Therapeutic
Uses for Existing Molecules”




58 Molecule names
and identifiers. Where
are the “structures”?
NCATS dataset
•   Several groups tried to collate molecules
•   Chris Lipinski provided approximately 30 unique molecules

•   Simple molecule descriptors shows no difference between
    compounds classified as discontinued (N= 15) or those in
    clinical trials (n = 14).




•   Where is the definitive set of publicly accessible molecules
    for computational repurposing and analysis?
Drug structure quality is important..
 Many groups ARE doing in silico repositioning

 Integrating or using sets of FDA drugs..and if
  structures are incorrect predictions will be

 Where is the definitive set of FDA approved
  drugs with correct structures?

 Ideally we need linkage between in vitro data
  and clinical data
We have a problem…
   Lots of data available but quality is suspect
   Errors proliferate database to database
   Data continues to flow in unabated
   When errors are identified hard to get fixed!
   Data licensing is confusing – “Open Data”
   We are “takers” not “givers” mostly…
   Standards are lacking:
      Data licensing
      Data processing – structure standardization
So what needs to happen to improve?
• Let’s agree collaboration and crowdsourcing
  can help
• Provide SIMPLE ways to provide feedback
• Contribute when possible – databases should
  provide feedback mechanisms
• Adopt standards for structure handling and
  representation
• Adopt standards for data interchange
• Allow machine handling of data – use the
  power of the semantic web
Williams, Ekins and Tkachenko, Drug Disc Today 17: 685-701 (2012)
Collaboration on Curation
 Collaborate on curation…share through standards
  and open interfaces
All DBs should take comments!
Standardize




 Use the SRS as guidance for standardization
“Appify” curation and collaboration

• The data network is complex
• “Appify” collaboration and
  curation networks
• Increasing crowdsourcing role
  for data analysis




                 Ekins & Williams, Pharm Res, 27: 393-395, 2010.
Mobile Apps for Drug Discovery
Open Drug Discovery Teams

 Free iOS app used to expose repurposing data
 All of this data has been tweeted
  http://tinyurl.com/6l9qy4f




Ekins, Clark and Williams, Mol Informatics, in Press 2012
Open Drug Discovery Teams
Simple Rules for licensing “open” data
   Gather stakeholders. Decide if goals are primarily scientific,
    commercial or mixed.

   Explore benefits of open licensing and drawbacks of
    enclosure. Hold closely to open definitions and standards.
    Do not write your own IP licenses!

   Provide simple explanations for terms of use. Use
    metadata to indicate licensing terms explicitly - the
    Creative Commons Rights Expression Language is a
    good tool.

   Do not lock up metadata. If you can’t make the data public
    domain, make the metadata public domain.
Williams, Wilbanks and Ekins.
PLoS Comput. Biol. in Press Sept.2012
Open PHACTS Project
 Develop a set of robust standards…
 Implement the standards in a semantic integration hub
 Deliver services to support drug discovery programs
  in pharma and public domain
 22 partners, 8 pharmaceutical companies, 3 biotechs
 36 months project

  Guiding principle is open access, open usage, open source
                - Key to standards adoption -
To facilitate THIS process!
                                                         IP?
                                 What’s the
                                 structure?
                                                     Are they in
                                                      our file?
                                   What’s
                                  similar?
                                                     What’s the
                               Pharmacology           target?
                                   data?

                                              Known
                                             Pathways?
                              Competitors?
                                                     Working On
                               Connections             Now?
                               to disease?
                                               Expressed in
                                              right cell type?
It’s not JUST structures of course…
Taxol: Paclitaxel Bioassay Data
 Most Bioassay data associated with structure
  with one ambiguous stereocenter
Measuring data: dispensing dependencies
  Data from 2 AstraZeneca patents - Ephrin pharmacophores
  developed using data for 14 compounds with IC50. Different
  dispensing methods give different results. Impact
  hypotheses and could impact drug discovery.




                     Acoustic                                        Disposable tip
                                       Hydrophobic        Hydrogen      Hydrogen     Observed vs.

                                       features (HPF)   bond acceptor   bond donor   predicted IC50

                                                           (HBA)          (HBD)            r

     Acoustic mediated process
                                             2               1              1            0.92
     Disposable tip mediated process
                                             0               2              1            0.80



Ekins, Olechno and Williams, Submitted 2012
Measuring data: dispensing dependencies
  Acoustically-derived IC50 values were 1.5 to 276.5-fold
  lower than for tip-based dispensing
• Pharmacophores and other computational models are used
  to guide medicinal chemistry.

• Non tip-based methods may improve HTS results and avoid
  misleading computational and statistical models.

• No analysis of influence of dispensing processes on data.

• Public databases should annotate metadata to create larger
  datasets for comparing different computational methods.
  How much data is reproducible, accurate, valid? The
  challenge of high-throughput science.
Conclusions
Acknowledgments
   Sean Ekins
   Christopher Lipinski
   Joe Olechno
   John Wilbanks
   Drug Disambiguation project team
   RSC Cheminformatics Team
Thank you

Email: williamsa@rsc.org
Twitter: @chemconnector
Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams


Email: ekinssean@yahoo.com
Twitter: collabchem
Blog: http://www.collabchem.com/

Contenu connexe

Tendances

BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigData_Europe
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Valery Tkachenko
 
Small Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichSmall Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichEmma Schymanski
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...Frederik van den Broek
 

Tendances (20)

An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & Health
 
The influence of data curation on QSAR Modeling – examining issues of qualit...
 The influence of data curation on QSAR Modeling – examining issues of qualit... The influence of data curation on QSAR Modeling – examining issues of qualit...
The influence of data curation on QSAR Modeling – examining issues of qualit...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspnRSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
 
How the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data finalHow the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data final
 
FAIR Data and Model Management for Systems Biology (and SOPs too!)
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...
 
Small Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichSmall Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica Munich
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
 
Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
Navigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpiderNavigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpider
 
Cheminformatics and the Structure Elucidation of Natural Products
Cheminformatics and the Structure Elucidation of Natural ProductsCheminformatics and the Structure Elucidation of Natural Products
Cheminformatics and the Structure Elucidation of Natural Products
 
Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...Towards a gold standard and regarding quality in public domain chemistry data...
Towards a gold standard and regarding quality in public domain chemistry data...
 
A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...
A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...
A Presentation at Nature Publishing Group Crowdsourcing, Collaborations and T...
 

Similaire à Antony J Williams

Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_finalKristi Holmes
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1BigData_Europe
 
Friend NAS 2013-01-10
Friend NAS 2013-01-10Friend NAS 2013-01-10
Friend NAS 2013-01-10Sage Base
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
 
Open sourcedays2013 claus stie kallesoe
Open sourcedays2013   claus stie kallesoeOpen sourcedays2013   claus stie kallesoe
Open sourcedays2013 claus stie kallesoeClaus Stie Kallesøe
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational MedicineJoanne Luciano
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...open_phacts
 
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Richard Boyce, PhD
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposingSean Ekins
 
Using transparency to increase awareness of chemical hazards.pptx
Using transparency to increase awareness of chemical hazards.pptxUsing transparency to increase awareness of chemical hazards.pptx
Using transparency to increase awareness of chemical hazards.pptxDIv CHAS
 
Knowledge Discovery using an Integrated Semantic Web
Knowledge Discovery using an Integrated Semantic WebKnowledge Discovery using an Integrated Semantic Web
Knowledge Discovery using an Integrated Semantic WebMichel Dumontier
 

Similaire à Antony J Williams (20)

Chem spider as a chemical term resolver
Chem spider as a chemical term resolverChem spider as a chemical term resolver
Chem spider as a chemical term resolver
 
ChemSpider as a chemical term resolver
ChemSpider as a chemical term resolverChemSpider as a chemical term resolver
ChemSpider as a chemical term resolver
 
Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...
 
Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_final
 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1
 
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 PresentationSLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
 
Friend NAS 2013-01-10
Friend NAS 2013-01-10Friend NAS 2013-01-10
Friend NAS 2013-01-10
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Opening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs api
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Open sourcedays2013 claus stie kallesoe
Open sourcedays2013   claus stie kallesoeOpen sourcedays2013   claus stie kallesoe
Open sourcedays2013 claus stie kallesoe
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational Medicine
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
 
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposing
 
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
 
Chemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScienceChemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScience
 
Using transparency to increase awareness of chemical hazards.pptx
Using transparency to increase awareness of chemical hazards.pptxUsing transparency to increase awareness of chemical hazards.pptx
Using transparency to increase awareness of chemical hazards.pptx
 
Knowledge Discovery using an Integrated Semantic Web
Knowledge Discovery using an Integrated Semantic WebKnowledge Discovery using an Integrated Semantic Web
Knowledge Discovery using an Integrated Semantic Web
 

Dernier

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 

Dernier (20)

SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Antony J Williams

  • 1. http://tinyurl.com/d6wodsl Mining public domain data as a basis for drug repurposing Antony J Williams, Sean Ekins and Valery Tkachenko ACS Philadelphia August 2012
  • 2. Drug Repurposing  Drug repurposing commonly means data reexamination also!  Lots of data mining occurs  Then more screening which creates more data..  LOTS of public databases used to examine repurposing…
  • 3. A LOT of data coming online
  • 4. Interlinked on the semantic web
  • 5. Where do you get your data?  Databases?  Patents?  Papers?  Your own lab?  Collaborators?  All of the above?  What is likely common to all sources? Data Quality issues. There is no perfect database.
  • 6. Public Domain Databases  Our databases are a mess…  Non-curated databases are proliferating errors  We source and deposit data between databases  Original sources of errors hard to determine  Curation is time-consuming and challenging
  • 7.
  • 8. Availability of libraries of FDA drugs Johns Hopkins Clinical Compound library- made compounds available at cost
  • 9. The FDA Drug Database
  • 11. Government Databases Should Come With a Health Warning Williams and Ekins, DDT, 16: 747-750 (2011)
  • 14. Data Errors in the NPC Browser: Analysis of Steroids Substructure # of # of No Incomplete Complete but Hits Correct stereochemistry Stereochemistry incorrect Hits stereochemistry Gonane 34 5 8 21 0 Gon-4-ene 55 12 3 33 7 Gon-1,4-diene 60 17 10 23 10 Williams, Ekins and Tkachenko Drug Disc Today 17: 685-701 (2012)
  • 15.
  • 17. NCATS Discovering “New Therapeutic Uses for Existing Molecules” 58 Molecule names and identifiers. Where are the “structures”?
  • 18. NCATS dataset • Several groups tried to collate molecules • Chris Lipinski provided approximately 30 unique molecules • Simple molecule descriptors shows no difference between compounds classified as discontinued (N= 15) or those in clinical trials (n = 14). • Where is the definitive set of publicly accessible molecules for computational repurposing and analysis?
  • 19. Drug structure quality is important..  Many groups ARE doing in silico repositioning  Integrating or using sets of FDA drugs..and if structures are incorrect predictions will be  Where is the definitive set of FDA approved drugs with correct structures?  Ideally we need linkage between in vitro data and clinical data
  • 20. We have a problem…  Lots of data available but quality is suspect  Errors proliferate database to database  Data continues to flow in unabated  When errors are identified hard to get fixed!  Data licensing is confusing – “Open Data”  We are “takers” not “givers” mostly…  Standards are lacking:  Data licensing  Data processing – structure standardization
  • 21. So what needs to happen to improve? • Let’s agree collaboration and crowdsourcing can help • Provide SIMPLE ways to provide feedback • Contribute when possible – databases should provide feedback mechanisms • Adopt standards for structure handling and representation • Adopt standards for data interchange • Allow machine handling of data – use the power of the semantic web
  • 22. Williams, Ekins and Tkachenko, Drug Disc Today 17: 685-701 (2012)
  • 23. Collaboration on Curation  Collaborate on curation…share through standards and open interfaces
  • 24. All DBs should take comments!
  • 25. Standardize  Use the SRS as guidance for standardization
  • 26. “Appify” curation and collaboration • The data network is complex • “Appify” collaboration and curation networks • Increasing crowdsourcing role for data analysis Ekins & Williams, Pharm Res, 27: 393-395, 2010.
  • 27. Mobile Apps for Drug Discovery
  • 28. Open Drug Discovery Teams  Free iOS app used to expose repurposing data  All of this data has been tweeted http://tinyurl.com/6l9qy4f Ekins, Clark and Williams, Mol Informatics, in Press 2012
  • 30. Simple Rules for licensing “open” data  Gather stakeholders. Decide if goals are primarily scientific, commercial or mixed.  Explore benefits of open licensing and drawbacks of enclosure. Hold closely to open definitions and standards. Do not write your own IP licenses!  Provide simple explanations for terms of use. Use metadata to indicate licensing terms explicitly - the Creative Commons Rights Expression Language is a good tool.  Do not lock up metadata. If you can’t make the data public domain, make the metadata public domain. Williams, Wilbanks and Ekins. PLoS Comput. Biol. in Press Sept.2012
  • 31. Open PHACTS Project  Develop a set of robust standards…  Implement the standards in a semantic integration hub  Deliver services to support drug discovery programs in pharma and public domain  22 partners, 8 pharmaceutical companies, 3 biotechs  36 months project Guiding principle is open access, open usage, open source - Key to standards adoption -
  • 32.
  • 33.
  • 34. To facilitate THIS process! IP? What’s the structure? Are they in our file? What’s similar? What’s the Pharmacology target? data? Known Pathways? Competitors? Working On Connections Now? to disease? Expressed in right cell type?
  • 35. It’s not JUST structures of course…
  • 36. Taxol: Paclitaxel Bioassay Data  Most Bioassay data associated with structure with one ambiguous stereocenter
  • 37. Measuring data: dispensing dependencies Data from 2 AstraZeneca patents - Ephrin pharmacophores developed using data for 14 compounds with IC50. Different dispensing methods give different results. Impact hypotheses and could impact drug discovery. Acoustic Disposable tip Hydrophobic Hydrogen Hydrogen Observed vs. features (HPF) bond acceptor bond donor predicted IC50 (HBA) (HBD) r Acoustic mediated process 2 1 1 0.92 Disposable tip mediated process 0 2 1 0.80 Ekins, Olechno and Williams, Submitted 2012
  • 38. Measuring data: dispensing dependencies Acoustically-derived IC50 values were 1.5 to 276.5-fold lower than for tip-based dispensing • Pharmacophores and other computational models are used to guide medicinal chemistry. • Non tip-based methods may improve HTS results and avoid misleading computational and statistical models. • No analysis of influence of dispensing processes on data. • Public databases should annotate metadata to create larger datasets for comparing different computational methods. How much data is reproducible, accurate, valid? The challenge of high-throughput science.
  • 40. Acknowledgments  Sean Ekins  Christopher Lipinski  Joe Olechno  John Wilbanks  Drug Disambiguation project team  RSC Cheminformatics Team
  • 41. Thank you Email: williamsa@rsc.org Twitter: @chemconnector Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams Email: ekinssean@yahoo.com Twitter: collabchem Blog: http://www.collabchem.com/

Notes de l'éditeur

  1. Text edited