Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Comparison of Compounds-to-targets between Databases
1. Comparison of Compound-to-Target
Relationships in Chemogenomic and
Drug Databases
Aprill 2012 update: FYI these two blog posts are on the same theme
http://cdsouthan.blogspot.se/2012/01/our-human-beta-lactamase-is-not_09.html
http://cdsouthan.blogspot.se/2011/08/compound-to-target-mappings-part-i.html
Chris Southan
ChrisDS Consulting, Göteborg, Sweden,
Presented to the NCBI PubChem team on 11 April. the BioIT World
Chemogenomics and Toxicogenomics Workshop on 12 April Boston, USA, and
as a shorter version, the ChEMBL users meeting at the EBI, 27 may 2011
[1]
2. Aknowledgments and Context
• I profoundly appreciate the efforts of those who develop, manage
and maintain public resources specified here and many others I
enjoy acessing
• I have some history in evaluating the utility, exploitation and
content quality of both bioinformatics and cheminformatics
databases. I thus enjoy the dual roles (roughly in equal parts) of
both fan and critic
• All databases have imperfections. This presentation investigates a
selection of these but critical analysis should not be missinterpreted
as disparaging either the quality of primary sources or the work of
curators and database teams
[2]
3. Outline
• Mapping concepts sources and challenges
• Extremes of the distribution
• Atorvastatin, drug-to-targets
• Hmg-CoA reductase target-to-drugs
• Equivocal mapping examples
• Exploring data intersects
• Complex targets
• Conclusions and outlook
[3]
4. Activity-to-compound-to-protein Mapping:
Capturing Relationships Between four Concepts
MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGG
APLGLRLPRETDEEPEEPGRRGSFVEMVDNLRGKSGQ
GYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHR
YYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSI
PHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEI
ARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSE
VLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIV
RVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFE
AAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNI
FPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDD
CYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRIGFAV
SACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTL
MTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHD
DFADDISLLK
Document Assay Result Compound Protein
Expert extraction and curation
Unstructured data Structured data
Papers & Patents Databases
[4]
13. Target Matrix for Atorvastatin
Swiss-Prot ChEMBL TTD DrugBa PubChem
(BindingD nk
B)
HMDH_HUMAN X X X (PDB) X
HMDH_RAT X X
DPP4_HUMAN X
DPP4_PIG X
AHR_HUMAN X
[13]
19. Swiss-Prot Target Intersects
• 1,627 results for database:(type:drugbank)
• 297 results for database:(type:bindingdb)
• 45 results for database:(type:bindingdb) AND
database:(type:drugbank) AND organism:"Homo sapiens
[19]
29. APP: Indirect Target, three mechanisms
“for small molecules that suppress the Amyloid Precursor Protein (APP)
translation by binding to the 5'Untranslated Region of the APP mRNA
[29]
33. Mycophenolic acid and Prodrug: Complex mappings
• Primary Target human IMPDH2
• IMPDH1 ?
• IMPDH2 hamster
• IMPDH2 Tritrichomonas
• myfortic is an enteric-coated formulation of MPA in a delayed-
release tablet.
[33]
34. Conclusions
• Compared to what we had even a few years ago, let alone in
LBPC (life-before-PubChem) these compound-to-protein
sources are fantastic
• However, most things that could go wrong have
• We don’t often see QC statistics
• Data coverage is patchy, ad hoc and can be circular
• If you operate on these data at large scale you have no choice
but to ”trust and filter”
• If detailed realationships are important you need to ”verify and
judge” back to the primary source
• You can only really do this if you have at least some in vitro
background rather than just in silico
[34]
35. Wouldn’t it be nice if we had ....
• Interpreted mapping distribution statisitcs for each database
• Details about extraction triages, curation rules and parsing logic
• Harmonisation of mapping rules and cross-comparison of content
• Clear declarations and statistics of circularity between databases
• Curator judgments overuling document primacy
• Consolidated and extended Swiss-Prot cross-references
• Assay and target ontologies (Pistoia ? Open Phacts ?)
• “Standardization of Enzyme Data” (STRENDA, http://www.beilstein-
institut.de/en/projekte/strenda/)
• “Minimum Information About a Bioactive Entity” (MIABE,
http://www.psidev.info/index.php?q=node/394)
[35]