introduce a semantic framework for studying systems chemical biology / systems pharmacology, in which three major projects (Chem2Bio2RDF, Chem2Bio2OWL, SLAP (semantic link association prediction) are covered.
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
Towards semantic systems chemical biology
1. From Data Integration to Data mining in Semantic Web systems chemical biology as a case study Bin Chen School of Informatics and Computing Indiana University at Bloomington Lecture for S636 Nov 17, 2011
2.
3.
4.
5.
6. SPARQL RDF Ontology Algorithm and tools Applications Experimental Data Text mining Data Chem2Bio2RDF Chem2Bio2OWL Path finding; Association search; Association ranking and prediction Polypharmacology; drug side effect Architecture of Semantic Systems Chemical Biology
7.
8.
9. Use RDF to Integrate Data http://chem2bio2rdf.org/drugbank/DB01076 name company lipitor Pfizer http://chem2bio2rdf.org/drugbank/DB01076 Molecular_Weight formula 558.6398 C33H35FN2O5 Database 1 Database 2 Same URI, merged!
10. Use RDF to Link Data http://chem2bio2rdf.org/drugbank/DB01076 sameAs http://www4.wiwiss.fu-berlin.de/drugbank/resource/drugs/DB01076 http://chem2bio2rdf.org/pubchem/resource/pubchem_compound/60823 cid
11. uniprot Bio2RDF Others LODD Chem2Bio2RDF Virtuoso Triple store SPARQL ENDPOINTS Dereferenable URI Browsing PlotViz: Visualization Cytoscape Plugin Linked Path Generation and Ranking Third party tools
12. Workflow for RDF conversion XML CSV DB TXT Relational DB D2R Mapping D2R server Dumping Virtuoso Triple Store Scripts Ontology Publishing External Sources Download Local copy … Chen B,et al. Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data. BMC Bioinformatics. 2010
14. Node represents each database colored by its RDF vender; Directed edge shows the linkage from one dataset to another dataset, colored by the linkage type. E.g,., the type compound includes CID, CAS, ChEBI, DBID and so on. The size of nodes and the width of edges are dependent on the # of triples and # of linkages respectively. Chem2Bio2RDF Datasets http://chem2bio2rdf.org Chem2Bio2RDF data Other data venders compound protein/gene chemogenomics literature others
22. Node represents each database colored by its RDF vender; Directed edge shows the linkage from one dataset to another dataset, colored by the linkage type. E.g,., the type compound includes CID, CAS, ChEBI, DBID and so on. The size of nodes and the width of edges are dependent on the # of triples and # of linkages respectively. Chem2Bio2RDF Datasets http://chem2bio2rdf.org Chem2Bio2RDF data Other data venders compound protein/gene chemogenomics literature others
34. Two objects are similar if they are related to similar objects Coauthorship Same Target
35. Two objects are related if they share same objects or their related objects are related Compound 1 Protein 2 Protein 1 Compound 1 Protein 2 Protein 1 Compound 2 Computer Science Person2 Person 1 Computer Science Person2 paper1 paper2 advisor major publish cite conference
36. Cmpd1 Protein 1 Protein 2 Cmpd 2 Cmpd 1 Cmpd 2 Protein 1 Neighbor Chemogenomics Chemogenomics Chemogenomics Chemogenomics Protein 2 Cmpd1 Protein 1 Chemogenomics hasGO hasGO Protein 2 Cmpd1 Protein 1 Chemogenomics PPI GO:0001 Sample patterns Cmpd1 Protein 1 Cmpd 2 Chemogenomics hypertension Side effect Side effect Cmpd1 Protein 1 Cmpd 2 Chemogenomics Substructure substructure substructure
37. Target 2 Compound1 Compound 2 Compound 3 Target 3 GO:00001 hasGO hasGO chemogenomics chemogenomics chemogenomics chemogenomics chemogenomics neighbor Side Effect 1 hasSideEffect hasSideEffect Gene Family 1 hasGeneFamily hasGeneFamily Target 1 chemogenomics Target 4 chemogenomics proteinProteinInteraction Association depends on its neighborhood
38.
39. Statistical Model Convert the question to a path surfing problem Gene i Gene j PPI PPI PPI hasGO hasGO hasPathway chemogenomics P(i j) =1/3
Learn a basic and used several software, What, how,
What is scb, why we need seman, the whole architecture,
Antibacterial drug, 4 parts, data, we wanna ask can this drug have side effect?
Can we use semantic web, answered it by google or siri?
Remove logic, to application
Demo, links
Show indice search
To link them, 25 database; why need download (nobody else is doing) why relation database (already have, and doubt semantic web, data quality), ontology or mapping file is key
Show mapping file, demo one database, SQL no difference. show exhibit: http://cheminfov.informatics.indiana.edu:8080/exhibit/drugbank.html Show triples select * where { <http://chem2bio2rdf.org/drugbank/resource/drugbank_drug/DB00041> ?p ?o }
Here is the figure of all chem2bio2rdf datasets. Node represents each database colored by its RDF vender; red nodes are RDF data sources provided in chem2bio2rdf. Directed edge shows the linkage from one dataset to another dataset, colored by the linkage type. E.g,., the type compound includes CID, CAS, ChEBI, DBID and so on. The size of nodes and the width of edges are dependent on the # of triples and # of linkages respectively. Up to now, we already have over 110million triples
Show indice search
Now we can use our portal to answer a variety of questions in systems chemical biology. From the basic one like give me all info about this compound, to advanced one like linke kegg and pubchem to identify potential multiple pathway inhibitors for MAPK