SlideShare une entreprise Scribd logo
1  sur  47
The Great Promise of Online Data for
    Chemistry and the Life Sciences

                           Antony J Williams
                      Silverchair Colloquium 2012
READ FAST – IT’S HAPPENING NOW

  20 minutes, >40 slides

Disruption Can be Cheap,
 Fast and Unexpectedly
        Successful
Online Chemistry Databases in 2007
A search gave LOTS of “info”..
What is Yohimbine?
For chemists…try filtering!
Why not Index the web of chemistry?
 Build a search engine for chemistry

 Index all public domain chemicals and link

 Build a structure searchable web

 Crowdsource new chemistry from the community

 Crowdsource curation and annotation
Create a structure-centric hub
Answering Real Questions
 Questions a chemist might ask…
   What is the melting point of n-heptanol?
   What is the chemical structure of Xanax?
   Chemically, what is phenolphthalein?
   What are the stereocenters of cholesterol?
   Where can I find publications about xylene?
   What are the different trade names for Ketoconazole?
   What is the NMR spectrum of Aspirin?
   What are the safety handling issues for Thymol Blue?
The World of Online Chemistry
   Safety data
   Toxicity data
   Blogs and Wikis
   Property databases
   Experimental results
   Scientific publications
   Compound aggregators
   Open Notebook Science
   Metabolic pathway databases
   Encyclopedic articles (Wikipedia)
Linked Data for Life Sciences growing…
Solve Real World Problems
 Provide programmable interface against content
 Provide a chemistry database tuned to integrators
RSC and ChemSpider – May 2009
Why RSC acquired ChemSpider
 Commitment to serve the community

 Bring cheminformatics expertise in-house

 Add additional data to publications

 Potential freemium model – web services, data

 Because data is critical to science
Making sense of data is overwhelming
Publications are Hosts to Data
Data has value, is Free, is Open
 Data cannot be copyrighted. A particular
  expression of data, such as a chart or table in a
  publication, can be.

 Data licensing is being dealt with and openness
  encouraged

 Research data mandates are starting…

 Who will manage the integration and curation
  and keep the access FREE!
Tell me about Yohimbine…
Of course it is out there…
SOME Chemistry Databases in 2012
Tell me more…but…
   Where can I find the electronic structure?
   Papers/Patents about Yohimbine?
   What are the side effects of Yohimbine?
   Where can I order Yohimbine?
   What are the physicochemical properties?
   What are the associated metabolic pathways?
   Different synonyms of Yohimbine?
   Are there side effects with Yohimbine?

 ChemSpider links all of this information and more
Yohimbine on ChemSpider
RSC Databases are Integrated
RSC Journals are Integrated
Patents are Linked
Google Books are Integrated
And so are…
   Chemical vendors
   Safety and Toxicity information
   Experimental and Predicted properties
   Analytical data
   Images and Movies

 And all for free…
And all “mobile”
Not only compounds but syntheses
And analytical data…
The world can take and contribute
 Scientists can deposit their data

 They can annotate and curate

 They can download data

 They can embed data in the social network

 They can integrate and connect
Integrate to electronic lab notebooks
Integrate to electronic lab notebooks
Integrate to instruments and software
 Primary analytical instrumentation vendors integrate

   Agilent, Bruker, Thermo, Waters


 Cheminformatics vendors link to ChemSpider

   Accelrys, ACD/Labs, ChemAxon, iChemLabs
Publications are a summary of work
 Scientific publications are a summary of work
   Is all work reported?
   How much science is lost to pruning?
   What of value sits in notebooks and is lost?

 How much data is lost?
   How many compounds never reported?
   How many syntheses fail or succeed?
   How many characterization measurements?
What if we could capture it all?
Start with data in publications
But in the time of Big Data…it’s linked!
ONE example – data for life sciences
                                                    IP?
                            What’s the
                            structure?
                                                Are they in
                                                 our file?
                              What’s
                             similar?
                                                What’s the
                          Pharmacology           target?
                              data?

                                          Known
                                        Pathways?
                         Competitors?
                                                Working On
                          Connections             Now?
                          to disease?
                                          Expressed in
                                         right cell type?
 Crowdsourcing across drug discovery
 Open PHACTS : partnership between European
  Community and European Pharma Companies
 22 partners, 8 pharmaceutical companies, 3
  biotechs working together for 3 years

 Freely accessible for knowledge discovery and
  verification.
    Data on chemistry and biology
    Pharmacological profiles
    Proprietary and public data sources.
All that glisters is not gold…
Crowdsourced Assertions
 The future of publishing will include generation
  and consumption of “nanopublications”




 http://www.nanopub.org/
Nanopublications??
So what’s the business model?
 Decisions are based on data

 Publications encapsulate, reference and link data

 More data is free and open. More services and
  APIS allow access – free or for fee. Ask Google

 The large-scale licensed content business model
  is at risk without interfaces to integrate and mine
Acknowledgments
 The RSC ChemSpider team

 Our users, our depositors, our curators

 GGA Software Services, OpenEye, ACD/Labs
  and a lot of Open Source code!

 And Al Gore for supporting the internet
http://
  en.wikipedia.org/wiki/Al_Gore_and_information_techn
Thank you

Email: williamsa@rsc.org
Twitter: ChemConnector
Personal Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams

Contenu connexe

Tendances

ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Why Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpiderWhy Chemistry and the Web Will Benefit from a ChemSpider
ChemSpider hosting linking and curating chemistry data for the community
ChemSpider  hosting linking and curating chemistry data for the communityChemSpider  hosting linking and curating chemistry data for the community
ChemSpider hosting linking and curating chemistry data for the community
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Tendances (20)

ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
 
Navigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpiderNavigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpider
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
RSC ChemSpider – Building An Internet Based Community For Chemists
RSC ChemSpider – Building An Internet Based Community For ChemistsRSC ChemSpider – Building An Internet Based Community For Chemists
RSC ChemSpider – Building An Internet Based Community For Chemists
 
Building A Community Resource For The Life Sciences
Building A Community Resource For The Life SciencesBuilding A Community Resource For The Life Sciences
Building A Community Resource For The Life Sciences
 
Why Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpiderWhy Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpider
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider
 
ChemSpider hosting linking and curating chemistry data for the community
ChemSpider  hosting linking and curating chemistry data for the communityChemSpider  hosting linking and curating chemistry data for the community
ChemSpider hosting linking and curating chemistry data for the community
 
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
 
Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
Connecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpiderConnecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpider
 
How the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data finalHow the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data final
 
Structure representations in public chemistry databases: The challenges of va...
Structure representations in public chemistry databases: The challenges of va...Structure representations in public chemistry databases: The challenges of va...
Structure representations in public chemistry databases: The challenges of va...
 
ChemSpider as a chemical term resolver
ChemSpider as a chemical term resolverChemSpider as a chemical term resolver
ChemSpider as a chemical term resolver
 
Chem spider as a chemical term resolver
Chem spider as a chemical term resolverChem spider as a chemical term resolver
Chem spider as a chemical term resolver
 
Taming The Wild West Of Internet Based Chemistry You Can Help
Taming The Wild West Of Internet Based Chemistry You Can HelpTaming The Wild West Of Internet Based Chemistry You Can Help
Taming The Wild West Of Internet Based Chemistry You Can Help
 
Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...
 
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
 
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
 

En vedette

France presentation eleanor
France presentation eleanorFrance presentation eleanor
France presentation eleanor
Philip Copeland
 
φωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλαφωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλα
3dimchan
 
困髮族五大原因
困髮族五大原因困髮族五大原因
困髮族五大原因
formosa858
 
Slideshare
SlideshareSlideshare
Slideshare
bolona
 
Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)
Яндекс.Деньги
 

En vedette (20)

HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
 
France presentation eleanor
France presentation eleanorFrance presentation eleanor
France presentation eleanor
 
φωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλαφωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλα
 
индия
индияиндия
индия
 
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
 
DSA - delivering on the promise of bespoke support
DSA  - delivering on the promise of bespoke support DSA  - delivering on the promise of bespoke support
DSA - delivering on the promise of bespoke support
 
что такое вселенная
что такое вселеннаячто такое вселенная
что такое вселенная
 
困髮族五大原因
困髮族五大原因困髮族五大原因
困髮族五大原因
 
Volunteer in Italy 2012
Volunteer in Italy 2012Volunteer in Italy 2012
Volunteer in Italy 2012
 
Tutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina FrancaTutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina Franca
 
Slideshare
SlideshareSlideshare
Slideshare
 
Kalender actie
Kalender actieKalender actie
Kalender actie
 
5434 avtodsdsdsds
5434 avtodsdsdsds5434 avtodsdsdsds
5434 avtodsdsdsds
 
Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)
 
Caràcters poligénics. 2
Caràcters poligénics. 2Caràcters poligénics. 2
Caràcters poligénics. 2
 
Places in kolkata
Places in kolkataPlaces in kolkata
Places in kolkata
 
Fotoscurso
FotoscursoFotoscurso
Fotoscurso
 
Presentation1
Presentation1Presentation1
Presentation1
 
Kalkulus 2 minggu 11
Kalkulus 2   minggu 11Kalkulus 2   minggu 11
Kalkulus 2 minggu 11
 
Derivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mbaDerivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mba
 

Similaire à The Great Promise of Online Data for Chemistry and the Life Sciences

Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...
Sean Ekins
 
The future of scientific information & communication
The future of scientific information & communicationThe future of scientific information & communication
Chemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the communityChemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the community
Royal Society of Chemistry
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 

Similaire à The Great Promise of Online Data for Chemistry and the Life Sciences (20)

Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...
 
Chemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the handChemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the hand
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Chemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScienceChemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScience
 
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
 
Slides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinalSlides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinal
 
Open Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific ResearchOpen Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific Research
 
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...
 
The future of scientific information & communication
The future of scientific information & communicationThe future of scientific information & communication
The future of scientific information & communication
 
Collaborative Computational Technologies for Biomedical Research: An Enabler ...
Collaborative Computational Technologies for Biomedical Research: An Enabler ...Collaborative Computational Technologies for Biomedical Research: An Enabler ...
Collaborative Computational Technologies for Biomedical Research: An Enabler ...
 
Crowdsourcing Chemistry for the Community – 5 Years of Experiences
Crowdsourcing Chemistry for the Community – 5 Years of ExperiencesCrowdsourcing Chemistry for the Community – 5 Years of Experiences
Crowdsourcing Chemistry for the Community – 5 Years of Experiences
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & Practice
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
Chemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the communityChemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the community
 
Qualifying Online Information Resources for Chemists
Qualifying Online Information Resources for ChemistsQualifying Online Information Resources for Chemists
Qualifying Online Information Resources for Chemists
 
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
 
Engaging participation from the chemistry community
Engaging participation from the chemistry communityEngaging participation from the chemistry community
Engaging participation from the chemistry community
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Dernier (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

The Great Promise of Online Data for Chemistry and the Life Sciences

  • 1. The Great Promise of Online Data for Chemistry and the Life Sciences Antony J Williams Silverchair Colloquium 2012
  • 2. READ FAST – IT’S HAPPENING NOW 20 minutes, >40 slides Disruption Can be Cheap, Fast and Unexpectedly Successful
  • 4. A search gave LOTS of “info”.. What is Yohimbine?
  • 6. Why not Index the web of chemistry?  Build a search engine for chemistry  Index all public domain chemicals and link  Build a structure searchable web  Crowdsource new chemistry from the community  Crowdsource curation and annotation
  • 8.
  • 9. Answering Real Questions  Questions a chemist might ask…  What is the melting point of n-heptanol?  What is the chemical structure of Xanax?  Chemically, what is phenolphthalein?  What are the stereocenters of cholesterol?  Where can I find publications about xylene?  What are the different trade names for Ketoconazole?  What is the NMR spectrum of Aspirin?  What are the safety handling issues for Thymol Blue?
  • 10. The World of Online Chemistry  Safety data  Toxicity data  Blogs and Wikis  Property databases  Experimental results  Scientific publications  Compound aggregators  Open Notebook Science  Metabolic pathway databases  Encyclopedic articles (Wikipedia)
  • 11. Linked Data for Life Sciences growing…
  • 12. Solve Real World Problems  Provide programmable interface against content  Provide a chemistry database tuned to integrators
  • 13. RSC and ChemSpider – May 2009
  • 14. Why RSC acquired ChemSpider  Commitment to serve the community  Bring cheminformatics expertise in-house  Add additional data to publications  Potential freemium model – web services, data  Because data is critical to science
  • 15. Making sense of data is overwhelming
  • 17. Data has value, is Free, is Open  Data cannot be copyrighted. A particular expression of data, such as a chart or table in a publication, can be.  Data licensing is being dealt with and openness encouraged  Research data mandates are starting…  Who will manage the integration and curation and keep the access FREE!
  • 18. Tell me about Yohimbine…
  • 19. Of course it is out there…
  • 21. Tell me more…but…  Where can I find the electronic structure?  Papers/Patents about Yohimbine?  What are the side effects of Yohimbine?  Where can I order Yohimbine?  What are the physicochemical properties?  What are the associated metabolic pathways?  Different synonyms of Yohimbine?  Are there side effects with Yohimbine?  ChemSpider links all of this information and more
  • 23. RSC Databases are Integrated
  • 24. RSC Journals are Integrated
  • 26. Google Books are Integrated
  • 27. And so are…  Chemical vendors  Safety and Toxicity information  Experimental and Predicted properties  Analytical data  Images and Movies  And all for free…
  • 29. Not only compounds but syntheses
  • 31. The world can take and contribute  Scientists can deposit their data  They can annotate and curate  They can download data  They can embed data in the social network  They can integrate and connect
  • 32. Integrate to electronic lab notebooks
  • 33. Integrate to electronic lab notebooks
  • 34. Integrate to instruments and software  Primary analytical instrumentation vendors integrate  Agilent, Bruker, Thermo, Waters  Cheminformatics vendors link to ChemSpider  Accelrys, ACD/Labs, ChemAxon, iChemLabs
  • 35. Publications are a summary of work  Scientific publications are a summary of work  Is all work reported?  How much science is lost to pruning?  What of value sits in notebooks and is lost?  How much data is lost?  How many compounds never reported?  How many syntheses fail or succeed?  How many characterization measurements?
  • 36. What if we could capture it all?
  • 37. Start with data in publications
  • 38. But in the time of Big Data…it’s linked!
  • 39. ONE example – data for life sciences IP? What’s the structure? Are they in our file? What’s similar? What’s the Pharmacology target? data? Known Pathways? Competitors? Working On Connections Now? to disease? Expressed in right cell type?
  • 40.  Crowdsourcing across drug discovery  Open PHACTS : partnership between European Community and European Pharma Companies  22 partners, 8 pharmaceutical companies, 3 biotechs working together for 3 years  Freely accessible for knowledge discovery and verification.  Data on chemistry and biology  Pharmacological profiles  Proprietary and public data sources.
  • 41.
  • 42. All that glisters is not gold…
  • 43. Crowdsourced Assertions  The future of publishing will include generation and consumption of “nanopublications”  http://www.nanopub.org/
  • 45. So what’s the business model?  Decisions are based on data  Publications encapsulate, reference and link data  More data is free and open. More services and APIS allow access – free or for fee. Ask Google  The large-scale licensed content business model is at risk without interfaces to integrate and mine
  • 46. Acknowledgments  The RSC ChemSpider team  Our users, our depositors, our curators  GGA Software Services, OpenEye, ACD/Labs and a lot of Open Source code!  And Al Gore for supporting the internet http:// en.wikipedia.org/wiki/Al_Gore_and_information_techn
  • 47. Thank you Email: williamsa@rsc.org Twitter: ChemConnector Personal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams