SlideShare une entreprise Scribd logo
1  sur  47
The Great Promise of Online Data for
    Chemistry and the Life Sciences

                           Antony J Williams
                      Silverchair Colloquium 2012
READ FAST – IT’S HAPPENING NOW

  20 minutes, >40 slides

Disruption Can be Cheap,
 Fast and Unexpectedly
        Successful
Online Chemistry Databases in 2007
A search gave LOTS of “info”..
What is Yohimbine?
For chemists…try filtering!
Why not Index the web of chemistry?
 Build a search engine for chemistry

 Index all public domain chemicals and link

 Build a structure searchable web

 Crowdsource new chemistry from the community

 Crowdsource curation and annotation
Create a structure-centric hub
Answering Real Questions
 Questions a chemist might ask…
   What is the melting point of n-heptanol?
   What is the chemical structure of Xanax?
   Chemically, what is phenolphthalein?
   What are the stereocenters of cholesterol?
   Where can I find publications about xylene?
   What are the different trade names for Ketoconazole?
   What is the NMR spectrum of Aspirin?
   What are the safety handling issues for Thymol Blue?
The World of Online Chemistry
   Safety data
   Toxicity data
   Blogs and Wikis
   Property databases
   Experimental results
   Scientific publications
   Compound aggregators
   Open Notebook Science
   Metabolic pathway databases
   Encyclopedic articles (Wikipedia)
Linked Data for Life Sciences growing…
Solve Real World Problems
 Provide programmable interface against content
 Provide a chemistry database tuned to integrators
RSC and ChemSpider – May 2009
Why RSC acquired ChemSpider
 Commitment to serve the community

 Bring cheminformatics expertise in-house

 Add additional data to publications

 Potential freemium model – web services, data

 Because data is critical to science
Making sense of data is overwhelming
Publications are Hosts to Data
Data has value, is Free, is Open
 Data cannot be copyrighted. A particular
  expression of data, such as a chart or table in a
  publication, can be.

 Data licensing is being dealt with and openness
  encouraged

 Research data mandates are starting…

 Who will manage the integration and curation
  and keep the access FREE!
Tell me about Yohimbine…
Of course it is out there…
SOME Chemistry Databases in 2012
Tell me more…but…
   Where can I find the electronic structure?
   Papers/Patents about Yohimbine?
   What are the side effects of Yohimbine?
   Where can I order Yohimbine?
   What are the physicochemical properties?
   What are the associated metabolic pathways?
   Different synonyms of Yohimbine?
   Are there side effects with Yohimbine?

 ChemSpider links all of this information and more
Yohimbine on ChemSpider
RSC Databases are Integrated
RSC Journals are Integrated
Patents are Linked
Google Books are Integrated
And so are…
   Chemical vendors
   Safety and Toxicity information
   Experimental and Predicted properties
   Analytical data
   Images and Movies

 And all for free…
And all “mobile”
Not only compounds but syntheses
And analytical data…
The world can take and contribute
 Scientists can deposit their data

 They can annotate and curate

 They can download data

 They can embed data in the social network

 They can integrate and connect
Integrate to electronic lab notebooks
Integrate to electronic lab notebooks
Integrate to instruments and software
 Primary analytical instrumentation vendors integrate

   Agilent, Bruker, Thermo, Waters


 Cheminformatics vendors link to ChemSpider

   Accelrys, ACD/Labs, ChemAxon, iChemLabs
Publications are a summary of work
 Scientific publications are a summary of work
   Is all work reported?
   How much science is lost to pruning?
   What of value sits in notebooks and is lost?

 How much data is lost?
   How many compounds never reported?
   How many syntheses fail or succeed?
   How many characterization measurements?
What if we could capture it all?
Start with data in publications
But in the time of Big Data…it’s linked!
ONE example – data for life sciences
                                                    IP?
                            What’s the
                            structure?
                                                Are they in
                                                 our file?
                              What’s
                             similar?
                                                What’s the
                          Pharmacology           target?
                              data?

                                          Known
                                        Pathways?
                         Competitors?
                                                Working On
                          Connections             Now?
                          to disease?
                                          Expressed in
                                         right cell type?
 Crowdsourcing across drug discovery
 Open PHACTS : partnership between European
  Community and European Pharma Companies
 22 partners, 8 pharmaceutical companies, 3
  biotechs working together for 3 years

 Freely accessible for knowledge discovery and
  verification.
    Data on chemistry and biology
    Pharmacological profiles
    Proprietary and public data sources.
All that glisters is not gold…
Crowdsourced Assertions
 The future of publishing will include generation
  and consumption of “nanopublications”




 http://www.nanopub.org/
Nanopublications??
So what’s the business model?
 Decisions are based on data

 Publications encapsulate, reference and link data

 More data is free and open. More services and
  APIS allow access – free or for fee. Ask Google

 The large-scale licensed content business model
  is at risk without interfaces to integrate and mine
Acknowledgments
 The RSC ChemSpider team

 Our users, our depositors, our curators

 GGA Software Services, OpenEye, ACD/Labs
  and a lot of Open Source code!

 And Al Gore for supporting the internet
http://
  en.wikipedia.org/wiki/Al_Gore_and_information_techn
Thank you

Email: williamsa@rsc.org
Twitter: ChemConnector
Personal Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams

Contenu connexe

Tendances

Tendances (20)

ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
ChemSpider as a Foundation for Crowdsourcing and Collaborations in Open Chemi...
 
Navigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpiderNavigating the Complex Web of Chemistry Using ChemSpider
Navigating the Complex Web of Chemistry Using ChemSpider
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
RSC ChemSpider – Building An Internet Based Community For Chemists
RSC ChemSpider – Building An Internet Based Community For ChemistsRSC ChemSpider – Building An Internet Based Community For Chemists
RSC ChemSpider – Building An Internet Based Community For Chemists
 
Building A Community Resource For The Life Sciences
Building A Community Resource For The Life SciencesBuilding A Community Resource For The Life Sciences
Building A Community Resource For The Life Sciences
 
Why Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpiderWhy Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpider
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider Crawling Across the Web of Chemistry Using ChemSpider
Crawling Across the Web of Chemistry Using ChemSpider
 
ChemSpider hosting linking and curating chemistry data for the community
ChemSpider  hosting linking and curating chemistry data for the communityChemSpider  hosting linking and curating chemistry data for the community
ChemSpider hosting linking and curating chemistry data for the community
 
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
ChemSpider - Building a Foundation for the Semantic Web by Hosting a Crowd So...
 
Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
Connecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpiderConnecting Chemists to the Internet Through ChemSpider
Connecting Chemists to the Internet Through ChemSpider
 
How the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data finalHow the web has weaved a web of interlinked chemistry data final
How the web has weaved a web of interlinked chemistry data final
 
Structure representations in public chemistry databases: The challenges of va...
Structure representations in public chemistry databases: The challenges of va...Structure representations in public chemistry databases: The challenges of va...
Structure representations in public chemistry databases: The challenges of va...
 
ChemSpider as a chemical term resolver
ChemSpider as a chemical term resolverChemSpider as a chemical term resolver
ChemSpider as a chemical term resolver
 
Chem spider as a chemical term resolver
Chem spider as a chemical term resolverChem spider as a chemical term resolver
Chem spider as a chemical term resolver
 
Taming The Wild West Of Internet Based Chemistry You Can Help
Taming The Wild West Of Internet Based Chemistry You Can HelpTaming The Wild West Of Internet Based Chemistry You Can Help
Taming The Wild West Of Internet Based Chemistry You Can Help
 
Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...
 
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
ChemSpider - Building a Crowdsourced Chemical Database for the Chemistry Comm...
 
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
 

En vedette

HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?Amit Jhunjhunwala
 
France presentation eleanor
France presentation eleanorFrance presentation eleanor
France presentation eleanorPhilip Copeland
 
φωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλαφωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλα3dimchan
 
индия
индияиндия
индияbanditka
 
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...Alan Quayle
 
DSA - delivering on the promise of bespoke support
DSA  - delivering on the promise of bespoke support DSA  - delivering on the promise of bespoke support
DSA - delivering on the promise of bespoke support iansyst
 
困髮族五大原因
困髮族五大原因困髮族五大原因
困髮族五大原因formosa858
 
Volunteer in Italy 2012
Volunteer in Italy 2012Volunteer in Italy 2012
Volunteer in Italy 2012AYAvolunteer
 
Tutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina FrancaTutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina FrancaMassimiliano Martucci
 
Slideshare
SlideshareSlideshare
Slidesharebolona
 
5434 avtodsdsdsds
5434 avtodsdsdsds5434 avtodsdsdsds
5434 avtodsdsdsdsNightLightW
 
Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)Яндекс.Деньги
 
Kalkulus 2 minggu 11
Kalkulus 2   minggu 11Kalkulus 2   minggu 11
Kalkulus 2 minggu 11Iwan Pranoto
 
Derivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mbaDerivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mbaBabasab Patil
 

En vedette (20)

HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
HOW TO SURVIVE A HEART ATTACK WHEN ALONE ?
 
France presentation eleanor
France presentation eleanorFrance presentation eleanor
France presentation eleanor
 
φωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλαφωτοσύνθεση εβελίνα και χρυσούλα
φωτοσύνθεση εβελίνα και χρυσούλα
 
индия
индияиндия
индия
 
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
Making Telecoms the Essential Spice of Every Business Ecosystem: The Slow, Pa...
 
DSA - delivering on the promise of bespoke support
DSA  - delivering on the promise of bespoke support DSA  - delivering on the promise of bespoke support
DSA - delivering on the promise of bespoke support
 
что такое вселенная
что такое вселеннаячто такое вселенная
что такое вселенная
 
困髮族五大原因
困髮族五大原因困髮族五大原因
困髮族五大原因
 
Volunteer in Italy 2012
Volunteer in Italy 2012Volunteer in Italy 2012
Volunteer in Italy 2012
 
Tutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina FrancaTutte le liste dei candidati alle elezioni 2012 a Martina Franca
Tutte le liste dei candidati alle elezioni 2012 a Martina Franca
 
Slideshare
SlideshareSlideshare
Slideshare
 
Kalender actie
Kalender actieKalender actie
Kalender actie
 
5434 avtodsdsdsds
5434 avtodsdsdsds5434 avtodsdsdsds
5434 avtodsdsdsds
 
Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)Системы электронных платежей в России (исследование TNS)
Системы электронных платежей в России (исследование TNS)
 
Caràcters poligénics. 2
Caràcters poligénics. 2Caràcters poligénics. 2
Caràcters poligénics. 2
 
Places in kolkata
Places in kolkataPlaces in kolkata
Places in kolkata
 
Fotoscurso
FotoscursoFotoscurso
Fotoscurso
 
Presentation1
Presentation1Presentation1
Presentation1
 
Kalkulus 2 minggu 11
Kalkulus 2   minggu 11Kalkulus 2   minggu 11
Kalkulus 2 minggu 11
 
Derivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mbaDerivatives markets ppt @ bec doms bagalkot mba
Derivatives markets ppt @ bec doms bagalkot mba
 

Similaire à The Great Promise of Online Data for Chemistry and the Life Sciences

Slides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinalSlides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinalSean Ekins
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Sean Ekins
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & PracticeSean Ekins
 
Chemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the communityChemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the communityRoyal Society of Chemistry
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 

Similaire à The Great Promise of Online Data for Chemistry and the Life Sciences (20)

Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...
 
Chemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the handChemistry made mobile – the expanding world of chemistry in the hand
Chemistry made mobile – the expanding world of chemistry in the hand
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Chemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScienceChemical Database Projects Delivered by RSC eScience
Chemical Database Projects Delivered by RSC eScience
 
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
ChemSpider – A Crowdsourcing Environment for Hosting and Validating Chemistry...
 
Slides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinalSlides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinal
 
Open Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific ResearchOpen Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific Research
 
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
 
Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...Acs collaborative computational technologies for biomedical research an enabl...
Acs collaborative computational technologies for biomedical research an enabl...
 
The future of scientific information & communication
The future of scientific information & communicationThe future of scientific information & communication
The future of scientific information & communication
 
Collaborative Computational Technologies for Biomedical Research: An Enabler ...
Collaborative Computational Technologies for Biomedical Research: An Enabler ...Collaborative Computational Technologies for Biomedical Research: An Enabler ...
Collaborative Computational Technologies for Biomedical Research: An Enabler ...
 
Crowdsourcing Chemistry for the Community – 5 Years of Experiences
Crowdsourcing Chemistry for the Community – 5 Years of ExperiencesCrowdsourcing Chemistry for the Community – 5 Years of Experiences
Crowdsourcing Chemistry for the Community – 5 Years of Experiences
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & Practice
 
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
ChemSpider - Does Community Engagement work to Build a Quality Online Resourc...
 
Chemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the communityChemspider hosting linking and curating chemistry data for the community
Chemspider hosting linking and curating chemistry data for the community
 
Qualifying Online Information Resources for Chemists
Qualifying Online Information Resources for ChemistsQualifying Online Information Resources for Chemists
Qualifying Online Information Resources for Chemists
 
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
 
Engaging participation from the chemistry community
Engaging participation from the chemistry communityEngaging participation from the chemistry community
Engaging participation from the chemistry community
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 

Dernier

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Dernier (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

The Great Promise of Online Data for Chemistry and the Life Sciences

  • 1. The Great Promise of Online Data for Chemistry and the Life Sciences Antony J Williams Silverchair Colloquium 2012
  • 2. READ FAST – IT’S HAPPENING NOW 20 minutes, >40 slides Disruption Can be Cheap, Fast and Unexpectedly Successful
  • 4. A search gave LOTS of “info”.. What is Yohimbine?
  • 6. Why not Index the web of chemistry?  Build a search engine for chemistry  Index all public domain chemicals and link  Build a structure searchable web  Crowdsource new chemistry from the community  Crowdsource curation and annotation
  • 8.
  • 9. Answering Real Questions  Questions a chemist might ask…  What is the melting point of n-heptanol?  What is the chemical structure of Xanax?  Chemically, what is phenolphthalein?  What are the stereocenters of cholesterol?  Where can I find publications about xylene?  What are the different trade names for Ketoconazole?  What is the NMR spectrum of Aspirin?  What are the safety handling issues for Thymol Blue?
  • 10. The World of Online Chemistry  Safety data  Toxicity data  Blogs and Wikis  Property databases  Experimental results  Scientific publications  Compound aggregators  Open Notebook Science  Metabolic pathway databases  Encyclopedic articles (Wikipedia)
  • 11. Linked Data for Life Sciences growing…
  • 12. Solve Real World Problems  Provide programmable interface against content  Provide a chemistry database tuned to integrators
  • 13. RSC and ChemSpider – May 2009
  • 14. Why RSC acquired ChemSpider  Commitment to serve the community  Bring cheminformatics expertise in-house  Add additional data to publications  Potential freemium model – web services, data  Because data is critical to science
  • 15. Making sense of data is overwhelming
  • 17. Data has value, is Free, is Open  Data cannot be copyrighted. A particular expression of data, such as a chart or table in a publication, can be.  Data licensing is being dealt with and openness encouraged  Research data mandates are starting…  Who will manage the integration and curation and keep the access FREE!
  • 18. Tell me about Yohimbine…
  • 19. Of course it is out there…
  • 21. Tell me more…but…  Where can I find the electronic structure?  Papers/Patents about Yohimbine?  What are the side effects of Yohimbine?  Where can I order Yohimbine?  What are the physicochemical properties?  What are the associated metabolic pathways?  Different synonyms of Yohimbine?  Are there side effects with Yohimbine?  ChemSpider links all of this information and more
  • 23. RSC Databases are Integrated
  • 24. RSC Journals are Integrated
  • 26. Google Books are Integrated
  • 27. And so are…  Chemical vendors  Safety and Toxicity information  Experimental and Predicted properties  Analytical data  Images and Movies  And all for free…
  • 29. Not only compounds but syntheses
  • 31. The world can take and contribute  Scientists can deposit their data  They can annotate and curate  They can download data  They can embed data in the social network  They can integrate and connect
  • 32. Integrate to electronic lab notebooks
  • 33. Integrate to electronic lab notebooks
  • 34. Integrate to instruments and software  Primary analytical instrumentation vendors integrate  Agilent, Bruker, Thermo, Waters  Cheminformatics vendors link to ChemSpider  Accelrys, ACD/Labs, ChemAxon, iChemLabs
  • 35. Publications are a summary of work  Scientific publications are a summary of work  Is all work reported?  How much science is lost to pruning?  What of value sits in notebooks and is lost?  How much data is lost?  How many compounds never reported?  How many syntheses fail or succeed?  How many characterization measurements?
  • 36. What if we could capture it all?
  • 37. Start with data in publications
  • 38. But in the time of Big Data…it’s linked!
  • 39. ONE example – data for life sciences IP? What’s the structure? Are they in our file? What’s similar? What’s the Pharmacology target? data? Known Pathways? Competitors? Working On Connections Now? to disease? Expressed in right cell type?
  • 40.  Crowdsourcing across drug discovery  Open PHACTS : partnership between European Community and European Pharma Companies  22 partners, 8 pharmaceutical companies, 3 biotechs working together for 3 years  Freely accessible for knowledge discovery and verification.  Data on chemistry and biology  Pharmacological profiles  Proprietary and public data sources.
  • 41.
  • 42. All that glisters is not gold…
  • 43. Crowdsourced Assertions  The future of publishing will include generation and consumption of “nanopublications”  http://www.nanopub.org/
  • 45. So what’s the business model?  Decisions are based on data  Publications encapsulate, reference and link data  More data is free and open. More services and APIS allow access – free or for fee. Ask Google  The large-scale licensed content business model is at risk without interfaces to integrate and mine
  • 46. Acknowledgments  The RSC ChemSpider team  Our users, our depositors, our curators  GGA Software Services, OpenEye, ACD/Labs and a lot of Open Source code!  And Al Gore for supporting the internet http:// en.wikipedia.org/wiki/Al_Gore_and_information_techn
  • 47. Thank you Email: williamsa@rsc.org Twitter: ChemConnector Personal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams