SlideShare a Scribd company logo
1 of 38
Structure Identification Using High Resolution
Mass Spectrometry Data and the
EPA Chemistry Dashboard
Antony J. Williams†, Andrew McEachran, Jon Sobus,
Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,
Jordan Foster and Jeff Edwards
National Center for Computational Toxicology
U.S. Environmental Protection Agency, RTP, NC
August 21-25, 2016
ACS Fall Meeting, Philadelphia, PA
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
http://www.orcid.org/0000-0002-2668-4821
@ChemConnector on Twitter
Comparing Analysis Approaches
• Targeted Analysis:
- We know exactly what we’re looking for
- 10s – 100s of chemicals
• Suspect Screening Analysis (SSA):
- We have chemicals of interest
- 100s – 1,000s of chemicals
• Non-Targeted Analysis (NTA):
- We have no preconceived lists
- 1,000s – 10,000s of chemicals
- In dust, soil, food, air, water, products,
plants, animals, and…us!!
General Goals of SSA/NTA
- 1 Dust Sample
- Negative Ionization Mode
- 300 Extracted “Molecular Features”
1) Prioritize “Molecular Features”
2) Correctly assign formulas
3) Correctly assign structures
4) Determine chemical sources
5) Predict chemical concentrations
C17H19NO3 12 µg/g
(1)
(2) (3) (4) (5)
EXPOSURE
Non-targeted analysis challenges
• 3000-5000 molecular features in a given
sample
• Current technologies can identify up to 5%
• How can we improve identification???
– Simple workflows
– Reliable formula prediction (Instrument)
– Accurate ranking of likelihood (Databases)
The General Approach
Analytical Instruments Comp. Tools & Workflows
Databases
Previous Work with Suspect-Screening
We’re on the Right Path…
• … but certainly room for improvement
• ~thousands of molecular features (not unique)
• 33 confirmed chemicals
• State-of-the-art SSA yields <5% confirmed IDs
• So what else is in these (and other) samples??
2012: Definitive Study of Known-
Unknowns using ChemSpider
7
2012: Definitive Study of Known-
Unknowns using ChemSpider
8
ChemSpider
• http://www.chemspider.com
• Grows daily with new depositors and
annotations. Example Data Sources
9
Our New Dashboard
https://comptox.epa.gov
10
Bisphenol A
11
Physicochemical Properties
12
Bioassay Screening Data
13
Functional Use and Composition
14
External Links
15
National Environmental Methods Index
16
Advanced Search
17
Formula Searching
Formulae matching Bisphenol A
18
Formula Search Results
19
Download to Excel
20
Download as SDF file
21
SDF file opened in ChemFolder
22
Rank-ordering all of those hits??
• With so many hits how do you rank order
based on formulae? Or mass??
23
Comparing Performance
24
721k structures
Bisphenol A as an example
ChemSpider: 1564 Structures
25
Bisphenol A as an example
Dashboard: 215 Structures
26
A more pointed example…C15H15N3O2
6926 results
27
A more pointed example…C15H15N3O2
94 results
28
Particulate
Matter
Antibiotics used in
animal production
TYL= Tylosin
MON= Monensin
TC= Tetracycline
OTC= Oxytetracycline
CTC= Chlortetracycline
McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes:
aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555
Antibiotics in beef commercial feed
Mass-based Search Formula-based Search
Agricultural
Source
# Compounds Dashboard ChemSpider Dashboard ChemSpider
Wastewater land
application1
34 1.3 1.8 1.1 1.1
Cattle Feedyard2 5 1.0 1.0 1.0 1.0
1McEachran AD, Shea D, Bodnar W, Nichols EG. 2016. Pharmaceutical Occurrence in groundwater and surface waters in
forests land-applied with municipal wastewater. Environ Toxicol Chem 35: 898-905. DOI: 10.1002/etc.3216
2McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic
resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343;
DOI:10.1289/EHP.1408555
Mass-based Search Formula Based Search
Dashboard ChemSpider Dashboard ChemSpider
Tylosin 1/1 1/28 1/1 1/25
Monensin 1/1 1/39 1/1 1/24
Tetracycline 1/ 38 1/4008 1/11 1/355
Oxytetracycline 1/16 1/3271 1/3 1/110
Chlortetracycline 1/23 1/2545 1/3 1/77
Rank Position/Total # Results
Mean Rank Position
Rank-ordering Comparisons
Chemical Identification
Dashboard vs ChemSpider
Sorted by number of
references (ChemSpider)
or data sources
(Dashboard)
Monoisotopic Mass (+/- 0.005 amu) Search
Position of compound sorted
Source of List # of
Compounds
Search Tool Mean
Position
Median
Position #1 #2 #3 #4 #5+
McEachran et al
Wastewater
34 ChemSpider 1.8 1 28 5 0 0 1
Dashboard 1.3 1 31 2 0 0 1
Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1
Dashboard 1.7 1 10 2 0 0 1
Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1
Dashboard 1.6 1 12 3 3 1 0
Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4
Dashboard 1.08 1 22 2 0 0 0
Dashboard vs ChemSpider
Ranking Summary
Mass-based Searching Formula Based Searching
Dashboard ChemSpider Dashboard ChemSpider
Cumulative Average
Position 1.3 2.2 1.2 1.4
% in #1 Position 85% 70% 88% 80%
162 total individual chemicals in search
Functional Use to Sort Candidates
33
Anti-cancer Drug
Microbiological
Indicator Dye
Textile/Product Dye
Future Work
• Rank-ordering based on other criteria
• Already testing QSARs to build retention
time models for ranking
• External links to methods: e.g. CDC NIOSH
• Formula identification using isotope profiles
34
ToxCast
Chemicals
What impurities/
interaction products
found?
Engaging the MS Community
Conclusions
• Our NTA research is focused on understanding
our exposure to chemicals
• New dashboard with focus on high-quality
data – no large database will be perfect!
• Specific searches/functionality are being
developed with Non-targeted Analysis in mind
• Dashboard outperforms ChemSpider, a
community standard database, in ranking
chemicals of environmental concern
• Early work on new rank-ordering approaches
show that we can improve things even further.
36
Acknowledgements
EPA NCCT
Chris Grulke
Jeff Edwards
Ann Richard
Jordan Foster
Jennifer Smith
Andrew McEachran*
Michelle Krzyzanowski
EPA NERL
Kathie Dionisio
Katherine Phillips
Jon Sobus
Mark Strynar
Elin Ulrich
Seth Newton
* = ORISE Participant

More Related Content

What's hot

Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

What's hot (20)

The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...The needs for chemistry standards, database tools and data curation at the ch...
The needs for chemistry standards, database tools and data curation at the ch...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...Chemical identification of unknowns in high resolution mass spectrometry usin...
Chemical identification of unknowns in high resolution mass spectrometry usin...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
 
Open PHACTS Chemistry Platform Update and Learnings
Open PHACTS Chemistry Platform Update and Learnings Open PHACTS Chemistry Platform Update and Learnings
Open PHACTS Chemistry Platform Update and Learnings
 
Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...Accessing information for chemicals in hydraulic fracturing fluids using the ...
Accessing information for chemicals in hydraulic fracturing fluids using the ...
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
 
EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
 
Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...Adding complex expert knowledge into chemical database and transforming surfa...
Adding complex expert knowledge into chemical database and transforming surfa...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...What chemicals constitute the Exposome? Accessing data via the US EPA’s  Comp...
What chemicals constitute the Exposome? Accessing data via the US EPA’s Comp...
 
New Approach Methods - What is That?
New Approach Methods - What is That?New Approach Methods - What is That?
New Approach Methods - What is That?
 
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
 
Structure identification using high resolution mass spectrometry data and the...
Structure identification using high resolution mass spectrometry data and the...Structure identification using high resolution mass spectrometry data and the...
Structure identification using high resolution mass spectrometry data and the...
 

Viewers also liked

Conférence débat sur les réseaux sociaux
Conférence débat sur les réseaux sociauxConférence débat sur les réseaux sociaux
Conférence débat sur les réseaux sociaux
Régis Gautheron
 
Investigating Impact Metrics for Performance for the US-EPA National Center f...
Investigating Impact Metrics for Performance for the US-EPA National Center f...Investigating Impact Metrics for Performance for the US-EPA National Center f...
Investigating Impact Metrics for Performance for the US-EPA National Center f...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Shaping Expectations: Defining and Refining the Role of Technical Services in...
Shaping Expectations: Defining and Refining the Role of Technical Services in...Shaping Expectations: Defining and Refining the Role of Technical Services in...
Shaping Expectations: Defining and Refining the Role of Technical Services in...
NASIG
 
Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Viewers also liked (17)

Conférence débat sur les réseaux sociaux
Conférence débat sur les réseaux sociauxConférence débat sur les réseaux sociaux
Conférence débat sur les réseaux sociaux
 
Using Ecological Momentary Assessment to Examine Post-food Consumption Affect...
Using Ecological Momentary Assessment to Examine Post-food Consumption Affect...Using Ecological Momentary Assessment to Examine Post-food Consumption Affect...
Using Ecological Momentary Assessment to Examine Post-food Consumption Affect...
 
NSF Data Management Requirements 101
NSF Data Management Requirements 101NSF Data Management Requirements 101
NSF Data Management Requirements 101
 
From Data Availability to Information Accessibility: The WellWiki Project
From Data Availability to Information Accessibility: The WellWiki ProjectFrom Data Availability to Information Accessibility: The WellWiki Project
From Data Availability to Information Accessibility: The WellWiki Project
 
How One Monkey on a Typewriter Made a Difference to Online Chemistry
How One Monkey on a Typewriter Made a Difference to Online ChemistryHow One Monkey on a Typewriter Made a Difference to Online Chemistry
How One Monkey on a Typewriter Made a Difference to Online Chemistry
 
Simple Springshare Mashups: Cross-Platform Strategies for Repurposing Digital...
Simple Springshare Mashups: Cross-Platform Strategies for Repurposing Digital...Simple Springshare Mashups: Cross-Platform Strategies for Repurposing Digital...
Simple Springshare Mashups: Cross-Platform Strategies for Repurposing Digital...
 
SMS Berlin 2016 Cultural Perspectives on Strategic Management
SMS Berlin 2016 Cultural Perspectives on Strategic ManagementSMS Berlin 2016 Cultural Perspectives on Strategic Management
SMS Berlin 2016 Cultural Perspectives on Strategic Management
 
Investigating Impact Metrics for Performance for the US-EPA National Center f...
Investigating Impact Metrics for Performance for the US-EPA National Center f...Investigating Impact Metrics for Performance for the US-EPA National Center f...
Investigating Impact Metrics for Performance for the US-EPA National Center f...
 
A Bird in the Hand: Leveraging ILL Requests to Improve Electronic Resource A...
A Bird in the Hand: Leveraging ILL Requests to Improve Electronic Resource A...A Bird in the Hand: Leveraging ILL Requests to Improve Electronic Resource A...
A Bird in the Hand: Leveraging ILL Requests to Improve Electronic Resource A...
 
Social Media Tools for Scientists and Building an Online Profile
Social Media Tools for Scientists and Building an Online ProfileSocial Media Tools for Scientists and Building an Online Profile
Social Media Tools for Scientists and Building an Online Profile
 
Shaping Expectations: Defining and Refining the Role of Technical Services in...
Shaping Expectations: Defining and Refining the Role of Technical Services in...Shaping Expectations: Defining and Refining the Role of Technical Services in...
Shaping Expectations: Defining and Refining the Role of Technical Services in...
 
Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...
 
Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...
 
Going Concerns: A Perspective from the Nexus of Business, Culture and Instit...
Going Concerns:  A Perspective from the Nexus of Business, Culture and Instit...Going Concerns:  A Perspective from the Nexus of Business, Culture and Instit...
Going Concerns: A Perspective from the Nexus of Business, Culture and Instit...
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2016 bergen-sars
2016 bergen-sars2016 bergen-sars
2016 bergen-sars
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 

Similar to Structure Identification Using High Resolution Mass Spectrometry Data and the EPA’s Chemistry Dashboard

Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Kamel Mansouri
 
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicalsEPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
Kamel Mansouri
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Similar to Structure Identification Using High Resolution Mass Spectrometry Data and the EPA’s Chemistry Dashboard (20)

Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicalsEPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
EPA’s CompTox Chemicals Dashboard, a tool with information on ~900,000 chemicals
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor ActivityCoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...Integrating Mass Spectrometry  Non-Targeted Analysis and Computational Chemis...
Integrating Mass Spectrometry Non-Targeted Analysis and Computational Chemis...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
CERAPP - Collaborative Estrogen Receptor Activity Prediction Project. Computa...
 
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
 
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...Web-based access to data for >600 disinfection by-products via the EPA CompTo...
Web-based access to data for >600 disinfection by-products via the EPA CompTo...
 
Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...Web-based access to experimental and predicted data for environmental fate, t...
Web-based access to experimental and predicted data for environmental fate, t...
 
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design  of MoleculesUS-EPA Chemicals Dashboard and Applications to Digital Design  of Molecules
US-EPA Chemicals Dashboard and Applications to Digital Design of Molecules
 

Recently uploaded

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 

Structure Identification Using High Resolution Mass Spectrometry Data and the EPA’s Chemistry Dashboard

  • 1. Structure Identification Using High Resolution Mass Spectrometry Data and the EPA Chemistry Dashboard Antony J. Williams†, Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski, Jordan Foster and Jeff Edwards National Center for Computational Toxicology U.S. Environmental Protection Agency, RTP, NC August 21-25, 2016 ACS Fall Meeting, Philadelphia, PA The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA http://www.orcid.org/0000-0002-2668-4821 @ChemConnector on Twitter
  • 2. Comparing Analysis Approaches • Targeted Analysis: - We know exactly what we’re looking for - 10s – 100s of chemicals • Suspect Screening Analysis (SSA): - We have chemicals of interest - 100s – 1,000s of chemicals • Non-Targeted Analysis (NTA): - We have no preconceived lists - 1,000s – 10,000s of chemicals - In dust, soil, food, air, water, products, plants, animals, and…us!!
  • 3. General Goals of SSA/NTA - 1 Dust Sample - Negative Ionization Mode - 300 Extracted “Molecular Features” 1) Prioritize “Molecular Features” 2) Correctly assign formulas 3) Correctly assign structures 4) Determine chemical sources 5) Predict chemical concentrations C17H19NO3 12 µg/g (1) (2) (3) (4) (5) EXPOSURE
  • 4. Non-targeted analysis challenges • 3000-5000 molecular features in a given sample • Current technologies can identify up to 5% • How can we improve identification??? – Simple workflows – Reliable formula prediction (Instrument) – Accurate ranking of likelihood (Databases)
  • 5. The General Approach Analytical Instruments Comp. Tools & Workflows Databases
  • 6. Previous Work with Suspect-Screening
  • 7. We’re on the Right Path… • … but certainly room for improvement • ~thousands of molecular features (not unique) • 33 confirmed chemicals • State-of-the-art SSA yields <5% confirmed IDs • So what else is in these (and other) samples??
  • 8. 2012: Definitive Study of Known- Unknowns using ChemSpider 7
  • 9. 2012: Definitive Study of Known- Unknowns using ChemSpider 8
  • 10. ChemSpider • http://www.chemspider.com • Grows daily with new depositors and annotations. Example Data Sources 9
  • 15. Functional Use and Composition 14
  • 22. Download as SDF file 21
  • 23. SDF file opened in ChemFolder 22
  • 24. Rank-ordering all of those hits?? • With so many hits how do you rank order based on formulae? Or mass?? 23
  • 26. Bisphenol A as an example ChemSpider: 1564 Structures 25
  • 27. Bisphenol A as an example Dashboard: 215 Structures 26
  • 28. A more pointed example…C15H15N3O2 6926 results 27
  • 29. A more pointed example…C15H15N3O2 94 results 28
  • 30. Particulate Matter Antibiotics used in animal production TYL= Tylosin MON= Monensin TC= Tetracycline OTC= Oxytetracycline CTC= Chlortetracycline McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555 Antibiotics in beef commercial feed
  • 31. Mass-based Search Formula-based Search Agricultural Source # Compounds Dashboard ChemSpider Dashboard ChemSpider Wastewater land application1 34 1.3 1.8 1.1 1.1 Cattle Feedyard2 5 1.0 1.0 1.0 1.0 1McEachran AD, Shea D, Bodnar W, Nichols EG. 2016. Pharmaceutical Occurrence in groundwater and surface waters in forests land-applied with municipal wastewater. Environ Toxicol Chem 35: 898-905. DOI: 10.1002/etc.3216 2McEachran AD, Blackwell BR, Hanson JD, Wooten KJ, Mayer GD, Cox SB, Smith PN. 2015. Antibiotics, bacteria, and antibiotic resistance genes: aerial transport from cattle feed yards via particulate matter. Environ Health Perspect 123:337-343; DOI:10.1289/EHP.1408555 Mass-based Search Formula Based Search Dashboard ChemSpider Dashboard ChemSpider Tylosin 1/1 1/28 1/1 1/25 Monensin 1/1 1/39 1/1 1/24 Tetracycline 1/ 38 1/4008 1/11 1/355 Oxytetracycline 1/16 1/3271 1/3 1/110 Chlortetracycline 1/23 1/2545 1/3 1/77 Rank Position/Total # Results Mean Rank Position Rank-ordering Comparisons
  • 32. Chemical Identification Dashboard vs ChemSpider Sorted by number of references (ChemSpider) or data sources (Dashboard) Monoisotopic Mass (+/- 0.005 amu) Search Position of compound sorted Source of List # of Compounds Search Tool Mean Position Median Position #1 #2 #3 #4 #5+ McEachran et al Wastewater 34 ChemSpider 1.8 1 28 5 0 0 1 Dashboard 1.3 1 31 2 0 0 1 Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1 Dashboard 1.7 1 10 2 0 0 1 Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1 Dashboard 1.6 1 12 3 3 1 0 Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4 Dashboard 1.08 1 22 2 0 0 0
  • 33. Dashboard vs ChemSpider Ranking Summary Mass-based Searching Formula Based Searching Dashboard ChemSpider Dashboard ChemSpider Cumulative Average Position 1.3 2.2 1.2 1.4 % in #1 Position 85% 70% 88% 80% 162 total individual chemicals in search
  • 34. Functional Use to Sort Candidates 33 Anti-cancer Drug Microbiological Indicator Dye Textile/Product Dye
  • 35. Future Work • Rank-ordering based on other criteria • Already testing QSARs to build retention time models for ranking • External links to methods: e.g. CDC NIOSH • Formula identification using isotope profiles 34
  • 37. Conclusions • Our NTA research is focused on understanding our exposure to chemicals • New dashboard with focus on high-quality data – no large database will be perfect! • Specific searches/functionality are being developed with Non-targeted Analysis in mind • Dashboard outperforms ChemSpider, a community standard database, in ranking chemicals of environmental concern • Early work on new rank-ordering approaches show that we can improve things even further. 36
  • 38. Acknowledgements EPA NCCT Chris Grulke Jeff Edwards Ann Richard Jordan Foster Jennifer Smith Andrew McEachran* Michelle Krzyzanowski EPA NERL Kathie Dionisio Katherine Phillips Jon Sobus Mark Strynar Elin Ulrich Seth Newton * = ORISE Participant

Editor's Notes

  1. For example- Rager was actually 33 confirmed; Bade was 25