SlideShare une entreprise Scribd logo
1  sur  1
Looking closer at the failed samples provides some insight to the nature of the failure.  ID #3 failed specifically because of an unassigned 2D peak appearing at 17.8 ppm and 2.2 ppm.  It was determined that this peak was due to the aromatic methyl group in the structure.  The reason for a lack of consistency between the predicted and experimental chemical shifts in this case were due to a slow rotation around the N-CO bond.  As a result, this rotamer produces an experimental spectrum that looks like a mixture.  The software was unable to accurately predict this mixture of forms based on the experimental conditions, and as a result, the predicted spectrum did not match the experimental and the sample was flagged for manual analysis.  Figure 2.   ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond.  The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts.  The software also flagged the sample ID #7 to be considered for closer inspection.  In this particular case, the issue with this spectrum was determined to be based on the presence of a mixture.  Based on some of the spectral features in both the  1 H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol resulting in a mixture of both the brominated and hydroxylated products.  As a result, the software correctly identified this spectrum as not being consistent with the proposed structure and it was flagged for manual analysis. Figure 3.   ID #7 was flagged as an ambiguous result.  Spectral features in the  1 H and HSQC–DEPT dataset suggest a mixture between the two compounds shown above.  The final sample in the blind test set (ID #10) was also flagged by the software.  In this particular case, the software was unable to identify two protons in the experimental spectrum because the software correctly set a dark region over a large water peak.  Unfortunately, the creation of this dark region resulted in the exclusion of an important multiplet in the experimental spectrum that was in close proximity to the water peak.  Because of this dark region, the software was unable to confirm a match between the spectrum and structure. Table 1.   The results of the 19 Aldrich datasets.  For this dataset, 13 of the 19 datasets (69%) were automatically evaluated by the software.  The software was unable to confirm the proposed structure in Figure 1 (ID #8).  Upon closer inspection it was observed that the software failed because the experimental peak located at 4.74 ppm corresponding to atom #11 in the proposed structure had an integration value that was too low to assign correctly.  As a result, the software flagged this result as ambiguous.  It was later observed that the low integration value could be due to enol formation.  A longer relaxation delay may have more adequately prepared the experimental spectrum for automatic evaluation by the software.  Figure 1.   ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low.  A second compound, ID #11, was rejected by the software as well.  Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components.  In these two particular examples, the software did a good job of flagging the two problem spectra that required a closer look.  Results of the Blind Test After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in the exact same fashion.  No changes to the processing or verification parameters were made.  The results of this test are shown in table 2.  Table 2.   The results of the 10 blind Aldrich datasets.  For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software.  Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin Advanced Chemistry Development, Inc. Toronto, ON, Canada 110 Yonge Street, 14 th  floor, Toronto Ontario, Canada  M5C 1T4 Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com   Validating Automated Structure Confirmation in a Blind Study Figure 6 .  ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum.  CONCLUSIONS The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds.  The processing and evaluation settings for a typical group of samples was set up using a pilot set of 19 compounds.  Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions.  The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotomers and impurities are filtered out before the NMR Verification step, up to 70% when these problem samples are left in.  This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist.  These particular examples illustrate the software’s discrimination ability that help reduce the risk of false positives.  The results of this blind study suggested that a fully automated processing and interpretation system can perform sufficiently in an industrial environment.  ACKNOWLEDGEMENTS The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing us with NMR data for the compounds in this study.  REFERENCES 1. Automated Structure Verification Based on  1 H NMR Prediction,  Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer .  Magn. Reson. Chem.  2006; 44: 524-538.  2. Automated Evaluation of a Chemical Structure with Only 1D  1 H and 2D  1 H– 13 C HSQC,  Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer  .  ENC Poster  2006.   INTRODUCTION In previous work, we have presented several findings on the automated evaluation of chemical structures using  1 H,  13 C, and 2D NMR verification algorithms. 1–2   These studies have shown that these systems have performed extremely well through numerous challenges.  The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test.  This study was designed to prove that such a system would hold up in an industrial environment without any human intervention.  This study consisted of two distinct sets of structures and spectra.  The first contained 19 spectra sets (each dataset contained 1D  1 H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options.  This step was necessary to identify the best software settings based on the instrument and data collection practices for the laboratory where the samples were prepared and run.  Once the first set was run through the system and results of the verification procedure obtained, the second, blind test, was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance.  The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed.  Setting Up Ideal Processing and Evaluation Parameters In order to have a system that can run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks.  The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs’ standard macros.  These settings proved to be non-sufficient as the datasets contained several abnormally broad water peaks and low signal-to-noise ratios.  These macros were then modified to exclude these water peaks and set more stringent peak picking guidelines to combat the S/N issues.  The second attempt was improved but had some issues with the referencing in one of the 2D datasets.  In addition, the 1D spectra were not well-resolved, resulting in an inaccurate evaluation of some multiplets.  These issues were rectified by decreasing the line broadening setting in the software by a factor of 10.  Following this modification, the settings were then deemed to be sufficient.  Results of the First Test An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches have been previously reported. 2   Following the modification of ACD/Labs’ standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically.  The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided.  In other words, for this particular dataset, 69% of the samples were automatically evaluated by software without any human intervention.  The remaining 6 samples would require manual analysis by an NMR Spectroscopist as the software had flagged them as being either inconsistent or incorrect (Table 1).   Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID# 6, 12, 13, and 15).  The other two ambiguous results require a closer look to be explained.

Contenu connexe

Tendances

LC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminantsLC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminants
SCIEX
 
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
SCIEX
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...
SCIEX
 

Tendances (20)

Selected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysisSelected ion flow tube MS - Online quantitative VOC analysis
Selected ion flow tube MS - Online quantitative VOC analysis
 
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
Analysis of “The Big Four” Heavy Metals in Hops by Graphite Furnace Atomic Ab...
 
Archana B lc ppt
Archana B lc pptArchana B lc ppt
Archana B lc ppt
 
LC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminantsLC-MS/MS analysis of emerging food contaminants
LC-MS/MS analysis of emerging food contaminants
 
Ionization Techniques in LC-MS
Ionization Techniques in LC-MSIonization Techniques in LC-MS
Ionization Techniques in LC-MS
 
Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015Practical Implementation of the New Elemental Impurities Guidelines May 2015
Practical Implementation of the New Elemental Impurities Guidelines May 2015
 
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
Air Monitoring Applications of Selected Ion Flow Tube MS (SIFT-MS)
 
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...Impact of novel MS/MSall acquisition and processing techniques on forensic to...
Impact of novel MS/MSall acquisition and processing techniques on forensic to...
 
DETECTORS IN HPLC
DETECTORS IN HPLCDETECTORS IN HPLC
DETECTORS IN HPLC
 
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
Expanding Your High Performance Liquid Chromatography and Ultra High Performa...
 
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
Estimation of olopatadine hydrochloride by RP–HPLC and U.V spectrophotometry ...
 
Case Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical InstrumentsCase Study: Sterilization of Surgical Instruments
Case Study: Sterilization of Surgical Instruments
 
Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615Evap 135-cxp gfn2017-evap-poster-170615
Evap 135-cxp gfn2017-evap-poster-170615
 
Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.Validation and method development of Apixaban A research project.
Validation and method development of Apixaban A research project.
 
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up ApproachMALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
MALDI-TOF MS Based Discovery Workflows: A Fully Automated, Bottom-Up Approach
 
High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...High throughput, data independent acquisition for qualitative and quantitativ...
High throughput, data independent acquisition for qualitative and quantitativ...
 
Refocus: Analysis of malodor
Refocus: Analysis of malodorRefocus: Analysis of malodor
Refocus: Analysis of malodor
 
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
Development and Validation of Novel RP-HPLC method for the estimation of Nalo...
 
LCMS
LCMSLCMS
LCMS
 
HPLC Method Development
HPLC Method DevelopmentHPLC Method Development
HPLC Method Development
 

Similaire à Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study

Comparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINALComparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINAL
Jessica Sitko
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Sean Ekins
 

Similaire à Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study (20)

The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
The Performance Validation of Neural Network Based 13C NMR Prediction Using a...
 
Event 32
Event 32Event 32
Event 32
 
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
 
Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’Automated well test analysis ii using ‘well test auto’
Automated well test analysis ii using ‘well test auto’
 
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...IRJET-  	  Detection and Identification of Artificially Ripened Fruits using ...
IRJET- Detection and Identification of Artificially Ripened Fruits using ...
 
Neural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatmentNeural network-based pH and coagulation adjustment system in water treatment
Neural network-based pH and coagulation adjustment system in water treatment
 
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
PERFORMANCE ASSESSMENT OF ANFIS APPLIED TO FAULT DIAGNOSIS OF POWER TRANSFORMER
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
Towards More Reliable 13C and 1H Chemical Shift Prediction: A Systematic Comp...
 
A360105
A360105A360105
A360105
 
2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
 
Cw33592595
Cw33592595Cw33592595
Cw33592595
 
High Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant DetectionHigh Sensitivity Sanger Sequencing for Minor Variant Detection
High Sensitivity Sanger Sequencing for Minor Variant Detection
 
Comparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINALComparison of Formulation Analysis by UPLC FINAL
Comparison of Formulation Analysis by UPLC FINAL
 
EMSDVici
EMSDViciEMSDVici
EMSDVici
 
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint PropagationA Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
A Path-Oriented Automatic Random Testing Based on Double Constraint Propagation
 
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning modelsDevelopment and sharing of ADME/Tox and Drug Discovery Machine learning models
Development and sharing of ADME/Tox and Drug Discovery Machine learning models
 
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
 
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
A Rule-Based Model of Human Problem Solving Performance in Fault Diagnosis Ta...
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Validating Automated Structure Confirmation Using NMR Prediction in a Blind Study

  • 1. Looking closer at the failed samples provides some insight to the nature of the failure. ID #3 failed specifically because of an unassigned 2D peak appearing at 17.8 ppm and 2.2 ppm. It was determined that this peak was due to the aromatic methyl group in the structure. The reason for a lack of consistency between the predicted and experimental chemical shifts in this case were due to a slow rotation around the N-CO bond. As a result, this rotamer produces an experimental spectrum that looks like a mixture. The software was unable to accurately predict this mixture of forms based on the experimental conditions, and as a result, the predicted spectrum did not match the experimental and the sample was flagged for manual analysis. Figure 2. ID #3 was a false negative as the software was unable to assign the methyl group (highlighted in blue) due to slow rotation around the N-CO bond. The presence of this rotamer resulted in an inconsistency between the experimental and predicted chemical shifts. The software also flagged the sample ID #7 to be considered for closer inspection. In this particular case, the issue with this spectrum was determined to be based on the presence of a mixture. Based on some of the spectral features in both the 1 H and HSQC–DEPT spectra, it is believed that some of the product had converted to an alcohol resulting in a mixture of both the brominated and hydroxylated products. As a result, the software correctly identified this spectrum as not being consistent with the proposed structure and it was flagged for manual analysis. Figure 3. ID #7 was flagged as an ambiguous result. Spectral features in the 1 H and HSQC–DEPT dataset suggest a mixture between the two compounds shown above. The final sample in the blind test set (ID #10) was also flagged by the software. In this particular case, the software was unable to identify two protons in the experimental spectrum because the software correctly set a dark region over a large water peak. Unfortunately, the creation of this dark region resulted in the exclusion of an important multiplet in the experimental spectrum that was in close proximity to the water peak. Because of this dark region, the software was unable to confirm a match between the spectrum and structure. Table 1. The results of the 19 Aldrich datasets. For this dataset, 13 of the 19 datasets (69%) were automatically evaluated by the software. The software was unable to confirm the proposed structure in Figure 1 (ID #8). Upon closer inspection it was observed that the software failed because the experimental peak located at 4.74 ppm corresponding to atom #11 in the proposed structure had an integration value that was too low to assign correctly. As a result, the software flagged this result as ambiguous. It was later observed that the low integration value could be due to enol formation. A longer relaxation delay may have more adequately prepared the experimental spectrum for automatic evaluation by the software. Figure 1. ID #8 was a false negative because the integration value for the multiplet at 4.74 ppm was too low. A second compound, ID #11, was rejected by the software as well. Upon closer inspection of the experimental data, it was determined that the purity of this compound was not sufficient and that the sample contained several different components. In these two particular examples, the software did a good job of flagging the two problem spectra that required a closer look. Results of the Blind Test After the previous results and settings had been agreed upon, a blind test set of 10 compounds was run through the system in the exact same fashion. No changes to the processing or verification parameters were made. The results of this test are shown in table 2. Table 2. The results of the 10 blind Aldrich datasets. For this dataset, 7 of the 10 datasets (70%) were automatically evaluated by the software. Ryan Sasaki, Brent Lefebvre, Antony J. Williams, and Sergey Golotvin Advanced Chemistry Development, Inc. Toronto, ON, Canada 110 Yonge Street, 14 th floor, Toronto Ontario, Canada M5C 1T4 Tel: (416) 368-3435 Fax: (416) 368-5596 Toll Free: 1-800-304-3988 Email: info@acdlabs.com Validating Automated Structure Confirmation in a Blind Study Figure 6 . ID #10 failed due to the close proximity of an important multiplet to an intense water peak in the experimental spectrum. CONCLUSIONS The goal of this study was to evaluate a fully automated NMR processing and structure verification workflow for a blind test set of compounds. The processing and evaluation settings for a typical group of samples was set up using a pilot set of 19 compounds. Once these settings were adjusted, they were used to automatically process and evaluate a blind set of 10 compounds that were prepared under the same conditions. The results revealed that this completely automated system could reduce the interpretation workload of a spectroscopist by up to 90% if problems with rotomers and impurities are filtered out before the NMR Verification step, up to 70% when these problem samples are left in. This study highlighted several examples where datasets were flagged by the software for closer inspection by a spectroscopist. These particular examples illustrate the software’s discrimination ability that help reduce the risk of false positives. The results of this blind study suggested that a fully automated processing and interpretation system can perform sufficiently in an industrial environment. ACKNOWLEDGEMENTS The authors would like to acknowledge Dr. Timothy D. Spitzer and Randy D. Rutkowske of GlaxoSmithKline for providing us with NMR data for the compounds in this study. REFERENCES 1. Automated Structure Verification Based on 1 H NMR Prediction, Sergey S. Golotvin, Eugene Vodopianov, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer . Magn. Reson. Chem. 2006; 44: 524-538. 2. Automated Evaluation of a Chemical Structure with Only 1D 1 H and 2D 1 H– 13 C HSQC, Sergey S. Golotvin, Eugene Vodopianov, Rostislav Pol, Brent A. Lefebvre, Antony J. Williams, and Timothy D. Spitzer . ENC Poster 2006. INTRODUCTION In previous work, we have presented several findings on the automated evaluation of chemical structures using 1 H, 13 C, and 2D NMR verification algorithms. 1–2 These studies have shown that these systems have performed extremely well through numerous challenges. The current study focuses not only on the performance of the verification algorithms but also on the automated preparation of experimental data through a blind test. This study was designed to prove that such a system would hold up in an industrial environment without any human intervention. This study consisted of two distinct sets of structures and spectra. The first contained 19 spectra sets (each dataset contained 1D 1 H and 2D HSQC spectra) that were provided ahead of time for adjustment of processing settings and options. This step was necessary to identify the best software settings based on the instrument and data collection practices for the laboratory where the samples were prepared and run. Once the first set was run through the system and results of the verification procedure obtained, the second, blind test, was performed on 10 distinct datasets (with chemical structures) that were not available to the software or the software operators in advance. The details and results of these two tests are presented here, along with a comprehensive look at the structures that could not be confirmed. Setting Up Ideal Processing and Evaluation Parameters In order to have a system that can run without human intervention, automated processing and structure verification procedures (macros) must be created in the software to perform these tasks. The raw 1D and 2D NMR datasets for 19 Aldrich compounds were first evaluated using ACD/Labs’ standard macros. These settings proved to be non-sufficient as the datasets contained several abnormally broad water peaks and low signal-to-noise ratios. These macros were then modified to exclude these water peaks and set more stringent peak picking guidelines to combat the S/N issues. The second attempt was improved but had some issues with the referencing in one of the 2D datasets. In addition, the 1D spectra were not well-resolved, resulting in an inaccurate evaluation of some multiplets. These issues were rectified by decreasing the line broadening setting in the software by a factor of 10. Following this modification, the settings were then deemed to be sufficient. Results of the First Test An explanation of the combined verification algorithms used to evaluate spectrum-to-structure matches have been previously reported. 2 Following the modification of ACD/Labs’ standard macros explained in the previous section, the raw data of the 19 Aldrich compounds were fully processed and evaluated automatically. The results revealed that the software was able to successfully evaluate 13 of the 19 datasets provided. In other words, for this particular dataset, 69% of the samples were automatically evaluated by software without any human intervention. The remaining 6 samples would require manual analysis by an NMR Spectroscopist as the software had flagged them as being either inconsistent or incorrect (Table 1). Of these 6 samples, it was concluded that 4 of the false negatives were a result of algorithm errors that have been fixed in Version 10 of the software (ID# 6, 12, 13, and 15). The other two ambiguous results require a closer look to be explained.