SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
To provide a genomic narrative that can be trusted, microbiology
laboratories need quality control (QC) metrics to accompany their
genomic pipelines. QC metrics enable:
•  Implementing standards in routine lab sample processing
•  Performance comparison of pipeline optimizations or alternatives
•  Retrospective tracing of problems that arise
QC metrics are not easy to implement – they may need to be adjusted for
organism type, sample quality, sequencing technology and preparation,
and the mix of software components that are brought together in a
pipeline. Another challenge is to transform QC reporting from a manual
review of a pipeline’s disparate and often opaque application log files,
into an automated system of reporting and decision making that can be
adjusted by researchers and system administrators who are not expert
programmers.
We have developed a general purpose text-mining and reporting
application called Report Calc for Quality Control (RCQC) that works
directly within command-line scripts, or as a tool in Galaxy (an interactive
bioinformatics platform and workflow engine). An RCQC interpreter
follows instructions in a RCQC script to extract QC variables from various
application log and report files. It can implement rules that trigger
warning or failure statuses in an active pipeline. Various opportunities
arise for metrics along the stages of a genomic pipeline; our initial focus
is on basic assembly metrics as illustrated on this poster.
Abstract
RCQC Recipes
QC Ontology
Using the JSON-LD format’s metadata feature, RCQC can link particular
QC report terms to their standardized ontology counterparts. Creating a
controlled vocabulary for QC enables reports from disparate genomic
pipelines to be compared, which should eventually lead to a set of
pipeline metrics for accrediting commercial, government and open source
software. Within the context of the OBOFoundry of ontologies we are
introducing an ontology called GenEpiO (currently available at
https://github.com/Public-Health-Bioinformatics/irida_ontology) which
holds QC terms like "genome size ratio", “contig count”, etc. Using the
Protégé ontology editor it is easy to see the definitions for these terms.
Acknowledgements
IRIDA project funding is provided by Genome Canada, Genome BC, and
the Genomics R&D Initiative (GRDI) with additional support from Simon
Fraser University and Cystic Fibrosis Canada. We thank additional
project advisors for constructive comments.
We have started a library of simple "recipe" scripts that extract quality
control (QC) data from various reports like FastQC, QUAST, CheckM and
SPAdes into the popular and software-friendly JSON format (an auto-
generated HTML version of the same content is also available). One can
override sections of an RCQC recipe with settings that test variations in a
pipeline job. An example RCQC text-mining script and output HTML and
JSON report is shown below along with typical report files from other
pipeline tools.
1Department of Pathology, University of British Columbia; 2National Microbiology Laboratory, Public Health Agency of Canada; 3Department of Pathology,
University of British Columbia & BC Public Health Microbiology and Reference Laboratory
Damion M. Dooley1; Aaron J. Petkau2; Franklin Bristow2;
Gary Van Domselaar2; William W.L. Hsiao3
A Scripting Language For Standardized Evaluation Of Quality
Metrics In Galaxy And Command-line Driven Workflows
This work stemmed from the plan to enhance QC reporting on the web-
based Integrated Rapid Infectious Disease Analysis (www.IRIDA.ca)
project which manages sequence libraries and pipelines for food-born
pathogen assembly, annotation, SNP detection, and phylogenetic
analysis. RCQC has been developed to work as a command-line python
app, but in addition, since IRIDA uses Galaxy to execute its pipeline, we
have a Galaxy RCQC tool for “pro” users to develop recipes. We will be
offering a basic version of this tool that allows users without programming
skills to adjust key QC parameters only.
Recipes can include conditionals that trigger a halt to a pipeline by
sending the appropriate signal (exit code). More than one RCQC recipe
can be run in a pipeline, and their report output can be daisy chained in
order to contribute to a single collective report. QC metric conditionals
shown below can either signal a possible error situation (the “fail(qc…)”
call), or even call a halt to futile pipeline work (via “fail(job …)”).
adjusting parameters and formulae for pipeline operation – one that did
not require recompilation after each user-driven change. As a result, the
RCQC system provides a more transparent rule set that reduces the skill
needed to make process adjustments. Standard assembly pipeline QC
metrics are introduced which provide a blueprint for the way QC
components could be shared amongst NGS sequencing pipelines.
Further information, including source code, is available at
https://github.com/Public-Health-Bioinformatics/rcqc.
Implementation
Protege ontology editor view of GenEpiO assembly quality control terms
JSON-LDHTML
FLASHFastQC
CheckM
RCQC recipe for text-mining flash.log
In developing a scripting language to
do this work, we did not want to
reinvent the wheel (in fact RCQC offers
up for reuse all of python’s built-in
math and operator functions). We did
however need a flexible mechanism for
FLASH

Contenu connexe

En vedette (12)

Disney consumer products
Disney consumer productsDisney consumer products
Disney consumer products
 
Dispositivos del computador
Dispositivos del computadorDispositivos del computador
Dispositivos del computador
 
6-1 to 6-3 Quiz Day Concepts.pdf
6-1 to 6-3 Quiz Day Concepts.pdf6-1 to 6-3 Quiz Day Concepts.pdf
6-1 to 6-3 Quiz Day Concepts.pdf
 
Asbestos Spervisor Refresher 2-13-2015
Asbestos Spervisor Refresher 2-13-2015Asbestos Spervisor Refresher 2-13-2015
Asbestos Spervisor Refresher 2-13-2015
 
Tic Tac
Tic TacTic Tac
Tic Tac
 
certificate signing agent
certificate signing agentcertificate signing agent
certificate signing agent
 
Profile IPB
Profile IPB Profile IPB
Profile IPB
 
Blodomloppet
BlodomloppetBlodomloppet
Blodomloppet
 
3.2 thermal properties of matter
3.2   thermal properties of matter3.2   thermal properties of matter
3.2 thermal properties of matter
 
FABRICATION OF SOLAR BICYCLE.pptx new edited
FABRICATION OF SOLAR BICYCLE.pptx new editedFABRICATION OF SOLAR BICYCLE.pptx new edited
FABRICATION OF SOLAR BICYCLE.pptx new edited
 
FBK SVA - Saker i skolan
FBK SVA - Saker i skolanFBK SVA - Saker i skolan
FBK SVA - Saker i skolan
 
Magnetism
MagnetismMagnetism
Magnetism
 

Similaire à Report Calc for Quality Control

Ready, Set, Automate - Best Practices in Using Automated Tools for Validation
Ready, Set, Automate - Best Practices in Using Automated Tools for ValidationReady, Set, Automate - Best Practices in Using Automated Tools for Validation
Ready, Set, Automate - Best Practices in Using Automated Tools for ValidationCovance
 
Agile for Software as a Medical Device
Agile for Software as a Medical DeviceAgile for Software as a Medical Device
Agile for Software as a Medical DeviceOrthogonal
 
Scale and Load Testing of Micro-Service
Scale and Load Testing of Micro-ServiceScale and Load Testing of Micro-Service
Scale and Load Testing of Micro-ServiceIRJET Journal
 
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsx
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsxABAP Test Cockpit in action with Doctor ZedGe and abap2xlsx
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsxAlessandro Lavazzi
 
Solo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 UpcSolo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 UpcPepe
 
safety assurence in process control
safety assurence in process controlsafety assurence in process control
safety assurence in process controlNathiya Vaithi
 
Software Maintenance Bug Triaging
Software Maintenance Bug TriagingSoftware Maintenance Bug Triaging
Software Maintenance Bug TriagingRamis Khan
 
Reports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSReports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSKatalyst HLS
 
Oracle application testing suite (OATS)
Oracle application testing suite (OATS)Oracle application testing suite (OATS)
Oracle application testing suite (OATS)Koushik Arvapally
 
Cypress/VSAC Presentation at HIMSS13
Cypress/VSAC Presentation at HIMSS13Cypress/VSAC Presentation at HIMSS13
Cypress/VSAC Presentation at HIMSS13Saul Kravitz
 
Cypress nlm himss13_03042013
Cypress nlm himss13_03042013Cypress nlm himss13_03042013
Cypress nlm himss13_03042013Saul Kravitz
 
Control source code quality using the SonarQube platform
Control source code quality using the SonarQube platformControl source code quality using the SonarQube platform
Control source code quality using the SonarQube platformPVS-Studio
 
LT033 RIQAS Explained MAY17
LT033 RIQAS Explained MAY17LT033 RIQAS Explained MAY17
LT033 RIQAS Explained MAY17Randox
 
Value stream mapping for DevOps
Value stream mapping for DevOpsValue stream mapping for DevOps
Value stream mapping for DevOpsMarc Hornbeek
 
CV_SyedShoeb_2015
CV_SyedShoeb_2015CV_SyedShoeb_2015
CV_SyedShoeb_2015Syed Shoeb
 
Overview on “Computer System Validation” CSV
Overview on  “Computer System Validation” CSVOverview on  “Computer System Validation” CSV
Overview on “Computer System Validation” CSVAnil Sharma
 

Similaire à Report Calc for Quality Control (20)

Ready, Set, Automate - Best Practices in Using Automated Tools for Validation
Ready, Set, Automate - Best Practices in Using Automated Tools for ValidationReady, Set, Automate - Best Practices in Using Automated Tools for Validation
Ready, Set, Automate - Best Practices in Using Automated Tools for Validation
 
Agile for Software as a Medical Device
Agile for Software as a Medical DeviceAgile for Software as a Medical Device
Agile for Software as a Medical Device
 
Testing Process
Testing ProcessTesting Process
Testing Process
 
Nrnb project
Nrnb projectNrnb project
Nrnb project
 
Scale and Load Testing of Micro-Service
Scale and Load Testing of Micro-ServiceScale and Load Testing of Micro-Service
Scale and Load Testing of Micro-Service
 
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsx
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsxABAP Test Cockpit in action with Doctor ZedGe and abap2xlsx
ABAP Test Cockpit in action with Doctor ZedGe and abap2xlsx
 
Solo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 UpcSolo Requisitos 2008 - 07 Upc
Solo Requisitos 2008 - 07 Upc
 
safety assurence in process control
safety assurence in process controlsafety assurence in process control
safety assurence in process control
 
Software Maintenance Bug Triaging
Software Maintenance Bug TriagingSoftware Maintenance Bug Triaging
Software Maintenance Bug Triaging
 
Reports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSReports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLS
 
Oracle application testing suite (OATS)
Oracle application testing suite (OATS)Oracle application testing suite (OATS)
Oracle application testing suite (OATS)
 
Cypress/VSAC Presentation at HIMSS13
Cypress/VSAC Presentation at HIMSS13Cypress/VSAC Presentation at HIMSS13
Cypress/VSAC Presentation at HIMSS13
 
[EN] Success Story ArianeGroup
[EN] Success Story ArianeGroup[EN] Success Story ArianeGroup
[EN] Success Story ArianeGroup
 
Cypress nlm himss13_03042013
Cypress nlm himss13_03042013Cypress nlm himss13_03042013
Cypress nlm himss13_03042013
 
Control source code quality using the SonarQube platform
Control source code quality using the SonarQube platformControl source code quality using the SonarQube platform
Control source code quality using the SonarQube platform
 
LT033 RIQAS Explained MAY17
LT033 RIQAS Explained MAY17LT033 RIQAS Explained MAY17
LT033 RIQAS Explained MAY17
 
Value stream mapping for DevOps
Value stream mapping for DevOpsValue stream mapping for DevOps
Value stream mapping for DevOps
 
CV_SyedShoeb_2015
CV_SyedShoeb_2015CV_SyedShoeb_2015
CV_SyedShoeb_2015
 
Overview on “Computer System Validation” CSV
Overview on  “Computer System Validation” CSVOverview on  “Computer System Validation” CSV
Overview on “Computer System Validation” CSV
 
NRNB project
NRNB projectNRNB project
NRNB project
 

Dernier

Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 

Dernier (20)

Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 

Report Calc for Quality Control

  • 1. To provide a genomic narrative that can be trusted, microbiology laboratories need quality control (QC) metrics to accompany their genomic pipelines. QC metrics enable: •  Implementing standards in routine lab sample processing •  Performance comparison of pipeline optimizations or alternatives •  Retrospective tracing of problems that arise QC metrics are not easy to implement – they may need to be adjusted for organism type, sample quality, sequencing technology and preparation, and the mix of software components that are brought together in a pipeline. Another challenge is to transform QC reporting from a manual review of a pipeline’s disparate and often opaque application log files, into an automated system of reporting and decision making that can be adjusted by researchers and system administrators who are not expert programmers. We have developed a general purpose text-mining and reporting application called Report Calc for Quality Control (RCQC) that works directly within command-line scripts, or as a tool in Galaxy (an interactive bioinformatics platform and workflow engine). An RCQC interpreter follows instructions in a RCQC script to extract QC variables from various application log and report files. It can implement rules that trigger warning or failure statuses in an active pipeline. Various opportunities arise for metrics along the stages of a genomic pipeline; our initial focus is on basic assembly metrics as illustrated on this poster. Abstract RCQC Recipes QC Ontology Using the JSON-LD format’s metadata feature, RCQC can link particular QC report terms to their standardized ontology counterparts. Creating a controlled vocabulary for QC enables reports from disparate genomic pipelines to be compared, which should eventually lead to a set of pipeline metrics for accrediting commercial, government and open source software. Within the context of the OBOFoundry of ontologies we are introducing an ontology called GenEpiO (currently available at https://github.com/Public-Health-Bioinformatics/irida_ontology) which holds QC terms like "genome size ratio", “contig count”, etc. Using the Protégé ontology editor it is easy to see the definitions for these terms. Acknowledgements IRIDA project funding is provided by Genome Canada, Genome BC, and the Genomics R&D Initiative (GRDI) with additional support from Simon Fraser University and Cystic Fibrosis Canada. We thank additional project advisors for constructive comments. We have started a library of simple "recipe" scripts that extract quality control (QC) data from various reports like FastQC, QUAST, CheckM and SPAdes into the popular and software-friendly JSON format (an auto- generated HTML version of the same content is also available). One can override sections of an RCQC recipe with settings that test variations in a pipeline job. An example RCQC text-mining script and output HTML and JSON report is shown below along with typical report files from other pipeline tools. 1Department of Pathology, University of British Columbia; 2National Microbiology Laboratory, Public Health Agency of Canada; 3Department of Pathology, University of British Columbia & BC Public Health Microbiology and Reference Laboratory Damion M. Dooley1; Aaron J. Petkau2; Franklin Bristow2; Gary Van Domselaar2; William W.L. Hsiao3 A Scripting Language For Standardized Evaluation Of Quality Metrics In Galaxy And Command-line Driven Workflows This work stemmed from the plan to enhance QC reporting on the web- based Integrated Rapid Infectious Disease Analysis (www.IRIDA.ca) project which manages sequence libraries and pipelines for food-born pathogen assembly, annotation, SNP detection, and phylogenetic analysis. RCQC has been developed to work as a command-line python app, but in addition, since IRIDA uses Galaxy to execute its pipeline, we have a Galaxy RCQC tool for “pro” users to develop recipes. We will be offering a basic version of this tool that allows users without programming skills to adjust key QC parameters only. Recipes can include conditionals that trigger a halt to a pipeline by sending the appropriate signal (exit code). More than one RCQC recipe can be run in a pipeline, and their report output can be daisy chained in order to contribute to a single collective report. QC metric conditionals shown below can either signal a possible error situation (the “fail(qc…)” call), or even call a halt to futile pipeline work (via “fail(job …)”). adjusting parameters and formulae for pipeline operation – one that did not require recompilation after each user-driven change. As a result, the RCQC system provides a more transparent rule set that reduces the skill needed to make process adjustments. Standard assembly pipeline QC metrics are introduced which provide a blueprint for the way QC components could be shared amongst NGS sequencing pipelines. Further information, including source code, is available at https://github.com/Public-Health-Bioinformatics/rcqc. Implementation Protege ontology editor view of GenEpiO assembly quality control terms JSON-LDHTML FLASHFastQC CheckM RCQC recipe for text-mining flash.log In developing a scripting language to do this work, we did not want to reinvent the wheel (in fact RCQC offers up for reuse all of python’s built-in math and operator functions). We did however need a flexible mechanism for FLASH