SlideShare une entreprise Scribd logo
1  sur  62
Canada’s Integrated Rapid Infectious Disease Analysis Platform for
Genomic Epidemiology
Outline
• IRIDA Project Overview
• IRIDA Design Principles
• Platform Overview
• Demo
• Extending IRIDA with the REST API
• Works-in-progress and future directions
• Discussion
2
Genome Canada Bioinformatics Competition: Large-Scale Project
“A Federated Bioinformatics Platform for
Public Health Microbial Genomics”
Our Goal
An open source, standards compliant, high quality, and user friendly
genomic epidemiology analysis platform based on web-technology to
support real-time (food-borne) disease outbreak investigations
3 www.IRIDA.ca
Core Functions
• Rapid processing of genomic sequence data
• Informative display of genomic data
• Sample, Case, and aggregate data (“metadata”)
Management
• Exporting of data for downstream analysis
Integrated Rapid Infectious Disease Analysis informatics platform
Target Audience
• Public health agencies and other organizations who
need a platform to manage and process large amount
of pathogen genomic data
• Public health agencies who need a platform to use
genomics for outbreak investigations
• NML and BCCDC are our first testing centres
• Have engaged Australian and Scottish Public Health
Laboratories; other countries also expressed interests
GP1: Empowering the end users
• We design IRIDA for the end-users!
• User-friendly interface and usage
• Users’ institutes are in control of the system
• Local or private-cloud installation to allow each
organization to host their own instance of IRIDA
• Built on top of popular workflow engine: tools are
highly customizable for specific end users
• Users don’t have to “fork over” their data to use
IRIDA
Interviews with key personnel to identify barriers
to implement genomic epidemiology in public
health agencies
7
GP2: Promoting Responsible Data
Sharing
• Data sharing needs to be seamless and painless
• Designed programming interface (API) to import
and export data between instances of IRIDA or
between IRIDA and 3rd party (authenticated)
applications
• End users are in control of what to share and when
to share
• Harmonized metadata formats to facilitate data
sharing
• Use of Ontology to map different usages to the same
concept and standardized term
GP3: Accessible and Scalable
• Open-source development means anyone can help to
improve the platform and reduce redundant efforts
• Openness = transparent analysis pipeline and
comparable results
• Web technology = accessible from remote devices
• Well documented
• Installable on a single computer to a cluster of
computers
• VM available – deployed on a cloud possible
• Free!
GP4: Secure and High Quality
• Use industrial standard authentication and
authorization protocols for access
• Built-in QC components (under testing)
• Full data and pipeline provenances to keep track of
tasks performed by end-users
• Versioned reference databases (under testing)
Genomics Analytic Ecosystem
Raw Genomics
data
IRIDA backend: data
management and routine
analysis
Public Health Data
Raw Genomics
data
Public Health Data
Geneious: ad-hoc
analysis (interactive)
Galaxy: ad-hoc
analysis (pipelined)
Bionumerics: PulseNet
specific (pipelined)
CDMart and other
Marts
EDW (LADW)
Primary Data
Processed Data (ready
for analysis) and
routine / automated
analysis
Analysis Tools
Dashboards
(information display)
IRIDA frontend:
Visualization
Components of Genomic Analysis
25GB-250GB
(data transfer
bottleneck)
5: public Data repository and
other collaborating centres
3: Public Health Data
1: data generation 2: data storage and/or analysis
platform
4: Analysts
Raw data
Secure transfer
GSC?
NML?
IRIDA Project Phases
• Phase 1: genomics process and analysis pipeline to
produce categorical data (MLST and SNPs) suitable for
current epidemiological analysis – almost completed
• Phase 2: combine the categorical data with subset of
public health data (line list approach to replace current
Excel based approach) and export of categorical data to
CDMARTs – in progress
• Phase 3: Develop IRIDA as an exploratory platform for
new ways of interpreting genomics data in light of
epidemiological and clinical data – in progress;
continuous process beyond current project
Public Health Contextual Data
Integration into IRIDA is needed
• Genomic data requires special storage and analysis
considerations due to its size and complexity
• While we can export genomic typing results to
existing epi. analysis systems, by bringing
“contextual” info into IRIDA, we can come up with
more complex visualization and analysis tools (e.g.
GenGIS)
14
Genomics, Epidemiology, Clinical, Lab Data
Platform Overview
• Suite of tools to facilitate genomic epidemiology
15
IRIDA
Sequencing
Instruments
Web
Application
Data
management
Built-in
Analytical
Tools
External
Galaxy
Command-
line Tools
Sequence Instruments
• Typically MiSeq or NextSeq
• An easy to use batch uploader is available to send
data from MiSeq or a data-staging PC to IRIDA
• Uploader available on Molecular-PC (BCCDC Lab’s
data staging PC)
Platform Overview – data model
• Data model inspired by INSDC
• Makes data uploading to NCBI
easy
• Currently with limited metadata
(we will see in demo)
• Plan to extend data model
17
Project
Run
Metadata
Sample
Metadata
Sequencing
Data
Sample
Metadata
Sequencing
Data
Members
Metadata
Platform Overview
• System Architecture
18
IRIDAServletContainer
REST API
Central File
Storage
Web
Interface
ApplicationLogic
Compute
Cluster
Galaxy
$ ~ >_ Galaxy
Demo Topics
• Getting data into IRIDA
• Organizing data
• Getting data out of IRIDA
• Analyzing data in IRIDA
Getting data into IRIDA
• Manual web interface upload
• Automated instrument upload (Illumina MiSeq)
20
Organizing Data
• Sample management actions
• Creating
• Editing metadata
• Copy/move
samples
21
Analyzing data in IRIDA
• Integrated analytical workflows
• Built-in Galaxy
• Assembly (SPAdes) and annotation (prokka)
• Phylogenomics (SNVPhyl)
• Uses Galaxy in the back-end
• Extendable, if you can write a Galaxy workflow for a tool, it’s straightforward to integrate
into IRIDA.
22
Getting data out of IRIDA
• Sharing project data
• Downloading
• Export to external Galaxy instance
• Exporting to the command-line
23
Works-in-progress and Future
directions
• Ontology integration, line-list tool
• Quality control and Quality assurance in analytical workflows
• Robust metadata integration and management
• Simplification of SRA submission
24
Types of (Meta)Data Standardized Within IRIDA
Lab Analytics
Genomics, PFGE
Serotyping, Phage typing
MLST, AMR
Sample Metadata
Isolation Source (Food, Host
Body Product,
Environmental), BioSample
Epidemiology Investigation
Exposures
Clinical Data
Patient demographics, Medical
History, Comorbidities,
Symptoms, Health Status
Reporting
Case/Investigation Status
“Not just what data IS collected, but what SHOULD be collected”
Types of (Meta)Data Standardized Within IRIDA
Lab Analytics
Genomics, PFGE
Serotyping, Phage typing
MLST, Ribotyping
Sample Metadata
Isolation Source (Food, Host
Body Product,
Environmental), BioSample
Epidemiology Investigation
Exposures
Clinical Data
Patient demographics, Medical
History, Comorbidities,
Symptoms, Health Status
Reporting
Case/Investigation Status
“Not just what data IS collected, but what SHOULD be collected”
27
Improved Querying Using Genomic Epidemiology Application Ontology
1. Create a hierarchy of well-defined terms
(harmonized from different sources, for different
organisms)
2. Provide clearly-defined relationships between terms
3. Use OBO architecture
Water Related Exposure
Treated Untreated
Bottled Municipal Individual Pond River Lake
Transmissio
n through
ingestion or
contact
Transmissio
n through
ingestion or
contact
Advantage of Using Ontology
• Flexible – allow more transparent integration
• Invisible to the User (but you’ll feel the
convenience and familiarity)
• With defined relationships, computer can be used
to assist reasoning (better querying and better
automation)
• Build on a large body of existing work (OBO) means
we can benefit from other people’s effort
29
• “Person, place, time”
• Exposure, food items, geographical information, symptoms, onset of symptoms
• Created (manually in excel) on ad hoc basis per investigation
• Need to be shared between stakeholders, but data governance is an issue
The Line List : The Primary Tool for Epidemiological Investigations
• Data integration and ontology based reasoning 
automated case definitions!
Integrating
genomics and
epidemiological
data!
IRIDA Offers Line List Visualizations of Selectable Data!
1. Line List
View
2. Timeline
View
Hideable cases
Selectable fields
Travel
Symptoms and Onset
Exposure Types
Hospitalization
Ontology - In Progress
• Create a smaller core (Lab, Epi exposure, and Food)
ontology for line-list testing
• Create a consortium for group to take on different
domains of Genomic Epidemiology Application
Ontology
• Pursuing longer term funding for ontology
Workflow Quality Control Tool
Objectives:
• Develop an universal QC module for
various bioinformatics tools
• Provide generic text mining tools for
extracting key variables from
pipeline component log files.
• Make it easier to adjust pipeline QC
threshold parameters.
• Standardized rule engine with access
to many Python functions.
• Pathogen-specific configuration
settings.
• Galaxy tool or command line.
• E.g. quality control system for the
assembly, annotation, and snp-
calling pipelines.
Workflow Quality Control Tool
Input:
log files (datasets)
Output: report file + optionally halt workflow
+ rule file (json)
Workflow Quality Control Tool
Simple workflow add-on:
Complex rule capability:
External applications
• Authorized external applications can connect to
IRIDA to obtain data seamlessly via REST API
• Example: GenGIS can extract sample phylogeny
and geographic data from IRIDA to generate a map
that shows the phylogenetic and geographic
information associated with outbreak isolates
• Data can also be output to Dashboard applications
for real-time queries
2011 Cholera Outbreak
Red = Haiti Blue = Nepal
2011 Cholera Cartogram
Red = Haiti Blue = Nepal
Comments and Feedback on
IRIDA
• What existing features do you like?
• What existing features don’t you like?
• What features do you want to see soon?
• What’s needed before you will use the tool
• What features do you want to see eventually?
• Longer term functionality
Discussion
• Infrastructure
• Short-term (Jenn’s cluster)
• Long-term?
• Network connectivity (speed and security)
• Access to Metadata (Epi and Lab)
• Sustainability
• Currently supported by a Genome Canada grant (expiring
June 2016)
• NML committed to maintain the core development going
• Buy-In from BCCDC (customization and maintenance of the
platform)
Genomics Analytic Platforms
Raw Genomics
data
IRIDA backend: data
management and routine
analysis
Public Health Data
Raw Genomics
data
Public Health Data
Geneious: ad-hoc
analysis (interactive)
Galaxy: ad-hoc
analysis (pipelined)
Bionumerics: PulseNet
specific (pipelined)
CDMart and other
Marts
EDW (LADW)
Primary Data
Processed Data (ready
for analysis) and
routine / automated
analysis
Analysis Tools
Dashboards
(information display)
IRIDA frontend:
Visualization
Contact
• Project Information: http://www.irida.ca
• Project source:
• https://github.com/phac-nml/irida
• https://github.com/phac-nml/irida-miseq-uploader
• https://github.com/phac-nml/irida-galaxy-importer
• Documentation: https://irida.corefacility.ca/documentation/
• E-mail: IRIDA-mail@sfu.ca
• IRC: #irida on irc.freenode.net
Many slides provided by Franklin Bristow (NML), Alex Keddy and Rob
Beiko (Dalhousie U.), Melanie Courtot, Emma Griffiths, and Damion
Dooley
41
The End
Extending IRIDA with the REST API
• OAuth2 authorization (industry standard)
• HTTP API
• Examples:
• External Galaxy importer tool
• Command-line linker
• GenGIS
43
Retrieving IRIDA data through REST API
GenGIS: kiwi.cs.dal.ca/GenGIS
Login Prompt
The Blank IRIDA Display
IRIDA Credentials
Editing Credentials
Active Credentials
Logging In
Selecting Analyses
Retrieving Analyses
Initial Data View
(mock data set retrieved from IRIDA)
Geographic Locations
Required fields: Site ID, Latitude, Longitude
Individual Samples
Site IDs keyed to locations.
Many Sequence IDs(=multiple samples) can key to single Site IDs.
Phylogenetic Tree
Tree leaf IDs keyed to samples.
Geographically Coupled Phylogenetic
Distance (GCPD)
GCPD
IRIDA Cartogram Options
IRIDA Cartogram
2011 Cholera Outbreak
Red = Haiti Blue = Nepal
2011 Cholera Cartogram
Red = Haiti Blue = Nepal

Contenu connexe

Tendances

Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportMatthew Vaughn
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795Michael Williams
 
Simagis for healthcare
Simagis for healthcareSimagis for healthcare
Simagis for healthcarekhvatkov
 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Livekhvatkov
 
eSource: What You Need To Know
eSource: What You Need To KnoweSource: What You Need To Know
eSource: What You Need To Knowwww.datatrak.com
 
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...Nick Hargaden
 
Successful Selection and Implementation of EDC (Electronic Data Capture) System
Successful Selection and Implementation of EDC (Electronic Data Capture) System Successful Selection and Implementation of EDC (Electronic Data Capture) System
Successful Selection and Implementation of EDC (Electronic Data Capture) System Eleazar Noel
 
eSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilities
eSource: Data Capture Simplified - Uncover Time and Cost Saving PossibilitieseSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilities
eSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilitieswww.datatrak.com
 
Focus on the Evidence: a knowledge graph approach to profiling drug targets
Focus on the Evidence: a knowledge graph approach to profiling drug targetsFocus on the Evidence: a knowledge graph approach to profiling drug targets
Focus on the Evidence: a knowledge graph approach to profiling drug targetsNolan Nichols
 
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...Ahmad C. Bukhari
 
Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIIVivek Krishnakumar
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of AraportAraport
 

Tendances (15)

Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through Araport
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795
 
Simagis for healthcare
Simagis for healthcareSimagis for healthcare
Simagis for healthcare
 
Machine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis LiveMachine Learning in Pathology Diagnostics with Simagis Live
Machine Learning in Pathology Diagnostics with Simagis Live
 
Innovative project1
Innovative project1Innovative project1
Innovative project1
 
eSource: What You Need To Know
eSource: What You Need To KnoweSource: What You Need To Know
eSource: What You Need To Know
 
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...
eSource to eTrial - Integrating Technology and Data to Innovate Clinical Deve...
 
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
 
Successful Selection and Implementation of EDC (Electronic Data Capture) System
Successful Selection and Implementation of EDC (Electronic Data Capture) System Successful Selection and Implementation of EDC (Electronic Data Capture) System
Successful Selection and Implementation of EDC (Electronic Data Capture) System
 
eSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilities
eSource: Data Capture Simplified - Uncover Time and Cost Saving PossibilitieseSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilities
eSource: Data Capture Simplified - Uncover Time and Cost Saving Possibilities
 
Focus on the Evidence: a knowledge graph approach to profiling drug targets
Focus on the Evidence: a knowledge graph approach to profiling drug targetsFocus on the Evidence: a knowledge graph approach to profiling drug targets
Focus on the Evidence: a knowledge graph approach to profiling drug targets
 
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...
Leveraging the CEDAR Workbench for Ontology-linked Submission of Adaptive Imm...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIII
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 

Similaire à Irida bccdc dec10_2015

Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...confluent
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...David Peyruc
 
Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Sanjay Padhi, Ph.D
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DLaura Berry
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theoryC. Tobin Magle
 
Enabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportMelanie Courtot
 
Brisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapBrisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapARDC
 
An overview of clinical data repository
An overview of clinical data repositoryAn overview of clinical data repository
An overview of clinical data repositoryNetrah Laxminarayanan
 
The XNAT imaging informatics platform
The XNAT imaging informatics platformThe XNAT imaging informatics platform
The XNAT imaging informatics platformimgcommcall
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
GVK BIO - BioIT Services
GVK BIO - BioIT ServicesGVK BIO - BioIT Services
GVK BIO - BioIT Servicesgvk_bio
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...David Peyruc
 
Medical Intelligence EDW 20 juni: Radboudumc
Medical Intelligence EDW 20 juni: RadboudumcMedical Intelligence EDW 20 juni: Radboudumc
Medical Intelligence EDW 20 juni: RadboudumcFurore_com
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsTom Plasterer
 
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterGlobus
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeTom Plasterer
 
Running Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMERunning Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMETyrone Grandison
 

Similaire à Irida bccdc dec10_2015 (20)

Irida immemxi hsiao
Irida immemxi hsiaoIrida immemxi hsiao
Irida immemxi hsiao
 
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
Flattening the Curve with Kafka (Rishi Tarar, Northrop Grumman Corp.) Kafka S...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 
Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021 Tag.bio aws public jun 08 2021
Tag.bio aws public jun 08 2021
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
Enabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology support
 
Brisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapBrisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCap
 
An overview of clinical data repository
An overview of clinical data repositoryAn overview of clinical data repository
An overview of clinical data repository
 
The XNAT imaging informatics platform
The XNAT imaging informatics platformThe XNAT imaging informatics platform
The XNAT imaging informatics platform
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
GVK BIO - BioIT Services
GVK BIO - BioIT ServicesGVK BIO - BioIT Services
GVK BIO - BioIT Services
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 
Medical Intelligence EDW 20 juni: Radboudumc
Medical Intelligence EDW 20 juni: RadboudumcMedical Intelligence EDW 20 juni: Radboudumc
Medical Intelligence EDW 20 juni: Radboudumc
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
Running Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHMERunning Mixed Workloads on Kubernetes at IHME
Running Mixed Workloads on Kubernetes at IHME
 

Plus de IRIDA_community

Robertson immemxi final March 2016
Robertson immemxi final March 2016Robertson immemxi final March 2016
Robertson immemxi final March 2016IRIDA_community
 
Hetman immem xi final March 2016
Hetman immem xi final March 2016Hetman immem xi final March 2016
Hetman immem xi final March 2016IRIDA_community
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016IRIDA_community
 
Emma Food on workshop allergy_eg
Emma Food on workshop allergy_egEmma Food on workshop allergy_eg
Emma Food on workshop allergy_egIRIDA_community
 
Biocuration gen epio_poster
Biocuration gen epio_posterBiocuration gen epio_poster
Biocuration gen epio_posterIRIDA_community
 
Emma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_posterEmma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_posterIRIDA_community
 
Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016IRIDA_community
 
Integrate Ontologies into your apps
Integrate Ontologies into your appsIntegrate Ontologies into your apps
Integrate Ontologies into your appsIRIDA_community
 
Report Calc for Quality Control
Report Calc for Quality ControlReport Calc for Quality Control
Report Calc for Quality ControlIRIDA_community
 
Gen epio immem_griffiths
Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffithsIRIDA_community
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015IRIDA_community
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA_community
 
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceIRIDA_community
 

Plus de IRIDA_community (14)

Robertson immemxi final March 2016
Robertson immemxi final March 2016Robertson immemxi final March 2016
Robertson immemxi final March 2016
 
Hetman immem xi final March 2016
Hetman immem xi final March 2016Hetman immem xi final March 2016
Hetman immem xi final March 2016
 
Barker immemxi final March 2016
Barker immemxi final March 2016Barker immemxi final March 2016
Barker immemxi final March 2016
 
Emma FoodON poster3
Emma FoodON poster3Emma FoodON poster3
Emma FoodON poster3
 
Emma Food on workshop allergy_eg
Emma Food on workshop allergy_egEmma Food on workshop allergy_eg
Emma Food on workshop allergy_eg
 
Biocuration gen epio_poster
Biocuration gen epio_posterBiocuration gen epio_poster
Biocuration gen epio_poster
 
Emma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_posterEmma Griffiths ASM microbe gen_epio_poster
Emma Griffiths ASM microbe gen_epio_poster
 
Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016Julie Shay CCBC poster may 11 2016
Julie Shay CCBC poster may 11 2016
 
Integrate Ontologies into your apps
Integrate Ontologies into your appsIntegrate Ontologies into your apps
Integrate Ontologies into your apps
 
Report Calc for Quality Control
Report Calc for Quality ControlReport Calc for Quality Control
Report Calc for Quality Control
 
Gen epio immem_griffiths
Gen epio immem_griffithsGen epio immem_griffiths
Gen epio immem_griffiths
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiaoIRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
IRIDA: Canada’s federated platform for genomic epidemiology, ABPHM 2015 WHsiao
 
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance ExperienceDomselaar GMI8 Beijing Canadian WGS Surveillance Experience
Domselaar GMI8 Beijing Canadian WGS Surveillance Experience
 

Dernier

Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...vidya singh
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escortsvidya singh
 
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...Taniya Sharma
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeCall Girls Delhi
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomdiscovermytutordmt
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...Taniya Sharma
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...jageshsingh5554
 

Dernier (20)

Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore EscortsCall Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
Call Girls Horamavu WhatsApp Number 7001035870 Meeting With Bangalore Escorts
 
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls JaipurCall Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
Call Girls Service Jaipur Grishma WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
VIP Service Call Girls Sindhi Colony 📳 7877925207 For 18+ VIP Call Girl At Th...
 

Irida bccdc dec10_2015

  • 1. Canada’s Integrated Rapid Infectious Disease Analysis Platform for Genomic Epidemiology
  • 2. Outline • IRIDA Project Overview • IRIDA Design Principles • Platform Overview • Demo • Extending IRIDA with the REST API • Works-in-progress and future directions • Discussion 2
  • 3. Genome Canada Bioinformatics Competition: Large-Scale Project “A Federated Bioinformatics Platform for Public Health Microbial Genomics” Our Goal An open source, standards compliant, high quality, and user friendly genomic epidemiology analysis platform based on web-technology to support real-time (food-borne) disease outbreak investigations 3 www.IRIDA.ca
  • 4. Core Functions • Rapid processing of genomic sequence data • Informative display of genomic data • Sample, Case, and aggregate data (“metadata”) Management • Exporting of data for downstream analysis Integrated Rapid Infectious Disease Analysis informatics platform
  • 5. Target Audience • Public health agencies and other organizations who need a platform to manage and process large amount of pathogen genomic data • Public health agencies who need a platform to use genomics for outbreak investigations • NML and BCCDC are our first testing centres • Have engaged Australian and Scottish Public Health Laboratories; other countries also expressed interests
  • 6. GP1: Empowering the end users • We design IRIDA for the end-users! • User-friendly interface and usage • Users’ institutes are in control of the system • Local or private-cloud installation to allow each organization to host their own instance of IRIDA • Built on top of popular workflow engine: tools are highly customizable for specific end users • Users don’t have to “fork over” their data to use IRIDA
  • 7. Interviews with key personnel to identify barriers to implement genomic epidemiology in public health agencies 7
  • 8. GP2: Promoting Responsible Data Sharing • Data sharing needs to be seamless and painless • Designed programming interface (API) to import and export data between instances of IRIDA or between IRIDA and 3rd party (authenticated) applications • End users are in control of what to share and when to share • Harmonized metadata formats to facilitate data sharing • Use of Ontology to map different usages to the same concept and standardized term
  • 9. GP3: Accessible and Scalable • Open-source development means anyone can help to improve the platform and reduce redundant efforts • Openness = transparent analysis pipeline and comparable results • Web technology = accessible from remote devices • Well documented • Installable on a single computer to a cluster of computers • VM available – deployed on a cloud possible • Free!
  • 10. GP4: Secure and High Quality • Use industrial standard authentication and authorization protocols for access • Built-in QC components (under testing) • Full data and pipeline provenances to keep track of tasks performed by end-users • Versioned reference databases (under testing)
  • 11. Genomics Analytic Ecosystem Raw Genomics data IRIDA backend: data management and routine analysis Public Health Data Raw Genomics data Public Health Data Geneious: ad-hoc analysis (interactive) Galaxy: ad-hoc analysis (pipelined) Bionumerics: PulseNet specific (pipelined) CDMart and other Marts EDW (LADW) Primary Data Processed Data (ready for analysis) and routine / automated analysis Analysis Tools Dashboards (information display) IRIDA frontend: Visualization
  • 12. Components of Genomic Analysis 25GB-250GB (data transfer bottleneck) 5: public Data repository and other collaborating centres 3: Public Health Data 1: data generation 2: data storage and/or analysis platform 4: Analysts Raw data Secure transfer GSC? NML?
  • 13. IRIDA Project Phases • Phase 1: genomics process and analysis pipeline to produce categorical data (MLST and SNPs) suitable for current epidemiological analysis – almost completed • Phase 2: combine the categorical data with subset of public health data (line list approach to replace current Excel based approach) and export of categorical data to CDMARTs – in progress • Phase 3: Develop IRIDA as an exploratory platform for new ways of interpreting genomics data in light of epidemiological and clinical data – in progress; continuous process beyond current project
  • 14. Public Health Contextual Data Integration into IRIDA is needed • Genomic data requires special storage and analysis considerations due to its size and complexity • While we can export genomic typing results to existing epi. analysis systems, by bringing “contextual” info into IRIDA, we can come up with more complex visualization and analysis tools (e.g. GenGIS) 14 Genomics, Epidemiology, Clinical, Lab Data
  • 15. Platform Overview • Suite of tools to facilitate genomic epidemiology 15 IRIDA Sequencing Instruments Web Application Data management Built-in Analytical Tools External Galaxy Command- line Tools
  • 16. Sequence Instruments • Typically MiSeq or NextSeq • An easy to use batch uploader is available to send data from MiSeq or a data-staging PC to IRIDA • Uploader available on Molecular-PC (BCCDC Lab’s data staging PC)
  • 17. Platform Overview – data model • Data model inspired by INSDC • Makes data uploading to NCBI easy • Currently with limited metadata (we will see in demo) • Plan to extend data model 17 Project Run Metadata Sample Metadata Sequencing Data Sample Metadata Sequencing Data Members Metadata
  • 18. Platform Overview • System Architecture 18 IRIDAServletContainer REST API Central File Storage Web Interface ApplicationLogic Compute Cluster Galaxy $ ~ >_ Galaxy
  • 19. Demo Topics • Getting data into IRIDA • Organizing data • Getting data out of IRIDA • Analyzing data in IRIDA
  • 20. Getting data into IRIDA • Manual web interface upload • Automated instrument upload (Illumina MiSeq) 20
  • 21. Organizing Data • Sample management actions • Creating • Editing metadata • Copy/move samples 21
  • 22. Analyzing data in IRIDA • Integrated analytical workflows • Built-in Galaxy • Assembly (SPAdes) and annotation (prokka) • Phylogenomics (SNVPhyl) • Uses Galaxy in the back-end • Extendable, if you can write a Galaxy workflow for a tool, it’s straightforward to integrate into IRIDA. 22
  • 23. Getting data out of IRIDA • Sharing project data • Downloading • Export to external Galaxy instance • Exporting to the command-line 23
  • 24. Works-in-progress and Future directions • Ontology integration, line-list tool • Quality control and Quality assurance in analytical workflows • Robust metadata integration and management • Simplification of SRA submission 24
  • 25. Types of (Meta)Data Standardized Within IRIDA Lab Analytics Genomics, PFGE Serotyping, Phage typing MLST, AMR Sample Metadata Isolation Source (Food, Host Body Product, Environmental), BioSample Epidemiology Investigation Exposures Clinical Data Patient demographics, Medical History, Comorbidities, Symptoms, Health Status Reporting Case/Investigation Status “Not just what data IS collected, but what SHOULD be collected”
  • 26. Types of (Meta)Data Standardized Within IRIDA Lab Analytics Genomics, PFGE Serotyping, Phage typing MLST, Ribotyping Sample Metadata Isolation Source (Food, Host Body Product, Environmental), BioSample Epidemiology Investigation Exposures Clinical Data Patient demographics, Medical History, Comorbidities, Symptoms, Health Status Reporting Case/Investigation Status “Not just what data IS collected, but what SHOULD be collected”
  • 27. 27 Improved Querying Using Genomic Epidemiology Application Ontology 1. Create a hierarchy of well-defined terms (harmonized from different sources, for different organisms) 2. Provide clearly-defined relationships between terms 3. Use OBO architecture Water Related Exposure Treated Untreated Bottled Municipal Individual Pond River Lake Transmissio n through ingestion or contact Transmissio n through ingestion or contact
  • 28. Advantage of Using Ontology • Flexible – allow more transparent integration • Invisible to the User (but you’ll feel the convenience and familiarity) • With defined relationships, computer can be used to assist reasoning (better querying and better automation) • Build on a large body of existing work (OBO) means we can benefit from other people’s effort
  • 29. 29 • “Person, place, time” • Exposure, food items, geographical information, symptoms, onset of symptoms • Created (manually in excel) on ad hoc basis per investigation • Need to be shared between stakeholders, but data governance is an issue The Line List : The Primary Tool for Epidemiological Investigations • Data integration and ontology based reasoning  automated case definitions! Integrating genomics and epidemiological data!
  • 30. IRIDA Offers Line List Visualizations of Selectable Data! 1. Line List View 2. Timeline View Hideable cases Selectable fields Travel Symptoms and Onset Exposure Types Hospitalization
  • 31. Ontology - In Progress • Create a smaller core (Lab, Epi exposure, and Food) ontology for line-list testing • Create a consortium for group to take on different domains of Genomic Epidemiology Application Ontology • Pursuing longer term funding for ontology
  • 32. Workflow Quality Control Tool Objectives: • Develop an universal QC module for various bioinformatics tools • Provide generic text mining tools for extracting key variables from pipeline component log files. • Make it easier to adjust pipeline QC threshold parameters. • Standardized rule engine with access to many Python functions. • Pathogen-specific configuration settings. • Galaxy tool or command line. • E.g. quality control system for the assembly, annotation, and snp- calling pipelines.
  • 33. Workflow Quality Control Tool Input: log files (datasets) Output: report file + optionally halt workflow + rule file (json)
  • 34. Workflow Quality Control Tool Simple workflow add-on: Complex rule capability:
  • 35. External applications • Authorized external applications can connect to IRIDA to obtain data seamlessly via REST API • Example: GenGIS can extract sample phylogeny and geographic data from IRIDA to generate a map that shows the phylogenetic and geographic information associated with outbreak isolates • Data can also be output to Dashboard applications for real-time queries
  • 36. 2011 Cholera Outbreak Red = Haiti Blue = Nepal
  • 37. 2011 Cholera Cartogram Red = Haiti Blue = Nepal
  • 38. Comments and Feedback on IRIDA • What existing features do you like? • What existing features don’t you like? • What features do you want to see soon? • What’s needed before you will use the tool • What features do you want to see eventually? • Longer term functionality
  • 39. Discussion • Infrastructure • Short-term (Jenn’s cluster) • Long-term? • Network connectivity (speed and security) • Access to Metadata (Epi and Lab) • Sustainability • Currently supported by a Genome Canada grant (expiring June 2016) • NML committed to maintain the core development going • Buy-In from BCCDC (customization and maintenance of the platform)
  • 40. Genomics Analytic Platforms Raw Genomics data IRIDA backend: data management and routine analysis Public Health Data Raw Genomics data Public Health Data Geneious: ad-hoc analysis (interactive) Galaxy: ad-hoc analysis (pipelined) Bionumerics: PulseNet specific (pipelined) CDMart and other Marts EDW (LADW) Primary Data Processed Data (ready for analysis) and routine / automated analysis Analysis Tools Dashboards (information display) IRIDA frontend: Visualization
  • 41. Contact • Project Information: http://www.irida.ca • Project source: • https://github.com/phac-nml/irida • https://github.com/phac-nml/irida-miseq-uploader • https://github.com/phac-nml/irida-galaxy-importer • Documentation: https://irida.corefacility.ca/documentation/ • E-mail: IRIDA-mail@sfu.ca • IRC: #irida on irc.freenode.net Many slides provided by Franklin Bristow (NML), Alex Keddy and Rob Beiko (Dalhousie U.), Melanie Courtot, Emma Griffiths, and Damion Dooley 41
  • 43. Extending IRIDA with the REST API • OAuth2 authorization (industry standard) • HTTP API • Examples: • External Galaxy importer tool • Command-line linker • GenGIS 43
  • 44. Retrieving IRIDA data through REST API GenGIS: kiwi.cs.dal.ca/GenGIS
  • 46. The Blank IRIDA Display
  • 53. Initial Data View (mock data set retrieved from IRIDA)
  • 54. Geographic Locations Required fields: Site ID, Latitude, Longitude
  • 55. Individual Samples Site IDs keyed to locations. Many Sequence IDs(=multiple samples) can key to single Site IDs.
  • 56. Phylogenetic Tree Tree leaf IDs keyed to samples.
  • 58. GCPD
  • 61. 2011 Cholera Outbreak Red = Haiti Blue = Nepal
  • 62. 2011 Cholera Cartogram Red = Haiti Blue = Nepal

Notes de l'éditeur

  1. Platform Overview Data structure System Architecture Getting data into IRIDA Web interface From Illumina MiSeq instruments Organizing data Projects as containers Managing samples (moving, copying, merging) Getting data out of IRIDA Sharing data (permissions, sharing project data) Downloading data Exporting: to Galaxy and the command-line Analyzing data in IRIDA Assembly and annotation, and core SNV Phylogenomics pipelines Extending IRIDA with the REST API Data model exposed over HTTP OAuth2 authorization Example: GenGIS Works-in-progress and future directions for IRIDA Ontology integration Epi line-list QA/QC
  2. IRIDA was conceived about 2 years ago through a Genome Canada Bioinformatics Grant. It is an effort to build an open source, standards compliant, high quality genomic epidemiology analysis platform to support real-time disease outbreak investigations, initially focused on food-borne illnesses
  3. Extending more interdisciplinary data integration further…
  4. Both at NML and BCCDC Build the foundation and now ready for more engagement
  5. No manual copying and pasting into spreadsheets between programs We have a standardized way to allow other programs to access data from IRIDA securely
  6. So where do I see IRIDA sits in the Genomic analytics Ecosystem
  7. What is IRIDA?
  8. Sequencing Instruments Java Web application REST API User interface Central file storage area Internal Galaxy External Galaxy Command-line tools
  9. Emphasize that it’s a mock data set. Constructed from real sequences but created with “user generated” geographic locations. Locations were faked inside the IRIDA instance, and not after retrieval