A SOFTWARE SUITE FOR DATA
HARMONIZATION AND FEDERATION
Vincent Ferretti
Ontario Institute for Cancer Research
The Maelstrom Research Software Suite
Software development started in 2007
$3,800,000 CAD of investment so far
Onyx: Collection
Opal: Storage, Management, Harmonization
Mica: Publication
DataSHIELD: Analysis
Some User Stories
Name | Type | Activities | Tools
The Canadian Longitudinal Study on Aging (CLSA) | Single study, 50,000 participants | Collection, management, portal | Onyx, Opal, Mica
The Canadian Partnership for Tomorrow Project (CPTP) | Study consortium, 5 studies, 300,000 participants | Collection, harmonization, portal | Onyx, Opal, Mica
BBMRI-LPC | Network, >30 studies | Cataloguing | Mica
Maelstrom Research | Research project | Cataloguing, harmonization | Opal, Mica
Interconnect | Network | Cataloguing, (harmonization, federated data analysis) | Mica
BioSHaRE | Network | Cataloguing, harmonization, federated data analysis | DataSHIELD, Opal, Mica
1 - Data Harmonization with Opal
The Canadian Partnership for Tomorrow Project (CPTP)
• 5 cohorts with baseline data on ~300,000 participants
  • 5 different legislations, questionnaires, data access policies, languages, etc.
• Project’s objectives
  • To create harmonized datasets across the 5 cohorts
  • To create a data portal to browse harmonized datasets and request access to them
Phase 1
• The baseline Health and Risk Factor questionnaire (CoreQx)
  • 716 harmonized variables
Opal Software
A database application for integrating and storing data from
multiple and heterogeneous sources
• Used by studies to create central data repositories
Metadata in Opal
• Projects -> tables -> variables
• Tables are defined by customizable dictionaries in Excel format
• Variables are annotated with an arbitrary number of attributes
• Controlled vocabularies (taxonomies), e.g. ICD-10
• Maelstrom Research variable classification
  • More than 130 terms in 17 classes (e.g. Reproduction, Physical Measures)
Variable Name | Attribute Name | Attribute Value
Cancer_type | Diseases | Neoplasm
Asthma_ever | Diseases | Respiratory system (J00-J99)
Ever_smoke | Question label | [EN] Have you ever smoked? / [FR] Avez-vous déjà fumé?
Ever_smoke | Health behaviors | Tobacco
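The project -> table -> variable hierarchy and the free-form attributes can be pictured with a small data model. This is only an illustrative sketch in plain Python, not Opal's actual schema or API: the class names, the `annotated_with` helper, and the placement of these variables in a "CoreQx" table of a "CPTP" project are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Attribute:
    name: str                     # e.g. "Diseases" or "Question label"
    value: str                    # e.g. "Neoplasm"
    locale: Optional[str] = None  # set for translated values such as labels


@dataclass
class Variable:
    name: str
    attributes: List[Attribute] = field(default_factory=list)

    def values(self, attribute_name: str, locale: Optional[str] = None) -> List[str]:
        """Return the values of one attribute, optionally for a single locale."""
        return [
            a.value
            for a in self.attributes
            if a.name == attribute_name and (locale is None or a.locale == locale)
        ]


@dataclass
class Table:
    name: str
    variables: Dict[str, Variable] = field(default_factory=dict)

    def annotated_with(self, attribute_name: str) -> List[str]:
        """Names of the variables carrying at least one such attribute."""
        return [v.name for v in self.variables.values() if v.values(attribute_name)]


@dataclass
class Project:
    name: str
    tables: Dict[str, Table] = field(default_factory=dict)


# Hypothetical project and table names; the annotations are the slide's examples.
table = Table("CoreQx")
for var_name, attr_name, value, locale in [
    ("Cancer_type", "Diseases", "Neoplasm", None),
    ("Asthma_ever", "Diseases", "Respiratory system (J00-J99)", None),
    ("Ever_smoke", "Question label", "Have you ever smoked?", "en"),
    ("Ever_smoke", "Question label", "Avez-vous déjà fumé?", "fr"),
    ("Ever_smoke", "Health behaviors", "Tobacco", None),
]:
    variable = table.variables.setdefault(var_name, Variable(var_name))
    variable.attributes.append(Attribute(attr_name, value, locale))

project = Project("CPTP", {table.name: table})

disease_variables = project.tables["CoreQx"].annotated_with("Diseases")
french_label = table.variables["Ever_smoke"].values("Question label", locale="fr")
```

Because a variable can carry any number of attributes, one variable (here Ever_smoke) can hold both translated labels and classification terms.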
Data Derivation
Opal derives new variables by executing custom JavaScript code
• Useful for data validation, curation and harmonisation
User-friendly interfaces for recoding variables
JavaScript API for more advanced derivation
JavaScript code is executed by Opal when needed
Derived data is not persisted: it is exposed through views (virtual tables)
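The idea of a view whose derived values are computed on demand, rather than stored, can be sketched as follows. This is a plain-Python illustration of the principle, not Opal's JavaScript API: the `View` class, the source variable `SMK_STATUS` and its codes (1 = never, 2 = former, 3 = current) are hypothetical.

```python
class View:
    """A virtual table: derived values are computed on each read, never stored."""

    def __init__(self, source_rows, derivations):
        self.source_rows = source_rows  # the underlying table, a list of dicts
        self.derivations = derivations  # {derived variable name: function(row) -> value}

    def rows(self):
        # Run every derivation script against every source row at read time.
        for row in self.source_rows:
            yield {name: derive(row) for name, derive in self.derivations.items()}


def ever_smoke(row):
    """Recode a study-specific smoking status into a harmonized yes/no variable."""
    recoding = {1: False, 2: True, 3: True}    # hypothetical source codes
    return recoding.get(row.get("SMK_STATUS"))  # unknown or missing code -> None


source = [
    {"id": "p1", "SMK_STATUS": 1},
    {"id": "p2", "SMK_STATUS": 3},
    {"id": "p3", "SMK_STATUS": 9},  # a code outside the recoding map
]

view = View(source, {"id": lambda r: r["id"], "Ever_smoke": ever_smoke})
first_read = list(view.rows())

# Correcting the source is reflected on the next read, since nothing is cached.
source[2]["SMK_STATUS"] = 2
second_read = list(view.rows())
```

Since the view stores only the derivation functions, changing a recoding rule or correcting a source value requires no reprocessing of stored data.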
Deriving the CoreQx datasets with Opal
How to query and access these harmonized datasets?
The Mica Software
Software to create web data portals for individual studies or for study consortia
Study catalogue
• MR standard description of longitudinal studies
• Publication workflow
Datasets
• Data dictionaries, data harmonization, database federation
Data Access
• Online forms, request management workflow with roles
Mica2: New client-server architecture
[Architecture diagram labels: Mica Server, Opal Server, Data Persistence (MongoDB)]
The CPTP Data Portal
Study Catalogue
Querying Opal Servers for Metadata and
Aggregated Data
Dictionary Faceted Search
Variable Page
Real time summary statistics
Harmonization Result
Data Access Requests
• Researcher account registration
• Customized application form
• Application review workflow
• Email notifications
• Multilingual support
2 - Advanced Cataloguing with Mica
Maelstrom-research.org
• The Maelstrom Research website is powered by Mica
• Includes a catalogue of international networks and studies with annotated dictionaries
Current version
• 6 networks
• 129 studies
• 222 datasets
• 182,622 variables
Search Harmonisation Potential
Multi-dimensional Search Tool
3 - Data Analysis
The BioSHaRE Healthy Obese Project
• 10 studies from 7 European countries
• 200,000 subjects
• The HOP dataset: 103 harmonized variables
How to analyze these datasets
» without pooling data
» without accessing individual-level
data?
A Federated Approach
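One way to picture a federated analysis: each study computes non-identifying summaries locally, and only those summaries are combined centrally. The sketch below is an assumption-laden illustration in plain Python, not the DataSHIELD or Opal API; the function names and the sample values are hypothetical, and it omits the disclosure controls a real deployment would need.

```python
import math


def local_summary(values):
    """Runs inside each study: only aggregates leave the site."""
    return {
        "n": len(values),
        "sum": sum(values),
        "sum_sq": sum(v * v for v in values),
    }


def combine(summaries):
    """Runs at the coordinating site: pooled mean and sample variance."""
    n = sum(s["n"] for s in summaries)
    total = sum(s["sum"] for s in summaries)
    total_sq = sum(s["sum_sq"] for s in summaries)
    mean = total / n
    variance = (total_sq - n * mean ** 2) / (n - 1)
    return {"n": n, "mean": mean, "variance": variance}


# Hypothetical harmonized measurements held by two separate studies.
study_a = [22.0, 24.0, 26.0]
study_b = [30.0, 28.0]

pooled = combine([local_summary(study_a), local_summary(study_b)])
```

The result matches what a pooled analysis of all five values would give, yet no individual-level record is ever transferred between sites.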
Real Time Cross Tabulation on Harmonized Data
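A cross tabulation over several studies can be obtained by summing per-study contingency tables. The following is a minimal sketch of that idea in plain Python, not the portal's actual implementation; the helper names and the records are hypothetical, and only the variable names Ever_smoke and Asthma_ever are taken from the earlier annotation example.

```python
from collections import Counter


def local_crosstab(records, row_var, col_var):
    """Runs inside each study: count participants per (row, column) cell."""
    return Counter((r[row_var], r[col_var]) for r in records)


def merge_crosstabs(tables):
    """Runs at the portal: add up the cell counts returned by each study."""
    merged = Counter()
    for table in tables:
        merged.update(table)
    return merged


# Hypothetical harmonized records held by two separate studies.
study_a = [
    {"Ever_smoke": True, "Asthma_ever": True},
    {"Ever_smoke": True, "Asthma_ever": False},
    {"Ever_smoke": False, "Asthma_ever": False},
]
study_b = [
    {"Ever_smoke": True, "Asthma_ever": True},
    {"Ever_smoke": False, "Asthma_ever": False},
    {"Ever_smoke": False, "Asthma_ever": True},
]

merged = merge_crosstabs(
    local_crosstab(study, "Ever_smoke", "Asthma_ever") for study in (study_a, study_b)
)
```

Only cell counts cross the network, so this works when the studies share harmonized variable names and category codings.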
New Improved Version
Real Time Advanced Queries on Harmonized Data
More Advanced Analyses with R
RStudio Web Console
rstudio.bioshare.eu
More Information
• www.maelstrom-research.org
• www.obiba.org
• Code available at github.com/obiba
Please let us know, and acknowledge Maelstrom Research, if you are using our software. It is important for our funding and our ability to provide support.
Acknowledgement
Yannick Marcon and his software developer team
The Maelstrom Research scientific team
The research leading to these results has received funding from the
European Union Seventh Framework Programme (FP7/2007-2013) under
grant agreement n°261433 (Biobank Standardisation and Harmonisation
for Research Excellence in the European Union - BioSHaRE-EU)

Contenu connexe

Tendances

The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...OpenAIRE
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphIoan Toma
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the webChiara Del Vescovo
 
Open Archives Initiatives For Metadata Harvesting
Open Archives Initiatives For Metadata   HarvestingOpen Archives Initiatives For Metadata   Harvesting
Open Archives Initiatives For Metadata HarvestingNikesh Narayanan
 
Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Jisc RDM
 
Bioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas
 
Metadata harvesting
Metadata harvestingMetadata harvesting
Metadata harvestingAndrewLIS688
 
Data Sharing via Globus in the NIH Intramural Program
Data Sharing via Globus in the NIH Intramural ProgramData Sharing via Globus in the NIH Intramural Program
Data Sharing via Globus in the NIH Intramural ProgramGlobus
 
Using Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyUsing Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyLeila Zemmouchi-Ghomari
 
Iochem.carles bo
Iochem.carles boIochem.carles bo
Iochem.carles bomaredata
 
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard
Tracking compliance of the REF2021 policy with the CORE Repository DashboardTracking compliance of the REF2021 policy with the CORE Repository Dashboard
Tracking compliance of the REF2021 policy with the CORE Repository Dashboardpetrknoth
 
Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareKerstin Forsberg
 
ICIC 2017: New product presentations CAS
ICIC 2017: New product presentations CASICIC 2017: New product presentations CAS
ICIC 2017: New product presentations CASDr. Haxel Consult
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...Kerstin Forsberg
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodeiASIS&T
 
Rots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesRots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesASIS&T
 
Health Sciences Research Informatics, Powered by Globus
Health Sciences Research Informatics, Powered by GlobusHealth Sciences Research Informatics, Powered by Globus
Health Sciences Research Informatics, Powered by GlobusGlobus
 

Tendances (20)

The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
The Scholix Framework and the OpenAIRE Scholexplorer Service (OpenAIRE webina...
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
OAI and OAI-PMH
OAI and OAI-PMHOAI and OAI-PMH
OAI and OAI-PMH
 
Open Archives Initiatives For Metadata Harvesting
Open Archives Initiatives For Metadata   HarvestingOpen Archives Initiatives For Metadata   Harvesting
Open Archives Initiatives For Metadata Harvesting
 
Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016
 
Bioschemas findability and interoperability
Bioschemas findability and interoperabilityBioschemas findability and interoperability
Bioschemas findability and interoperability
 
Metadata harvesting
Metadata harvestingMetadata harvesting
Metadata harvesting
 
Data Sharing via Globus in the NIH Intramural Program
Data Sharing via Globus in the NIH Intramural ProgramData Sharing via Globus in the NIH Intramural Program
Data Sharing via Globus in the NIH Intramural Program
 
Using Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case studyUsing Linked Data Resources to generate web pages based on a BBC case study
Using Linked Data Resources to generate web pages based on a BBC case study
 
Iochem.carles bo
Iochem.carles boIochem.carles bo
Iochem.carles bo
 
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard
Tracking compliance of the REF2021 policy with the CORE Repository DashboardTracking compliance of the REF2021 policy with the CORE Repository Dashboard
Tracking compliance of the REF2021 policy with the CORE Repository Dashboard
 
Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17
 
Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcare
 
ICIC 2017: New product presentations CAS
ICIC 2017: New product presentations CASICIC 2017: New product presentations CAS
ICIC 2017: New product presentations CAS
 
McCallum, Making and Moving Metadata: Two Library of Congress Initiatives
McCallum, Making and Moving Metadata: Two Library of Congress InitiativesMcCallum, Making and Moving Metadata: Two Library of Congress Initiatives
McCallum, Making and Moving Metadata: Two Library of Congress Initiatives
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Rots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal AgenciesRots RDAP11 Data Archives in Federal Agencies
Rots RDAP11 Data Archives in Federal Agencies
 
Health Sciences Research Informatics, Powered by Globus
Health Sciences Research Informatics, Powered by GlobusHealth Sciences Research Informatics, Powered by Globus
Health Sciences Research Informatics, Powered by Globus
 

Similaire à BioSHaRE: Opal and Mica: a software suite for data harmonization and federation - Vincent Ferretti - Ontario Institute for Cancer Research

MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...Yongyao Jiang
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
UKRDDS Project Overview - Feb 2016
UKRDDS Project Overview - Feb 2016UKRDDS Project Overview - Feb 2016
UKRDDS Project Overview - Feb 2016Christopher Brown
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsKen Karapetyan
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013Frauke Ziedorn
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College LondonTorsten Reimer
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...Ardan Patwardhan
 
Matthew Hale - Open Source at the Kings Fund
Matthew Hale - Open Source at the Kings FundMatthew Hale - Open Source at the Kings Fund
Matthew Hale - Open Source at the Kings FundTracy Kent
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College LondonSarah Anna Stewart
 
Accelerating your Research with Microsoft Azure (June 2015)
Accelerating your Research with Microsoft Azure (June 2015)Accelerating your Research with Microsoft Azure (June 2015)
Accelerating your Research with Microsoft Azure (June 2015)Microsoft Azure for Research
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...BigData_Europe
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutesDaniela G. Duca
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc RDM
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Repositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsRepositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsJisc RDM
 

Similaire à BioSHaRE: Opal and Mica: a software suite for data harmonization and federation - Vincent Ferretti - Ontario Institute for Cancer Research (20)

MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
MUDROD - Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Me...
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
The UK National Chemical Database Service – an integration of commercial and ...
The UK National Chemical Database Service – an integration of commercial and ...The UK National Chemical Database Service – an integration of commercial and ...
The UK National Chemical Database Service – an integration of commercial and ...
 
UKRDDS Project Overview - Feb 2016
UKRDDS Project Overview - Feb 2016UKRDDS Project Overview - Feb 2016
UKRDDS Project Overview - Feb 2016
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...Marrying ACDLabs technologies to eScience Projects at the  Royal Society of C...
Marrying ACDLabs technologies to eScience Projects at the Royal Society of C...
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
 
Green Shoots: Research Data Management Pilot at Imperial College London
Green Shoots:Research Data Management Pilot at Imperial College LondonGreen Shoots:Research Data Management Pilot at Imperial College London
Green Shoots: Research Data Management Pilot at Imperial College London
 
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
2nd Microscopy Congress: Public archiving of bio-imaging data - perspectives,...
 
Matthew Hale - Open Source at the Kings Fund
Matthew Hale - Open Source at the Kings FundMatthew Hale - Open Source at the Kings Fund
Matthew Hale - Open Source at the Kings Fund
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
Accelerating your Research with Microsoft Azure (June 2015)
Accelerating your Research with Microsoft Azure (June 2015)Accelerating your Research with Microsoft Azure (June 2015)
Accelerating your Research with Microsoft Azure (June 2015)
 
OAI-PMH
OAI-PMHOAI-PMH
OAI-PMH
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
DSpace for Data Revisited
DSpace for Data RevisitedDSpace for Data Revisited
DSpace for Data Revisited
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutes
 
Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016Jisc Research data shared service overview and update - May 2016
Jisc Research data shared service overview and update - May 2016
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Repositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projectsRepositories unleashing data and Jisc projects
Repositories unleashing data and Jisc projects
 

Plus de Lisette Giepmans

European Reference Network on genetic tumour risk syndromes - GENTURIS
European Reference Network on genetic tumour risk syndromes - GENTURISEuropean Reference Network on genetic tumour risk syndromes - GENTURIS
European Reference Network on genetic tumour risk syndromes - GENTURISLisette Giepmans
 
Kick off SPRINT@Work 16 jan 2014
Kick off SPRINT@Work 16 jan 2014Kick off SPRINT@Work 16 jan 2014
Kick off SPRINT@Work 16 jan 2014Lisette Giepmans
 
BioSHaRE - UMCG Close out meeting 20160118
BioSHaRE - UMCG Close out meeting 20160118 BioSHaRE - UMCG Close out meeting 20160118
BioSHaRE - UMCG Close out meeting 20160118 Lisette Giepmans
 
BioSHaRE Catalogue of tools and services for data sharing
BioSHaRE Catalogue of tools and services for data sharingBioSHaRE Catalogue of tools and services for data sharing
BioSHaRE Catalogue of tools and services for data sharingLisette Giepmans
 
BioSHaRE: Biosample quality for omics downstream analysis - Gabriele Anton -...
BioSHaRE: Biosample quality for omics downstream analysis  - Gabriele Anton -...BioSHaRE: Biosample quality for omics downstream analysis  - Gabriele Anton -...
BioSHaRE: Biosample quality for omics downstream analysis - Gabriele Anton -...Lisette Giepmans
 
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...Lisette Giepmans
 
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...Lisette Giepmans
 
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...Lisette Giepmans
 
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...Lisette Giepmans
 
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...Lisette Giepmans
 
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...Lisette Giepmans
 
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...Lisette Giepmans
 
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal Pathways
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal PathwaysBioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal Pathways
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal PathwaysLisette Giepmans
 
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERM
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERMBRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERM
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERMLisette Giepmans
 
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...Lisette Giepmans
 
BioSHaRE Latest tools and services for data sharing - introduction
BioSHaRE Latest tools and services for data sharing - introductionBioSHaRE Latest tools and services for data sharing - introduction
BioSHaRE Latest tools and services for data sharing - introductionLisette Giepmans
 

Plus de Lisette Giepmans (16)

European Reference Network on genetic tumour risk syndromes - GENTURIS
European Reference Network on genetic tumour risk syndromes - GENTURISEuropean Reference Network on genetic tumour risk syndromes - GENTURIS
European Reference Network on genetic tumour risk syndromes - GENTURIS
 
Kick off SPRINT@Work 16 jan 2014
Kick off SPRINT@Work 16 jan 2014Kick off SPRINT@Work 16 jan 2014
Kick off SPRINT@Work 16 jan 2014
 
BioSHaRE - UMCG Close out meeting 20160118
BioSHaRE - UMCG Close out meeting 20160118 BioSHaRE - UMCG Close out meeting 20160118
BioSHaRE - UMCG Close out meeting 20160118
 
BioSHaRE Catalogue of tools and services for data sharing
BioSHaRE Catalogue of tools and services for data sharingBioSHaRE Catalogue of tools and services for data sharing
BioSHaRE Catalogue of tools and services for data sharing
 
BioSHaRE: Biosample quality for omics downstream analysis - Gabriele Anton -...
BioSHaRE: Biosample quality for omics downstream analysis  - Gabriele Anton -...BioSHaRE: Biosample quality for omics downstream analysis  - Gabriele Anton -...
BioSHaRE: Biosample quality for omics downstream analysis - Gabriele Anton -...
 
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...
BioSHaRE: EnviroSHAPER Noise Model and The Rapid Inquiry Facility (RIF); link...
 
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
 
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...
BioSHaRE: Risk stratification using genomic and lifestyle information - Samul...
 
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...
BioSHaRE: Making data useful without direct sharing: Cafe Variome and Omics b...
 
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...
BioSHaRE: Evaluation of tools and MEthods for Sharing Data - ENMESHD - Madele...
 
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
BioSHaRE: Maelstrom Research tools for data harmonization and co-analysis - I...
 
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
 
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal Pathways
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal PathwaysBioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal Pathways
BioSHaRE: The BBMRI-ERIC ELSI Services - Jasper Bovenberg - Legal Pathways
 
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERM
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERMBRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERM
BRIF: Bioresource Research Impact Factor - Anne Cambon-Thomsen - INSERM
 
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...
BioSHaRE: Operationalizing responsible data sharing and access: GA4GH - Barth...
 
BioSHaRE Latest tools and services for data sharing - introduction
BioSHaRE Latest tools and services for data sharing - introductionBioSHaRE Latest tools and services for data sharing - introduction
BioSHaRE Latest tools and services for data sharing - introduction
 

BioSHaRE: Opal and Mica: a software suite for data harmonization and federation - Vincent Ferretti - Ontario Institute for Cancer Research

  • 1. A SOFTWARE SUITE FOR DATA HARMONIZATION AND FEDERATION Vincent Ferretti Ontario Institute for Cancer Research
  • 2. The Maelstrom Research Software Suite. Software development started in 2007; $3,800,000 CAD of investment so far. Components: Onyx, Opal, Mica, DataSHIELD. Functions covered: collection, storage, management, harmonization, publication, analysis.
  • 3. Some User Stories (name; type; activities). The Canadian Longitudinal Study on Aging (CLSA); single study, 50,000 participants; collection, management, portal. The Canadian Partnership for Tomorrow Project (CPTP); study consortium, 5 studies, 300,000 participants; collection, harmonization, portal. BBMRI-LPC; network, >30 studies; cataloguing. Maelstrom Research; research project; cataloguing, harmonization. Interconnect; network; cataloguing, (harmonization, federated data analysis). BioSHaRE; network; cataloguing, harmonization, federated data analysis. Tools column, in the order extracted from the slide: DataSHIELD Onyx Opal Mica Onyx Opal Mica Mica Opal Mica Mica Opal Mica
  • 4. 1 - Data Harmonization with Opal. The Canadian Partnership for Tomorrow Project (CPTP): 5 cohorts with baseline data on ~300,000 participants, under 5 different legislations, questionnaires, data access policies, languages, etc. Project objectives: to create harmonized datasets across the 5 cohorts; to create a data portal to browse harmonized datasets and request access to them. Phase 1: the baseline Health and Risk Factor questionnaire (CoreQx), 716 harmonized variables.
  • 5. Opal Software. A database application for integrating and storing data from multiple and heterogeneous sources. Used by studies to create central data repositories.
  • 6. Metadata in Opal. Projects -> tables -> variables. Tables are defined by customizable dictionaries in Excel format. Variables are annotated with an arbitrary number of attributes. Controlled vocabularies (taxonomies), e.g. ICD-10. Maelstrom Research variable classification: more than 130 terms in 17 classes (e.g. Reproduction, Physical Measures). Example annotations (variable name | attribute name | attribute value): Cancer_type | Diseases | Neoplasm; Asthma_ever | Diseases | Respiratory system (J00-J99); Ever_smoke | Question label | [EN] Have you ever smoked? [FR] Avez-vous déjà fumé?; Ever_smoke | Health behaviors | Tobacco.
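The annotation scheme on this slide can be pictured as a list of variables, each carrying any number of name/value attributes. The sketch below is only an illustration in plain JavaScript: the `variables` array, the `locale` field, and the `findByAttribute` helper are hypothetical and are not Opal's data model or API. The variable names and attribute values are the ones shown on the slide.

```javascript
// Hypothetical in-memory model of annotated variables (not Opal's API).
const variables = [
  { name: "Cancer_type", attributes: [{ name: "Diseases", value: "Neoplasm" }] },
  {
    name: "Asthma_ever",
    attributes: [{ name: "Diseases", value: "Respiratory system (J00-J99)" }],
  },
  {
    name: "Ever_smoke",
    attributes: [
      { name: "Question label", locale: "en", value: "Have you ever smoked?" },
      { name: "Question label", locale: "fr", value: "Avez-vous déjà fumé?" },
      { name: "Health behaviors", value: "Tobacco" },
    ],
  },
];

// Return the names of variables carrying a given attribute,
// optionally restricted to one attribute value.
function findByAttribute(vars, attributeName, value) {
  return vars
    .filter((v) =>
      v.attributes.some(
        (a) => a.name === attributeName && (value === undefined || a.value === value)
      )
    )
    .map((v) => v.name);
}

const diseaseVariables = findByAttribute(variables, "Diseases");
const tobaccoVariables = findByAttribute(variables, "Health behaviors", "Tobacco");
console.log(diseaseVariables, tobaccoVariables);
```

Because a variable may hold several attributes, the same variable (here `Ever_smoke`) can carry both a multilingual label and a classification term.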
  • 7.
  • 8. Data Derivation. Opal derives new variables by executing custom JavaScript code. Useful for data validation, curation and harmonization. User-friendly interfaces for recoding variables; JavaScript API for more advanced derivation.
  • 9. The JavaScript code is executed by Opal when needed. Derived data is not persisted: it is exposed as views (virtual tables).
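To make the idea of a non-persisted derived variable concrete, here is a minimal sketch in plain JavaScript. It does not use Opal's JavaScript API: the source variable `SMOKE_STATUS`, its codes, the recoding map, and the `createView` helper are all assumptions made for illustration. The point is only that the derived value is computed from the source row each time it is read, so nothing is stored.

```javascript
// Hypothetical study-specific rows; SMOKE_STATUS and its codes are assumed.
const sourceRows = [
  { id: "P1", SMOKE_STATUS: "1" },
  { id: "P2", SMOKE_STATUS: "2" },
  { id: "P3", SMOKE_STATUS: "9" },
];

// Recoding rule from study codes to harmonized categories (assumed mapping).
const everSmokeMap = { "1": "YES", "2": "NO" };

// Derivation script: unmapped codes become null (missing).
function deriveEverSmoke(row) {
  const code = row.SMOKE_STATUS;
  return Object.prototype.hasOwnProperty.call(everSmokeMap, code)
    ? everSmokeMap[code]
    : null;
}

// A "view": holds derivation functions, never derived values.
function createView(rows, derivations) {
  return {
    getValue(id, variableName) {
      const row = rows.find((r) => r.id === id);
      if (!row) throw new Error(`Unknown participant: ${id}`);
      return derivations[variableName](row);
    },
  };
}

const view = createView(sourceRows, { Ever_smoke: deriveEverSmoke });
const before = view.getValue("P3", "Ever_smoke"); // code "9" is unmapped

// Correcting the source is reflected on the next read; no recomputation step.
sourceRows[2].SMOKE_STATUS = "1";
const after = view.getValue("P3", "Ever_smoke");
console.log(before, after);
```

A design consequence of this approach is that fixing a source value or a derivation rule never leaves stale harmonized data behind, at the cost of recomputing on each access.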
  • 10. Deriving the CoreQx datasets with Opal
  • 11. Deriving the CoreQx datasets with Opal
  • 12. Deriving the CoreQx datasets with Opal How to query and access these harmonized datasets?
  • 13. The Mica Software. Software to create web data portals for individual studies or for study consortia. Study catalogue: MR standard description of longitudinal studies; publication workflow. Datasets: data dictionaries, data harmonization, database federation. Data access: online forms, request management workflow with roles. Diagram labels: Data Persistence, MongoDB, Opal Server, Mica Server. Mica2: new client-server architecture.
  • 14. The CPTP Data Portal
  • 16.
  • 17. Querying Opal Servers for Metadata and Aggregated Data
  • 19. Variable Page Real time summary statistics
  • 21. Data Access Requests: researcher account registration; customized application form; application review workflow; email notifications; multiple languages.
  • 22. 2 - Advanced Cataloguing with Mica (maelstrom-research.org). The Maelstrom Research web site is powered by Mica. It includes a catalogue of international networks and studies with annotated dictionaries. Current version: 6 networks, 129 studies, 222 datasets, 182,622 variables.
  • 25. 3 - Data Analysis. The BioSHaRE Healthy Obese Project (HOP): 10 studies from 7 European countries; 200,000 subjects; the HOP dataset has 103 harmonized variables. How to analyze these datasets without pooling data and without accessing individual-level data?
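One general way to answer the question on this slide is to have each study return only aggregate quantities, which the analyst then combines. The sketch below illustrates that idea in plain JavaScript for a mean and a sample variance. It is not DataSHIELD's actual protocol or API; the function names and the example values are made up for illustration.

```javascript
// Runs inside each study: only aggregates leave the site,
// never individual-level values.
function localSummary(values) {
  const n = values.length;
  const sum = values.reduce((acc, v) => acc + v, 0);
  const sumSq = values.reduce((acc, v) => acc + v * v, 0);
  return { n, sum, sumSq };
}

// Runs at the analyst's side: combines per-study aggregates
// into the overall mean and sample variance.
function combineSummaries(summaries) {
  const total = summaries.reduce(
    (acc, s) => ({ n: acc.n + s.n, sum: acc.sum + s.sum, sumSq: acc.sumSq + s.sumSq }),
    { n: 0, sum: 0, sumSq: 0 }
  );
  const mean = total.sum / total.n;
  const variance = (total.sumSq - total.n * mean * mean) / (total.n - 1);
  return { n: total.n, mean, variance };
}

// Made-up measurements for two hypothetical studies.
const studyA = localSummary([22, 24, 26]);
const studyB = localSummary([30, 34]);

const pooled = combineSummaries([studyA, studyB]);
console.log(pooled);
```

The combined result equals what would be obtained from the pooled individual-level data, because the count, the sum, and the sum of squares are additive across studies. A real deployment would also need disclosure controls (for example, refusing to report very small groups), which this sketch omits.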
  • 27. Real Time Cross Tabulation on Harmonized Data
  • 29. Real Time Advanced Queries on Harmonized Data
  • 31. R Studio Web Console rstudio.bioshare.eu
  • 32. More Information: www.maelstrom-research.org; www.obiba.org; code available at github.com/obiba. If you are using our software, please let us know and acknowledge Maelstrom Research; it is important for our funding and our ability to provide support.
  • 33. Acknowledgement. Yannick Marcon and his software developer team. The Maelstrom Research scientific team. The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement n°261433 (Biobank Standardisation and Harmonisation for Research Excellence in the European Union - BioSHaRE-EU).