SlideShare une entreprise Scribd logo
1  sur  29
PhenoSpD: an integrated toolkit for phenotypic
correlation estimation and multiple testing
correction using GWAS summary statistics
Jie Zheng
The 12th International Conference on Genomics
27th Oct 2017
One belt one road
Bristol
An invitation from GIGA Science and BGI
Phenome wide association study (PheWAS)
• PheWAS analyzes many phenotypes
compared to a single or multiple
genetic variant(s).
• PheWAS is common place, e.g.
• MR-PheWAS. Millard et al, Sci Rep,
2015
• Haycock et al, JAMA Oncology, 2017
It is likely that longer telomeres increase risk for
several cancers but reduce risk for some non-
neoplastic diseases, including cardiovascular
diseases.
Post GWAS era: a database of harmonized GWAS summary
data in MRC Integrative Epidemiology Unit in Bristol
The network of post GWAS analysis software
Centralized
Database
PhenoSpD MR-Base
LD Hub
LD Hub for LD Score Regression
Univariate analysis: SNP heritability
0 20 40 60 80 100
0246810
LD Score
Chisquare
Bivariate analysis: genetic correlations
LD Hub web app
Scope of LD Hub
LD Hub
Database
233 publicly
available
GWAS traits
Test Center:
On-the-fly LD
score regression
analysis pipeline
Lookup Center:
Existing LD score
regression results
lookup
GWAShare
Center:
Summary data
sharing & user
contribution
MR-Base for Mendelian randomization
SNPs Trait 1
Confounders
Trait 2
Trait 1 = risk factor (exposure)
Trait 2 = disease (outcome)
Two sample Mendelian randomization
MR-Base web interface
Scope of MR-Base
MR-Base
SNP lookups
12 two-sample
MR
methodologies
MR-Base
R- package
Database
~2000 GWAS
(1100 with full data)
PhenoSpD: why we need it?
• Molecular phenotypes such as
metabolites are highly correlated.
• Multiple testing correction is a
headache problem: Bonferroni
correction is definitely over killed.
• When individual-level phenotype
data is available, phenotypic
correlation matrix can be calculated
easily.
• However, in real world, phenotype
data is normally not available.
• In MR-Base / LD Hub, we only have
GWAS summary statistics.
• We need a magic hand to correct
multiple testing!
Wurtz et al, J Am Coll Cardiol. 2013
PhenoSpD: how it works
1. Harmonize GWAS summary statistics
2. Estimate phenotypic correlation matrix
using metaCCA / LD score regression
3. Apply Spectral decomposition (SpD) to
estimate the equivalent number of
independent variables in the
phenotypic correlation matrix
MetaCCA
• Summary statistics-based multivariate association
testing using canonical correlation analysis –
Cichonska et al Bioinformatics 2016
• As a sub-product, it provides a way to estimate
phenotypic correlation matrix 𝑌𝑌, which is equal
to the Pearson correlation between regression
coefficients (betas) of two GWASs
• The assumption is, both traits are from the same
samples
• PS: 1000 Genomes is not the best option to
estimate LD matrix between SNPs. See Benner et
al AJHG 2017, and LDstore
LD score regression
• Method to estimate SNP heritability and
genetic correlations -- Bulik-Sullivan et al NG
2014, 2015
• It is also provides a way to estimate phenotypic
correlations between two traits, which is the
intercept term of the bi-variate LD score
regression.
• Compare to metaCCA, it adjusted for sample
overlap automatically
• Both genetic and phenotypic correlation
matrixes can be found in LD Hub
SNPSpD and MatSpD
• SNPSpD: A simple correction for multiple testing for SNPs in LD using
spectral decomposition (SpD). Nyholt 2004 AJHG
• MatSpD: MatrixSpD, estimate the equivalent number of independent
variables in a correlation (r) matrix
• The same method can be used to estimate the number of
independent variables in a phenotypic correlation matrix
Simulation
• How accurate is the phenotypic correlation estimation using GWAS results?
• Is there any parameters strongly affecting such estimation?
Model N_ind_A
N_in
d_B N_overlap Overlap_% N_SNPs N_EnvF N_simu y1_y2_A_obs y1_y2_B_obs Mean_y1_y2_est SD_y1_y2_est Deviation_obs_est (%)
sample size 1 300 300 150 50% 1000 100 100 -0.70 -0.70 -0.46 0.56 34.1%
sample size 2 500 500 250 50% 1000 100 100 -0.71 -0.70 -0.47 0.56 33.0%
sample size 3 1000 1000 500 50% 1000 100 100 -0.70 -0.70 -0.47 0.54 33.3%
sample size 4 3000 3000 1500 50% 1000 100 100 -0.70 -0.70 -0.46 0.54 33.6%
sample size 5 5000 5000 2500 50% 1000 100 100 -0.70 -0.70 -0.47 0.54 33.2%
sample size 6 10000 10000 5000 50% 1000 100 100 -0.71 -0.71 -0.47 0.54 33.9%
sample overlap 1 5000 5000 1000 10% 1000 100 100 -0.70 -0.70 -0.13 0.39 82.1%
sample overlap 2 5000 5000 2000 20% 1000 100 100 -0.70 -0.70 -0.23 0.47 67.2%
sample overlap 3 5000 5000 3000 30% 1000 100 100 -0.71 -0.71 -0.33 0.47 54.0%
sample overlap 4 5000 5000 4000 40% 1000 100 100 -0.71 -0.71 -0.40 0.51 43.2%
sample overlap 5 5000 5000 5000 50% 1000 100 100 -0.71 -0.71 -0.47 0.53 33.3%
sample overlap 6 5000 5000 6000 60% 1000 100 100 -0.71 -0.71 -0.53 0.59 25.2%
sample overlap 7 5000 5000 7000 70% 1000 100 100 -0.70 -0.70 -0.58 0.57 17.5%
sample overlap 8 5000 5000 8000 80% 1000 100 100 -0.70 -0.70 -0.62 0.65 11.4%
sample overlap 9 5000 5000 9000 90% 1000 100 100 -0.71 -0.71 -0.67 0.67 5.8%
unbalance sample 1 5000 5000 9000 90% 1000 100 100 -0.71 -0.71 -0.67 0.67 5.8%
unbalance sample 2 5000 6000 9000 82% 1000 100 100 -0.71 -0.71 -0.64 0.66 9.8%
unbalance sample 3 5000 8000 9000 69% 1000 100 100 -0.70 -0.70 -0.58 0.65 17.3%
unbalance sample 4 5000 10000 9000 60% 1000 100 100 -0.70 -0.70 -0.54 0.62 22.9%
unbalance sample 5 5000 13000 9000 50% 1000 100 100 -0.70 -0.70 -0.48 0.54 30.9%
number of SNPs 1 5000 5000 2500 50% 10 100 100 -0.70 -0.70 -0.44 0.73 38.1%
number of SNPs 2 5000 5000 2500 50% 100 100 100 -0.70 -0.70 -0.48 0.53 34.1%
number of SNPs 3 5000 5000 2500 50% 500 100 100 -0.70 -0.70 -0.47 0.53 34.3%
number of SNPs 4 5000 5000 2500 50% 1000 100 100 -0.71 -0.70 -0.47 0.56 33.5%
number of SNPs 5 5000 5000 2500 50% 5000 100 100 -0.70 -0.70 -0.47 0.55 33.6%
number of SNPs 6 5000 5000 2500 50% 10000 100 100 -0.71 -0.71 -0.47 0.59 33.7%
Accuracy tests using real data
The estimated phenotypic correlations have
good agreement with observed phenotypic
correlations
The exceptions are traits with limited sample size
(therefore limited sample overlap).
• Shin et al provided the observed phenotypic correlation matrix for 452 metabolites, which can be used as a
test dataset
• So we compared the observed phenotypic correlation with the estimated phenotypic correlation using
PhenoSpD.
Growth importance of PhenoSpD
• PhenoSpD is particularly useful for multiple GWASs from the same
samples, e.g. complex molecular traits such as metabolites and
cytokines
• It can also be applied to all traits in MR-Base / LD Hub, which we can
split traits into groups, e.g. all traits in GIANT consortium are highly
possible to be correlated and majority of them are from the same
sample
Real case application in MR-Base and LD Hub
Consortium / First
author
Category N_traits N_SNPs N_correlations N_independent_traits
Kettunen Blood metabolites 123 9826292 7503 44.9
Shin Metaoblites 451 2482345 101475 324.4
Roederer Immune system
phenotypes
151 1585187 11325 94.2
CARDIOGRAM 2 335391 1 1
TRICL 4 335391 6 3
TAG 4 1449634 6 3.98
SSGAC 7 1449634 21 6
PGC 4 335391 6 3.644
Leptin 2 1449634 1 1
MAGIC 16 1449634 120 11.098
IIBDGC 3 335391 3 2
Hrgene 8 1449634 28 7
HaemGen 6 1449634 15 5
GPC 6 1449634 15 5
GLGC 4 1449634 6 3
GIANT 15 1449634 105 10.1097
GEFOS 3 1449634 3 3
CKDGen 9 335391 36 8
EGG 4 1449634 6 4
GIS 2 2029112 1 1
GUGC 2 2449580 1 1
ENIGMA 7 7237736 21 6
UK Biobank 5 9440243 9 5
Others 24 / / 24
All 862 / 120713 577.3317
Number of independent traits in MR-Base
Consortium /
First author
Category N_traits N_SNPs N_correlations N_independent_traits
All traits All traits 221 / 24310 134.1167
Number of independent traits in LD Hub
Growth importance of PhenoSpD
• There is a great potential to apply PhenoSpD to multiple traits in large
scale biobanks and cohorts such as UK Biobank, China Kadoorie
Biobank, HUNT study (all traits in one sample)
UK Biobank release from Ben Neale’s group
• RAPID GWAS OF THOUSANDS OF
PHENOTYPES FOR 337,000 SAMPLES IN
THE UK BIOBANK
(http://www.nealelab.is/blog/2017/7/
19/rapid-gwas-of-thousands-of-
phenotypes-for-337000-samples-in-
the-uk-biobank)
• GWAS summary statistics of 337,000
European samples are available for
over 2,400 human traits, everyone can
access and download the results.
• ~600 traits are heritable, which are the
most valuable data
PhenoSpD application
• Assess the potential causal relationship between genetic variation, DNA methylation and 139
complex traits.
• PhenoSpD:
139 outcomes  62 independent outcomes
Hypothesis free MR of DNA methylation on 139 human traits
Links for PhenoSpD
• PhenoSpD Paper is on bioRxiv:
https://www.biorxiv.org/content/early/2017/07/25/148627
• R scripts of PhenoSpD can be found on MRC-IEU github:
https://github.com/MRCIEU/PhenoSpD
• LD Hub: http://ldsc.broadinstitute.org/ldhub/
• MR-Base: www.mrbase.org
Acknowledgements
• LD Hub team
• Jie Zheng
• David M Evans
• Benjamin Neale
• MR-Base team
• Gibran Hemani
• Jie Zheng
• George Davey Smith
• Tom Gaunt
• Philip Haycock
• PhenoSpD team
• Jie Zheng
• Tom Richardson
• Louise Millard
• Gibran Hemani
• Chris Raistrick
• Bjarni Vilhjalmsson
• Philip Haycock
• Tom Gaunt
Q & A
Thank you!
Questions welcomed

Contenu connexe

Tendances

Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisAlexander Decker
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
Detection of novel metabolites and enzyme functions though in silico expansio...
Detection of novel metabolites and enzyme functions though in silico expansio...Detection of novel metabolites and enzyme functions though in silico expansio...
Detection of novel metabolites and enzyme functions though in silico expansio...James Jeffryes
 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mappingVikas Verma
 
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...CrimsonPublishersMAPP
 
Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Gerald Lushington
 
Shotmap meta center_2014
Shotmap meta center_2014Shotmap meta center_2014
Shotmap meta center_2014tsharpton
 
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...Sean Ekins
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Chirag Patel
 
Final MTBNZ.pptx
Final MTBNZ.pptxFinal MTBNZ.pptx
Final MTBNZ.pptxAri Mandler
 
Multi-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresMulti-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresYosuke Tanigawa
 
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...Roberto Anglani
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminarVarsha Gayatonde
 
A miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformA miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformQing Chen
 
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]Dylan Easterday
 
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...FOODCROPS
 
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...Ken Rogan
 

Tendances (20)

Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasis
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
Detection of novel metabolites and enzyme functions though in silico expansio...
Detection of novel metabolites and enzyme functions though in silico expansio...Detection of novel metabolites and enzyme functions though in silico expansio...
Detection of novel metabolites and enzyme functions though in silico expansio...
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mapping
 
Basics of association_mapping
Basics of association_mappingBasics of association_mapping
Basics of association_mapping
 
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...
Crimson Publishers-Potential Application of Raman Micro-Spectroscopy as an In...
 
Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...Personalized medicine via molecular interrogation, data mining and systems bi...
Personalized medicine via molecular interrogation, data mining and systems bi...
 
Shotmap meta center_2014
Shotmap meta center_2014Shotmap meta center_2014
Shotmap meta center_2014
 
Gdt 2-126 (1)
Gdt 2-126 (1)Gdt 2-126 (1)
Gdt 2-126 (1)
 
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...Looking Back at Mycobacterium tuberculosis Mouse  Efficacy Testing To Move Ne...
Looking Back at Mycobacterium tuberculosis Mouse Efficacy Testing To Move Ne...
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016
 
Final MTBNZ.pptx
Final MTBNZ.pptxFinal MTBNZ.pptx
Final MTBNZ.pptx
 
Multi-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresMulti-trait modeling in polygenic scores
Multi-trait modeling in polygenic scores
 
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...
Loss of Connectivity in Cancer Co-Expression Networks - PLoS ONE 9(1): e87075...
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminar
 
A miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platformA miniaturized sandwich immunoassay platform
A miniaturized sandwich immunoassay platform
 
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]
Effect of Food Source on Enzymatic Activity in C. maculatus [draft 2]
 
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...
Jiankang Wang. Principle of QTL mapping and inclusive composite interval mapp...
 
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...
Bioengineered 3D Co culture Lung In Vitro Models: Platforms to Integrate Cell...
 

Similaire à Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a multiple testing correction for the human phenome

Aug2015 analysis team 10 mason epigentics
Aug2015 analysis team 10 mason epigenticsAug2015 analysis team 10 mason epigentics
Aug2015 analysis team 10 mason epigenticsGenomeInABottle
 
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerBioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerTom Kelly
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Elia Brodsky
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...Lars Juhl Jensen
 
ArrayBridge(2016)
ArrayBridge(2016)ArrayBridge(2016)
ArrayBridge(2016)Xing Wang
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Sean Ekins
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoverySean Ekins
 
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...John Blue
 
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...Thermo Fisher Scientific
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Rajarshi Guha
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGenomeInABottle
 
Choosing the Right Microbial Typing Method: A Quantitative Approach
Choosing the Right Microbial Typing Method: A Quantitative ApproachChoosing the Right Microbial Typing Method: A Quantitative Approach
Choosing the Right Microbial Typing Method: A Quantitative ApproachJoão André Carriço
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverySean Ekins
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917GenomeInABottle
 
NGS-Based Clinical Analysis
NGS-Based Clinical AnalysisNGS-Based Clinical Analysis
NGS-Based Clinical AnalysisDelaina Hawkins
 

Similaire à Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a multiple testing correction for the human phenome (20)

Aug2015 analysis team 10 mason epigentics
Aug2015 analysis team 10 mason epigenticsAug2015 analysis team 10 mason epigentics
Aug2015 analysis team 10 mason epigentics
 
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerBioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
 
7 0
7 07 0
7 0
 
ArrayBridge(2016)
ArrayBridge(2016)ArrayBridge(2016)
ArrayBridge(2016)
 
Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug Discovery
 
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
 
SPECTROPHOTOMETRIC MEASUREMENT OF SOLUBILITY TEST TURBIDITY AS AN IMPROVED DI...
SPECTROPHOTOMETRIC MEASUREMENT OF SOLUBILITY TEST TURBIDITY AS AN IMPROVED DI...SPECTROPHOTOMETRIC MEASUREMENT OF SOLUBILITY TEST TURBIDITY AS AN IMPROVED DI...
SPECTROPHOTOMETRIC MEASUREMENT OF SOLUBILITY TEST TURBIDITY AS AN IMPROVED DI...
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
 
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...
Global Gene Expression Profiles from Breast Tumor Samples using the Ion Ampli...
 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
 
microarray_paper
microarray_papermicroarray_paper
microarray_paper
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
Choosing the Right Microbial Typing Method: A Quantitative Approach
Choosing the Right Microbial Typing Method: A Quantitative ApproachChoosing the Right Microbial Typing Method: A Quantitative Approach
Choosing the Right Microbial Typing Method: A Quantitative Approach
 
dual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discoverydual-event machine learning models to accelerate drug discovery
dual-event machine learning models to accelerate drug discovery
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
NGS-Based Clinical Analysis
NGS-Based Clinical AnalysisNGS-Based Clinical Analysis
NGS-Based Clinical Analysis
 

Plus de GigaScience, BGI Hong Kong

IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...GigaScience, BGI Hong Kong
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteGigaScience, BGI Hong Kong
 
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...GigaScience, BGI Hong Kong
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...GigaScience, BGI Hong Kong
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...GigaScience, BGI Hong Kong
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...GigaScience, BGI Hong Kong
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...GigaScience, BGI Hong Kong
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...GigaScience, BGI Hong Kong
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...GigaScience, BGI Hong Kong
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixGigaScience, BGI Hong Kong
 
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserAnil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserGigaScience, BGI Hong Kong
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...GigaScience, BGI Hong Kong
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceGigaScience, BGI Hong Kong
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...GigaScience, BGI Hong Kong
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...GigaScience, BGI Hong Kong
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveGigaScience, BGI Hong Kong
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...GigaScience, BGI Hong Kong
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...GigaScience, BGI Hong Kong
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...GigaScience, BGI Hong Kong
 

Plus de GigaScience, BGI Hong Kong (20)

IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...IDW2022: A decades experiences in transparent and interactive publication of ...
IDW2022: A decades experiences in transparent and interactive publication of ...
 
Scott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByteScott Edmunds: Preparing a data paper for GigaByte
Scott Edmunds: Preparing a data paper for GigaByte
 
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
STM Week: Demonstrating bringing publications to life via an End-to-end XML p...
 
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
Measuring richness. A RCT to quantify the benefits of metadata quality; Scott...
 
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
Scott Edmunds: A new publishing workflow for rapid dissemination of genomes u...
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
Scott Edmunds talk at IARC: How can we make science more trustworthy and FAIR...
 
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...PAGAsia19 - The Digitalization of Ruili Botanical Garden Project:  Production...
PAGAsia19 - The Digitalization of Ruili Botanical Garden Project: Production...
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10Hong Kong Open Access & GigaScience: CCHK@10
Hong Kong Open Access & GigaScience: CCHK@10
 
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU GuixRicardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
Ricardo Wurmus: Reproducible genomics analysis pipelines with GNU Guix
 
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browserAnil Thanki at #ICG13: Aequatus: An open-source homology browser
Anil Thanki at #ICG13: Aequatus: An open-source homology browser
 
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
Paul Pavlidis at #ICG13: Monitoring changes in the Gene Ontology and their im...
 
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant scienceVenice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
Venice Juanillas at #ICG13: Rice Galaxy: an open resource for plant science
 
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
Stefan Prost at #ICG13: Genome analyses show strong selection on coloration, ...
 
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
 

Dernier

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 

Dernier (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 

Jie Zheng at #ICG12: PhenoSpD: an atlas of phenotypic correlations and a multiple testing correction for the human phenome

  • 1. PhenoSpD: an integrated toolkit for phenotypic correlation estimation and multiple testing correction using GWAS summary statistics Jie Zheng The 12th International Conference on Genomics 27th Oct 2017
  • 2. One belt one road Bristol
  • 3. An invitation from GIGA Science and BGI
  • 4. Phenome wide association study (PheWAS) • PheWAS analyzes many phenotypes compared to a single or multiple genetic variant(s). • PheWAS is common place, e.g. • MR-PheWAS. Millard et al, Sci Rep, 2015 • Haycock et al, JAMA Oncology, 2017 It is likely that longer telomeres increase risk for several cancers but reduce risk for some non- neoplastic diseases, including cardiovascular diseases.
  • 5. Post GWAS era: a database of harmonized GWAS summary data in MRC Integrative Epidemiology Unit in Bristol
  • 6. The network of post GWAS analysis software Centralized Database PhenoSpD MR-Base LD Hub
  • 7. LD Hub for LD Score Regression Univariate analysis: SNP heritability 0 20 40 60 80 100 0246810 LD Score Chisquare Bivariate analysis: genetic correlations
  • 9. Scope of LD Hub LD Hub Database 233 publicly available GWAS traits Test Center: On-the-fly LD score regression analysis pipeline Lookup Center: Existing LD score regression results lookup GWAShare Center: Summary data sharing & user contribution
  • 10. MR-Base for Mendelian randomization SNPs Trait 1 Confounders Trait 2 Trait 1 = risk factor (exposure) Trait 2 = disease (outcome)
  • 11. Two sample Mendelian randomization
  • 13. Scope of MR-Base MR-Base SNP lookups 12 two-sample MR methodologies MR-Base R- package Database ~2000 GWAS (1100 with full data)
  • 14. PhenoSpD: why we need it? • Molecular phenotypes such as metabolites are highly correlated. • Multiple testing correction is a headache problem: Bonferroni correction is definitely over killed. • When individual-level phenotype data is available, phenotypic correlation matrix can be calculated easily. • However, in real world, phenotype data is normally not available. • In MR-Base / LD Hub, we only have GWAS summary statistics. • We need a magic hand to correct multiple testing! Wurtz et al, J Am Coll Cardiol. 2013
  • 15. PhenoSpD: how it works 1. Harmonize GWAS summary statistics 2. Estimate phenotypic correlation matrix using metaCCA / LD score regression 3. Apply Spectral decomposition (SpD) to estimate the equivalent number of independent variables in the phenotypic correlation matrix
  • 16. MetaCCA • Summary statistics-based multivariate association testing using canonical correlation analysis – Cichonska et al Bioinformatics 2016 • As a sub-product, it provides a way to estimate phenotypic correlation matrix 𝑌𝑌, which is equal to the Pearson correlation between regression coefficients (betas) of two GWASs • The assumption is, both traits are from the same samples • PS: 1000 Genomes is not the best option to estimate LD matrix between SNPs. See Benner et al AJHG 2017, and LDstore
  • 17. LD score regression • Method to estimate SNP heritability and genetic correlations -- Bulik-Sullivan et al NG 2014, 2015 • It is also provides a way to estimate phenotypic correlations between two traits, which is the intercept term of the bi-variate LD score regression. • Compare to metaCCA, it adjusted for sample overlap automatically • Both genetic and phenotypic correlation matrixes can be found in LD Hub
  • 18. SNPSpD and MatSpD • SNPSpD: A simple correction for multiple testing for SNPs in LD using spectral decomposition (SpD). Nyholt 2004 AJHG • MatSpD: MatrixSpD, estimate the equivalent number of independent variables in a correlation (r) matrix • The same method can be used to estimate the number of independent variables in a phenotypic correlation matrix
  • 19. Simulation • How accurate is the phenotypic correlation estimation using GWAS results? • Is there any parameters strongly affecting such estimation?
  • 20. Model N_ind_A N_in d_B N_overlap Overlap_% N_SNPs N_EnvF N_simu y1_y2_A_obs y1_y2_B_obs Mean_y1_y2_est SD_y1_y2_est Deviation_obs_est (%) sample size 1 300 300 150 50% 1000 100 100 -0.70 -0.70 -0.46 0.56 34.1% sample size 2 500 500 250 50% 1000 100 100 -0.71 -0.70 -0.47 0.56 33.0% sample size 3 1000 1000 500 50% 1000 100 100 -0.70 -0.70 -0.47 0.54 33.3% sample size 4 3000 3000 1500 50% 1000 100 100 -0.70 -0.70 -0.46 0.54 33.6% sample size 5 5000 5000 2500 50% 1000 100 100 -0.70 -0.70 -0.47 0.54 33.2% sample size 6 10000 10000 5000 50% 1000 100 100 -0.71 -0.71 -0.47 0.54 33.9% sample overlap 1 5000 5000 1000 10% 1000 100 100 -0.70 -0.70 -0.13 0.39 82.1% sample overlap 2 5000 5000 2000 20% 1000 100 100 -0.70 -0.70 -0.23 0.47 67.2% sample overlap 3 5000 5000 3000 30% 1000 100 100 -0.71 -0.71 -0.33 0.47 54.0% sample overlap 4 5000 5000 4000 40% 1000 100 100 -0.71 -0.71 -0.40 0.51 43.2% sample overlap 5 5000 5000 5000 50% 1000 100 100 -0.71 -0.71 -0.47 0.53 33.3% sample overlap 6 5000 5000 6000 60% 1000 100 100 -0.71 -0.71 -0.53 0.59 25.2% sample overlap 7 5000 5000 7000 70% 1000 100 100 -0.70 -0.70 -0.58 0.57 17.5% sample overlap 8 5000 5000 8000 80% 1000 100 100 -0.70 -0.70 -0.62 0.65 11.4% sample overlap 9 5000 5000 9000 90% 1000 100 100 -0.71 -0.71 -0.67 0.67 5.8% unbalance sample 1 5000 5000 9000 90% 1000 100 100 -0.71 -0.71 -0.67 0.67 5.8% unbalance sample 2 5000 6000 9000 82% 1000 100 100 -0.71 -0.71 -0.64 0.66 9.8% unbalance sample 3 5000 8000 9000 69% 1000 100 100 -0.70 -0.70 -0.58 0.65 17.3% unbalance sample 4 5000 10000 9000 60% 1000 100 100 -0.70 -0.70 -0.54 0.62 22.9% unbalance sample 5 5000 13000 9000 50% 1000 100 100 -0.70 -0.70 -0.48 0.54 30.9% number of SNPs 1 5000 5000 2500 50% 10 100 100 -0.70 -0.70 -0.44 0.73 38.1% number of SNPs 2 5000 5000 2500 50% 100 100 100 -0.70 -0.70 -0.48 0.53 34.1% number of SNPs 3 5000 5000 2500 50% 500 100 100 -0.70 -0.70 -0.47 0.53 34.3% number of SNPs 4 5000 5000 2500 50% 1000 100 100 -0.71 -0.70 -0.47 0.56 33.5% number of SNPs 5 5000 5000 2500 50% 5000 100 100 -0.70 -0.70 -0.47 0.55 33.6% number of SNPs 6 5000 5000 2500 50% 10000 100 100 -0.71 -0.71 -0.47 0.59 33.7%
  • 21. Accuracy tests using real data The estimated phenotypic correlations have good agreement with observed phenotypic correlations The exceptions are traits with limited sample size (therefore limited sample overlap). • Shin et al provided the observed phenotypic correlation matrix for 452 metabolites, which can be used as a test dataset • So we compared the observed phenotypic correlation with the estimated phenotypic correlation using PhenoSpD.
  • 22. Growth importance of PhenoSpD • PhenoSpD is particularly useful for multiple GWASs from the same samples, e.g. complex molecular traits such as metabolites and cytokines • It can also be applied to all traits in MR-Base / LD Hub, which we can split traits into groups, e.g. all traits in GIANT consortium are highly possible to be correlated and majority of them are from the same sample
  • 23. Real case application in MR-Base and LD Hub Consortium / First author Category N_traits N_SNPs N_correlations N_independent_traits Kettunen Blood metabolites 123 9826292 7503 44.9 Shin Metaoblites 451 2482345 101475 324.4 Roederer Immune system phenotypes 151 1585187 11325 94.2 CARDIOGRAM 2 335391 1 1 TRICL 4 335391 6 3 TAG 4 1449634 6 3.98 SSGAC 7 1449634 21 6 PGC 4 335391 6 3.644 Leptin 2 1449634 1 1 MAGIC 16 1449634 120 11.098 IIBDGC 3 335391 3 2 Hrgene 8 1449634 28 7 HaemGen 6 1449634 15 5 GPC 6 1449634 15 5 GLGC 4 1449634 6 3 GIANT 15 1449634 105 10.1097 GEFOS 3 1449634 3 3 CKDGen 9 335391 36 8 EGG 4 1449634 6 4 GIS 2 2029112 1 1 GUGC 2 2449580 1 1 ENIGMA 7 7237736 21 6 UK Biobank 5 9440243 9 5 Others 24 / / 24 All 862 / 120713 577.3317 Number of independent traits in MR-Base Consortium / First author Category N_traits N_SNPs N_correlations N_independent_traits All traits All traits 221 / 24310 134.1167 Number of independent traits in LD Hub
  • 24. Growth importance of PhenoSpD • There is a great potential to apply PhenoSpD to multiple traits in large scale biobanks and cohorts such as UK Biobank, China Kadoorie Biobank, HUNT study (all traits in one sample)
  • 25. UK Biobank release from Ben Neale’s group • RAPID GWAS OF THOUSANDS OF PHENOTYPES FOR 337,000 SAMPLES IN THE UK BIOBANK (http://www.nealelab.is/blog/2017/7/ 19/rapid-gwas-of-thousands-of- phenotypes-for-337000-samples-in- the-uk-biobank) • GWAS summary statistics of 337,000 European samples are available for over 2,400 human traits, everyone can access and download the results. • ~600 traits are heritable, which are the most valuable data
  • 26. PhenoSpD application • Assess the potential causal relationship between genetic variation, DNA methylation and 139 complex traits. • PhenoSpD: 139 outcomes  62 independent outcomes Hypothesis free MR of DNA methylation on 139 human traits
  • 27. Links for PhenoSpD • PhenoSpD Paper is on bioRxiv: https://www.biorxiv.org/content/early/2017/07/25/148627 • R scripts of PhenoSpD can be found on MRC-IEU github: https://github.com/MRCIEU/PhenoSpD • LD Hub: http://ldsc.broadinstitute.org/ldhub/ • MR-Base: www.mrbase.org
  • 28. Acknowledgements • LD Hub team • Jie Zheng • David M Evans • Benjamin Neale • MR-Base team • Gibran Hemani • Jie Zheng • George Davey Smith • Tom Gaunt • Philip Haycock • PhenoSpD team • Jie Zheng • Tom Richardson • Louise Millard • Gibran Hemani • Chris Raistrick • Bjarni Vilhjalmsson • Philip Haycock • Tom Gaunt
  • 29. Q & A Thank you! Questions welcomed