SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
Joaquín Dopazo 
Computational Genomics Department, 
Centro de Investigación Príncipe Felipe (CIPF), 
Functional Genomics Node, (INB), 
Bioinformatics Group (CIBERER) and 
Medical Genome Project, 
Spain. 
From reads to pathways for efficient disease gene finding 
http://bioinfo.cipf.es 
http://www.medicalgenomeproject.com 
http://www.babelomics.org 
http://www.hpc4g.org 
@xdopazo 
6th Annual Next Generation Sequencing Congress, November 21st, 2014
Odds Ratio: 3.6 
95% CI = 1.3 to 10.4 
The dawn of genomic big data. 
Candidate gene studies using GWAS
NHGRI GWA Catalog 
www.genome.gov/GWAStudies 
Published Genome-Wide Associations 
By the time of the completion of the human genome sequence, in 2005, just a few genetic variants were known to be significantly associated to diseases. 
When the first exhaustive catalogue of GWAS was compiled, in 2008, only three years later, more than 500 single nucleotide polymorphisms (SNPs) were associated to traits. 
Today, the catalog has collected more than 1,900 papers reporting 14,012 SNPs significantly associated to more than 1,500 traits.
Lessons learned from GWAS 
•Many loci/variants contribute to complex-trait variation 
•There is evidence for pleiotropy, i.e., that the same loci/variants are associated with multiple traits. 
•Much of the heritability of the trait cannot be explained by the individual loci/variants found associated to the trait. 
Visscher et al, 2013 AJHG
The missing heritability problem: individual genes cannot explain the heritability of traits 
Where did the heritability go? 
How to explain this problem? 
Rare Variants, rare CNVs, epigenetics or.. epistatic effects?
If rare variants eluded detection because were under represented among the SNPs, genomic sequencing would reveal them. 
http://www.genome.gov/sequencingcosts/
The principle: comparison of patients to reference controls or segregation within families 
A 
B 
C 
D 
A 
B 
C 
D 
Cases 
Controls 
Segregation within a pedigree
Variant/gene prioritization by heuristic filtering 
Variant level 
Potential impact of the variant 
Population frequencies 
Experimental design level 
Family(es) 
Trios 
Case / control 
Functional (system) level 
Gene set 
Network analysis 
Pathway analysis 
Testing strategies 
Control of sequencing errors (missing values)
Successive Filtering approach An example with 3-Methylglutaconic aciduria syndrome 
3-Methylglutaconic aciduria (3-MGA-uria) is a heterogeneous group of syndromes characterized by an increased excretion of 3- methylglutaconic and 3-methylglutaric acids. 
WES with a consecutive filter approach is enough to detect the new mutation in this case.
Exome sequencing has been systematically used to identify Mendelian disease genes
Limitations of the single-gene approach Can disease-related variants be so easily detected? 
a)Interrogating 60Mb sites (exomes) produces too many variants. Many of these segregating with our experimental design 
b)There is a non-negligible amount of apparently deleterious variants with no pathologic effect 
c)In many cases we are not targeting rare but common variants (which occur in normal population) 
d)In many cases only one variant does not explain the disease but rather a combination of them Result: variants found associated to the disease usually account for a small portion of the trait heritability
How to explain missing heritability? 
Rare Variants, rare CNVs, epigenetics or.. epistatic effects? 
Is the heritability really missing or are we looking at the wrong place? 
At the end, most of the heritability was there…
At the crossroad: how detection power 
of genomic technologies can be increased? 
There are 
two (non 
mutually 
exclusive) 
ways 
Scaling up: by increasing sample size. 
It is known that larger size allows detecting 
more individual gene (biomarker) 
associations. 
Limitations: Budget, patients availability and 
the own nature of the disease. 
Changing the perspective: systems 
approach to understand variation 
Interactions, multigenicity can be better 
detected and the role of variants understood 
in the context of disease mechanism. 
Limitations: Available information
Clear individual gene associations are difficult to find for most diseases 
Affected cases in complex diseases actually conform a heterogeneous population with different mutations (or combinations). 
Many cases and controls are needed to obtain a few significant associations (more prevalent, often population-specific variants, non-reproducible). 
The only common element is the (know or unknown) functional module affected. 
Disease understood as the failure of a functional module 
Cases 
Controls
Same disease in different populations is caused by different genes affecting the same functions 
Number of genes (blue line) and GO terms (red line) common to the four Consortium populations studied.
Modular nature of human genetic diseases 
•With the development of systems biology, studies have shown that phenotypically similar diseases are often caused by functionally related genes, being referred to as the modular nature of human genetic diseases (Oti and Brunner, 2007; Oti et al, 2008). 
•This modularity suggests that causative genes for the same or phenotypically similar diseases may generally reside in the same biological module, either a protein complex (Lage et al, 2007), a sub-network of protein interactions (Lim et al, 2006) , or a pathway (Wood et al, 2007) 
Disease 
Functional module 
Failure 
Evidence 
Monogenic 
One gene 
one LoF 
Association test 
Complex 
Several genes 
Several LoF 
Test on module functionality 
Burden (if unknown)
How to test functional module failure. A simple approach 
SNPs WES/WGS 
Gene expression 
AND/OR 
Gene Ontology: define sets of genes functionally related
Analysis of SNP biomarkers in a GWAS 
GWAS in Breast Cancer. The CGEMS initiative. (Hunter et al. Nat Genet 2007) 1145 cases 1142 controls. Affy 500K Conventional association test reports only significant 4 SNPs mapping only on one gene: FGFR2 Conventional tests are not providing much resolution. What is the problem with them? Are there solutions?
GSEA of the same GWAS data 
Breast Cancer 
CGEMS initiative. (Hunter et al. Nat 
Genet 2007) 
1145 cases 1142 controls. Affy 500K 
Only 4 SNPs were significantly associated, mapping only in one gene: FGFR2 
Bonifaci et al., BMC Medical Genomics 2008; Medina et al., 2009 NAR 
PBA reveals 19 GO categories including regulation of signal transduction (FDR-adjusted p-value=4.45x10-03) in which FGFR2 is included.
GO processes significantly associated to breast cancer 
Rho pathway 
Chromosomal instability 
Metastasis
From single gene to module approach 
mutations, gene expression, etc. 
GO 
Detection power 
Low (only very prevalent genes) 
High 
Information coverage 
Almost all 
Almost all 
Finds 
Gene 
Illustrative, give hints
Defining functional modules with the interactome 
SNPs WES/WGS 
Gene expression 
Using protein interaction networks as an scaffold to interpret the genomic data in a functionally-derived context 
AND/OR 
What part of the interactome is active and/or is damaged
CHRNA7 (rs2175886 p = 0.000607) 
IQGAP2 (rs950643 p = 0.0003585) 
DLC1 (rs1454947 p = 0.007526) 
SNPs validated in independent cohorts 
Network analysis helps to find disease genes
From single gene to module approach 
mutations, gene expression, etc. 
GO 
Protein interaction networks 
Detection power 
Low (only very prevalent genes) 
High 
High 
Information coverage 
Almost all 
Almost all 
Less (~9000 genes in human) 
Finds 
Gene 
Illustrative, give hints 
Genes* 
*Need of extra information (e.g. GO) to provide functional insights in the findings
Defining functional modules using signalling pathways 
SNPs 
WES/WGS 
Gene expression 
We only need to estimate the probabilities of gene activation and integrity and then calculate the probability for each circuit of being switched on or off. 
AND/OR
Defining real functional modules 
Transforming gene expression / mutation measurements into another value that accounts for a cell functionality. 
Easiest example of modeling function: signaling pathways. 
Function: transmission of a signal from a receptor to an effector 
Activations and repressions occur 
Receptor 
Effector
A 
B 
A 
B 
B 
D 
C 
E 
F 
A 
G 
P(A→G activated) = P(A)P(B)P(D)P(F)P(G) + P(A)P(C)P(E)P(F)P(G) - P(A)P(F)P(G)P(B)P(C)P(D)P(E) 
Prob. = [1-P(A activated)]P(B activated) 
Prob. = P(A activated)P(B activated) 
Activation 
Inhibition 
Sub-pathway 
Modeling pathways
Modeling pathways 
We only need to estimate the probabilities of gene activation and then calculate the probability for each circuit of being active. 
And the probability for each circuit of the pathway of being active can be calculated as well: 
Gene expression A large dataset of Affymetrix microarrays (10,000) is used to adjust a mixture of distributions of gene activity for every gene. Then, the activation state of any gene from a new microarray can be calculated as a probability: 
ON 
OFF
What would you predict about the consequences of gene activity changes in the apoptosis pathway in a case control experiment of colorectal cancer? 
The figure shows the gene up-regulations (red) and down- regulations (blue) 
The effects of changes in gene activity over pathway functionality are not obvious
Apoptosis inhibition is not obvious from gene expression 
Two of the three possible sub- pathways leading to apoptosis are inhibited in colorectal cancer. Upper panel shows the inhibited sub-pathways in blue. Lower panel shows the actual gene up- regulations (red) and down- regulations (blue) that justify this change in the activity of the sub- pathways
Different pathways cross-talk to deregulate programmed death in Fanconi anemia 
FA is a rare chromosome instability syndrome characterized by aplastic anemia and cancer and leukemia susceptibility. It has been proposed that disruption of the apoptotic control, a hallmark of FA, accounts for part of the phenotype of the disease. 
No proliferation 
No degradation 
Survival 
No degradation 
No apoptosis 
Activation apoptosis pathway
Sensitivity and specificity of model results 
Simulated datasets with equal random gene expression values with noise added 
Real dataset of pediatric acute myeloid leukemia with gene expression microarray data of 237 samples 
Dataset 
KNN 
RF 
SVM 
Accuracy 
0.86-0.89 
0.90-0.92 
0.99 
Breast cancer 
MCC 
0.74-0.79 
0.81-0.83 
0.98 
RMSC 
0.29-0.32 
0.31 
0.04 
AUC 
0.97-0.98 
0.98 
0.99 
Accuracy 
0.88-0.89 
0.92-0.95 
0.96 
AML 
MCC 
0.76-0.78 
0.84-0.90 
0.92-0.93 
RMSC 
0.27-0.30 
0.31 
0.10-0.11 
AUC 
0.94 
0.98 
0.96 
Circuit activities used for disease class prediction. Accuracy is evaluated by 10-fold cross-validation. A good prediction is a proxy for sensitivity (low type II error) 
Absence of false positives (high specificity low type I error)
From gene-based to function-based biomarkers 
Mutations, gene expression, etc. 
GO 
Protein interaction networks 
Models of cellular functions 
Detection power 
Low (only very prevalent genes) 
High 
High 
Very high 
Information coverage 
Almost all 
Almost all 
Moderate (~10000 genes in human) 
Low (~6700 genes in human)* 
Finds 
Genes 
Illustrative, give hints 
Genes 
Genes that explain disease mechanism 
*Only ~800 genes in human signaling pathways
Software development 
See interactive map of for the last 24h use http://bioinfo.cipf.es/toolsusage 
Babelomics is the third most cited tool for functional analysis. Includes more than 30 tools for advanced, systems-biology based data analysis 
More than 150.000 experiments were analyzed in our tools during the last year 
HPC on CPU, SSE4, GPUs on NGS data processing 
Speedups up to 40X 
Genome maps is now part of the ICGC data portal 
Ultrafast genome viewer with google technology 
Mapping 
Visualization 
Functional analysis 
Variant annotation 
CellBase 
Knowledge database 
Variant prioritization 
NGS panels 
Signaling network 
Regulatory network 
Interaction network 
Diagnostic
The Computational Genomics Department at the Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain, and… 
...the INB, National Institute of Bioinformatics (Functional Genomics Node) and the BiER (CIBERER Network of Centers for Rare Diseases) 
@xdopazo 
@bioinfocipf

Contenu connexe

Tendances

Forum on Personalized Medicine: Challenges for the next decade
Forum on Personalized Medicine: Challenges for the next decadeForum on Personalized Medicine: Challenges for the next decade
Forum on Personalized Medicine: Challenges for the next decadeJoaquin Dopazo
 
The trivial case of the missing heritability
The trivial case of the missing heritabilityThe trivial case of the missing heritability
The trivial case of the missing heritabilityMax Moldovan
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowKnome_Inc
 
Big Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineBig Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineSAP Technology
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineAmanda Natalizio
 
Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Nicole Vasilevsky
 
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...Antoaneta Vladimirova
 
What's In a Genotype?: An Ontological Characterization for the Integration of...
What's In a Genotype?: An Ontological Characterization for the Integration of...What's In a Genotype?: An Ontological Characterization for the Integration of...
What's In a Genotype?: An Ontological Characterization for the Integration of...mhb120
 
Cell Authentication By STR Profiling
Cell Authentication By STR ProfilingCell Authentication By STR Profiling
Cell Authentication By STR ProfilingCreative-Bioarray
 
Improved sensitivit
Improved sensitivitImproved sensitivit
Improved sensitivitt7260678
 
Array_nmeth.3507
Array_nmeth.3507Array_nmeth.3507
Array_nmeth.3507Aana Hahn
 
Invicta eshre-poster-pregnancy rate after frozen blastocyst
Invicta eshre-poster-pregnancy rate after frozen blastocystInvicta eshre-poster-pregnancy rate after frozen blastocyst
Invicta eshre-poster-pregnancy rate after frozen blastocystINVICTA GENETICS
 
Clin Cancer Res-2012-Lechner-4549-59
Clin Cancer Res-2012-Lechner-4549-59Clin Cancer Res-2012-Lechner-4549-59
Clin Cancer Res-2012-Lechner-4549-59Karolina Megiel
 
2015 functional genomics variant annotation and interpretation- tools and p...
2015 functional genomics   variant annotation and interpretation- tools and p...2015 functional genomics   variant annotation and interpretation- tools and p...
2015 functional genomics variant annotation and interpretation- tools and p...Gabe Rudy
 
Bacterial Pathogen Genomics at NCBI
Bacterial Pathogen Genomics at NCBIBacterial Pathogen Genomics at NCBI
Bacterial Pathogen Genomics at NCBInist-spin
 
American gut microbiome-2
American gut microbiome-2American gut microbiome-2
American gut microbiome-2JaclynW
 

Tendances (20)

Forum on Personalized Medicine: Challenges for the next decade
Forum on Personalized Medicine: Challenges for the next decadeForum on Personalized Medicine: Challenges for the next decade
Forum on Personalized Medicine: Challenges for the next decade
 
The trivial case of the missing heritability
The trivial case of the missing heritabilityThe trivial case of the missing heritability
The trivial case of the missing heritability
 
Big Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey NislowBig Data and Genomic Medicine by Corey Nislow
Big Data and Genomic Medicine by Corey Nislow
 
JALANov2000
JALANov2000JALANov2000
JALANov2000
 
Overview of the ECDC whole genome sequencing strategy
Overview of the ECDC whole genome sequencing strategyOverview of the ECDC whole genome sequencing strategy
Overview of the ECDC whole genome sequencing strategy
 
Big Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized MedicineBig Data Challenges for Real-Time Personalized Medicine
Big Data Challenges for Real-Time Personalized Medicine
 
Gardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in MedicineGardner and Song_2015_Genetics in Medicine
Gardner and Song_2015_Genetics in Medicine
 
Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015Monarch Initiative Poster - Rare Disease Symposium 2015
Monarch Initiative Poster - Rare Disease Symposium 2015
 
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
A Retrospective Analysis of Exome Sequencing Cases Using the GenePool™ Genomi...
 
What's In a Genotype?: An Ontological Characterization for the Integration of...
What's In a Genotype?: An Ontological Characterization for the Integration of...What's In a Genotype?: An Ontological Characterization for the Integration of...
What's In a Genotype?: An Ontological Characterization for the Integration of...
 
Cell Authentication By STR Profiling
Cell Authentication By STR ProfilingCell Authentication By STR Profiling
Cell Authentication By STR Profiling
 
Improved sensitivit
Improved sensitivitImproved sensitivit
Improved sensitivit
 
Array_nmeth.3507
Array_nmeth.3507Array_nmeth.3507
Array_nmeth.3507
 
Invicta eshre-poster-pregnancy rate after frozen blastocyst
Invicta eshre-poster-pregnancy rate after frozen blastocystInvicta eshre-poster-pregnancy rate after frozen blastocyst
Invicta eshre-poster-pregnancy rate after frozen blastocyst
 
Clin Cancer Res-2012-Lechner-4549-59
Clin Cancer Res-2012-Lechner-4549-59Clin Cancer Res-2012-Lechner-4549-59
Clin Cancer Res-2012-Lechner-4549-59
 
2015 functional genomics variant annotation and interpretation- tools and p...
2015 functional genomics   variant annotation and interpretation- tools and p...2015 functional genomics   variant annotation and interpretation- tools and p...
2015 functional genomics variant annotation and interpretation- tools and p...
 
Proposal for 2016 survey of WGS capacity in EU/EEA Member States
Proposal for 2016 survey of WGS capacity in EU/EEA Member StatesProposal for 2016 survey of WGS capacity in EU/EEA Member States
Proposal for 2016 survey of WGS capacity in EU/EEA Member States
 
Bacterial Pathogen Genomics at NCBI
Bacterial Pathogen Genomics at NCBIBacterial Pathogen Genomics at NCBI
Bacterial Pathogen Genomics at NCBI
 
Mohammed I Khan
Mohammed I KhanMohammed I Khan
Mohammed I Khan
 
American gut microbiome-2
American gut microbiome-2American gut microbiome-2
American gut microbiome-2
 

Similaire à From reads to pathways for efficient disease gene finding

Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategiesAshfaq Ahmad
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.Varsha Gayatonde
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayStefanie Yang
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
 
Open Source Pharma /Genomics and clinical practice / Prof Hosur
Open Source Pharma /Genomics and clinical practice / Prof Hosur Open Source Pharma /Genomics and clinical practice / Prof Hosur
Open Source Pharma /Genomics and clinical practice / Prof Hosur opensourcepharmafound
 
Poster_NHGRI_DahliaShvets.ver2
Poster_NHGRI_DahliaShvets.ver2Poster_NHGRI_DahliaShvets.ver2
Poster_NHGRI_DahliaShvets.ver2Dahlia Shvets
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data setsimprovemed
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease mapsJoaquin Dopazo
 
gene mapping, clonning of disease gene(1).pptx
gene mapping, clonning of disease gene(1).pptxgene mapping, clonning of disease gene(1).pptx
gene mapping, clonning of disease gene(1).pptxRajesh Yadav
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation SequencingShelomi Karoon
 
Bioinformatics-driven discovery of EGFR mutant Lung Cancer
Bioinformatics-driven discovery of EGFR mutant Lung CancerBioinformatics-driven discovery of EGFR mutant Lung Cancer
Bioinformatics-driven discovery of EGFR mutant Lung CancerPreveenRamamoorthy
 
Verifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaVerifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaCharlotte Broadbent
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Ronak Shah
 
Detecting Somatic Mutation - Ensemble Approach
Detecting Somatic Mutation - Ensemble ApproachDetecting Somatic Mutation - Ensemble Approach
Detecting Somatic Mutation - Ensemble ApproachHong ChangBum
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009Sean Davis
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmKnome_Inc
 

Similaire à From reads to pathways for efficient disease gene finding (20)

Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
 
GWAS
GWASGWAS
GWAS
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.
 
GWAS Study.pdf
GWAS Study.pdfGWAS Study.pdf
GWAS Study.pdf
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research Essay
 
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...
 
Open Source Pharma /Genomics and clinical practice / Prof Hosur
Open Source Pharma /Genomics and clinical practice / Prof Hosur Open Source Pharma /Genomics and clinical practice / Prof Hosur
Open Source Pharma /Genomics and clinical practice / Prof Hosur
 
Poster_NHGRI_DahliaShvets.ver2
Poster_NHGRI_DahliaShvets.ver2Poster_NHGRI_DahliaShvets.ver2
Poster_NHGRI_DahliaShvets.ver2
 
INBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria LópezINBIOMEDvision Workshop at MIE 2011. Victoria López
INBIOMEDvision Workshop at MIE 2011. Victoria López
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Navigating through disease maps
Navigating through disease mapsNavigating through disease maps
Navigating through disease maps
 
gene mapping, clonning of disease gene(1).pptx
gene mapping, clonning of disease gene(1).pptxgene mapping, clonning of disease gene(1).pptx
gene mapping, clonning of disease gene(1).pptx
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Bioinformatics-driven discovery of EGFR mutant Lung Cancer
Bioinformatics-driven discovery of EGFR mutant Lung CancerBioinformatics-driven discovery of EGFR mutant Lung Cancer
Bioinformatics-driven discovery of EGFR mutant Lung Cancer
 
NGS-report-amir.pdf
NGS-report-amir.pdfNGS-report-amir.pdf
NGS-report-amir.pdf
 
Verifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic LeukemiaVerifying the role of AID in Chronic Lymphocytic Leukemia
Verifying the role of AID in Chronic Lymphocytic Leukemia
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...
 
Detecting Somatic Mutation - Ensemble Approach
Detecting Somatic Mutation - Ensemble ApproachDetecting Somatic Mutation - Ensemble Approach
Detecting Somatic Mutation - Ensemble Approach
 
OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
 

Plus de Joaquin Dopazo

Accelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideJoaquin Dopazo
 
Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Joaquin Dopazo
 
Taller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónTaller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónJoaquin Dopazo
 
From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...Joaquin Dopazo
 
A look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveA look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveJoaquin Dopazo
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodJoaquin Dopazo
 

Plus de Joaquin Dopazo (7)

Accelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwideAccelerating the benefits of genomics worldwide
Accelerating the benefits of genomics worldwide
 
Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...Differential metabolic activity and discovery of therapeutic targets using su...
Differential metabolic activity and discovery of therapeutic targets using su...
 
Taller Genómica y cáncer Introducción
Taller Genómica y cáncer IntroducciónTaller Genómica y cáncer Introducción
Taller Genómica y cáncer Introducción
 
From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...From empirical biomarkers to models of disease mechanisms in the transition t...
From empirical biomarkers to models of disease mechanisms in the transition t...
 
A look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspectiveA look at the human mutational load from the systems biology perspective
A look at the human mutational load from the systems biology perspective
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in blood
 
Big data genomico
Big data genomicoBig data genomico
Big data genomico
 

Dernier

DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxzaydmeerab121
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfReplisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfAtiaGohar1
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsDobusch Leonhard
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxfarhanvvdk
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterHanHyoKim
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
Quarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and FunctionsQuarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and FunctionsCharlene Llagas
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxzeus70441
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...HafsaHussainp
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11GelineAvendao
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosZachary Labe
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlshansessene
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptAmirRaziq1
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDivyaK787011
 

Dernier (20)

DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
well logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptxwell logging & petrophysical analysis.pptx
well logging & petrophysical analysis.pptx
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdfReplisome-Cohesin Interfacing A Molecular Perspective.pdf
Replisome-Cohesin Interfacing A Molecular Perspective.pdf
 
Science (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and PitfallsScience (Communication) and Wikipedia - Potentials and Pitfalls
Science (Communication) and Wikipedia - Potentials and Pitfalls
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarter
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
Quarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and FunctionsQuarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and Functions
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...
DOG BITE management in pediatrics # for Pediatric pgs# topic presentation # f...
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenarios
 
bonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girlsbonjourmadame.tumblr.com bhaskar's girls
bonjourmadame.tumblr.com bhaskar's girls
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.ppt
 
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdfDECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
DECOMPOSITION PATHWAYS of TM-alkyl complexes.pdf
 

From reads to pathways for efficient disease gene finding

  • 1. Joaquín Dopazo Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Functional Genomics Node, (INB), Bioinformatics Group (CIBERER) and Medical Genome Project, Spain. From reads to pathways for efficient disease gene finding http://bioinfo.cipf.es http://www.medicalgenomeproject.com http://www.babelomics.org http://www.hpc4g.org @xdopazo 6th Annual Next Generation Sequencing Congress, November 21st, 2014
  • 2. Odds Ratio: 3.6 95% CI = 1.3 to 10.4 The dawn of genomic big data. Candidate gene studies using GWAS
  • 3. NHGRI GWA Catalog www.genome.gov/GWAStudies Published Genome-Wide Associations By the time of the completion of the human genome sequence, in 2005, just a few genetic variants were known to be significantly associated to diseases. When the first exhaustive catalogue of GWAS was compiled, in 2008, only three years later, more than 500 single nucleotide polymorphisms (SNPs) were associated to traits. Today, the catalog has collected more than 1,900 papers reporting 14,012 SNPs significantly associated to more than 1,500 traits.
  • 4. Lessons learned from GWAS •Many loci/variants contribute to complex-trait variation •There is evidence for pleiotropy, i.e., that the same loci/variants are associated with multiple traits. •Much of the heritability of the trait cannot be explained by the individual loci/variants found associated to the trait. Visscher et al, 2013 AJHG
  • 5. The missing heritability problem: individual genes cannot explain the heritability of traits Where did the heritability go? How to explain this problem? Rare Variants, rare CNVs, epigenetics or.. epistatic effects?
  • 6. If rare variants eluded detection because were under represented among the SNPs, genomic sequencing would reveal them. http://www.genome.gov/sequencingcosts/
  • 7. The principle: comparison of patients to reference controls or segregation within families A B C D A B C D Cases Controls Segregation within a pedigree
  • 8. Variant/gene prioritization by heuristic filtering Variant level Potential impact of the variant Population frequencies Experimental design level Family(es) Trios Case / control Functional (system) level Gene set Network analysis Pathway analysis Testing strategies Control of sequencing errors (missing values)
  • 9. Successive Filtering approach An example with 3-Methylglutaconic aciduria syndrome 3-Methylglutaconic aciduria (3-MGA-uria) is a heterogeneous group of syndromes characterized by an increased excretion of 3- methylglutaconic and 3-methylglutaric acids. WES with a consecutive filter approach is enough to detect the new mutation in this case.
  • 10. Exome sequencing has been systematically used to identify Mendelian disease genes
  • 11. Limitations of the single-gene approach Can disease-related variants be so easily detected? a)Interrogating 60Mb sites (exomes) produces too many variants. Many of these segregating with our experimental design b)There is a non-negligible amount of apparently deleterious variants with no pathologic effect c)In many cases we are not targeting rare but common variants (which occur in normal population) d)In many cases only one variant does not explain the disease but rather a combination of them Result: variants found associated to the disease usually account for a small portion of the trait heritability
  • 12. How to explain missing heritability? Rare Variants, rare CNVs, epigenetics or.. epistatic effects? Is the heritability really missing or are we looking at the wrong place? At the end, most of the heritability was there…
  • 13. At the crossroad: how detection power of genomic technologies can be increased? There are two (non mutually exclusive) ways Scaling up: by increasing sample size. It is known that larger size allows detecting more individual gene (biomarker) associations. Limitations: Budget, patients availability and the own nature of the disease. Changing the perspective: systems approach to understand variation Interactions, multigenicity can be better detected and the role of variants understood in the context of disease mechanism. Limitations: Available information
  • 14. Clear individual gene associations are difficult to find for most diseases Affected cases in complex diseases actually conform a heterogeneous population with different mutations (or combinations). Many cases and controls are needed to obtain a few significant associations (more prevalent, often population-specific variants, non-reproducible). The only common element is the (know or unknown) functional module affected. Disease understood as the failure of a functional module Cases Controls
  • 15. Same disease in different populations is caused by different genes affecting the same functions Number of genes (blue line) and GO terms (red line) common to the four Consortium populations studied.
  • 16. Modular nature of human genetic diseases •With the development of systems biology, studies have shown that phenotypically similar diseases are often caused by functionally related genes, being referred to as the modular nature of human genetic diseases (Oti and Brunner, 2007; Oti et al, 2008). •This modularity suggests that causative genes for the same or phenotypically similar diseases may generally reside in the same biological module, either a protein complex (Lage et al, 2007), a sub-network of protein interactions (Lim et al, 2006) , or a pathway (Wood et al, 2007) Disease Functional module Failure Evidence Monogenic One gene one LoF Association test Complex Several genes Several LoF Test on module functionality Burden (if unknown)
  • 17. How to test functional module failure. A simple approach SNPs WES/WGS Gene expression AND/OR Gene Ontology: define sets of genes functionally related
  • 18. Analysis of SNP biomarkers in a GWAS GWAS in Breast Cancer. The CGEMS initiative. (Hunter et al. Nat Genet 2007) 1145 cases 1142 controls. Affy 500K Conventional association test reports only significant 4 SNPs mapping only on one gene: FGFR2 Conventional tests are not providing much resolution. What is the problem with them? Are there solutions?
  • 19. GSEA of the same GWAS data Breast Cancer CGEMS initiative. (Hunter et al. Nat Genet 2007) 1145 cases 1142 controls. Affy 500K Only 4 SNPs were significantly associated, mapping only in one gene: FGFR2 Bonifaci et al., BMC Medical Genomics 2008; Medina et al., 2009 NAR PBA reveals 19 GO categories including regulation of signal transduction (FDR-adjusted p-value=4.45x10-03) in which FGFR2 is included.
  • 20. GO processes significantly associated to breast cancer Rho pathway Chromosomal instability Metastasis
  • 21. From single gene to module approach mutations, gene expression, etc. GO Detection power Low (only very prevalent genes) High Information coverage Almost all Almost all Finds Gene Illustrative, give hints
  • 22. Defining functional modules with the interactome SNPs WES/WGS Gene expression Using protein interaction networks as an scaffold to interpret the genomic data in a functionally-derived context AND/OR What part of the interactome is active and/or is damaged
  • 23. CHRNA7 (rs2175886 p = 0.000607) IQGAP2 (rs950643 p = 0.0003585) DLC1 (rs1454947 p = 0.007526) SNPs validated in independent cohorts Network analysis helps to find disease genes
  • 24. From single gene to module approach mutations, gene expression, etc. GO Protein interaction networks Detection power Low (only very prevalent genes) High High Information coverage Almost all Almost all Less (~9000 genes in human) Finds Gene Illustrative, give hints Genes* *Need of extra information (e.g. GO) to provide functional insights in the findings
  • 25. Defining functional modules using signalling pathways SNPs WES/WGS Gene expression We only need to estimate the probabilities of gene activation and integrity and then calculate the probability for each circuit of being switched on or off. AND/OR
  • 26. Defining real functional modules Transforming gene expression / mutation measurements into another value that accounts for a cell functionality. Easiest example of modeling function: signaling pathways. Function: transmission of a signal from a receptor to an effector Activations and repressions occur Receptor Effector
  • 27. A B A B B D C E F A G P(A→G activated) = P(A)P(B)P(D)P(F)P(G) + P(A)P(C)P(E)P(F)P(G) - P(A)P(F)P(G)P(B)P(C)P(D)P(E) Prob. = [1-P(A activated)]P(B activated) Prob. = P(A activated)P(B activated) Activation Inhibition Sub-pathway Modeling pathways
  • 28. Modeling pathways We only need to estimate the probabilities of gene activation and then calculate the probability for each circuit of being active. And the probability for each circuit of the pathway of being active can be calculated as well: Gene expression A large dataset of Affymetrix microarrays (10,000) is used to adjust a mixture of distributions of gene activity for every gene. Then, the activation state of any gene from a new microarray can be calculated as a probability: ON OFF
  • 29. What would you predict about the consequences of gene activity changes in the apoptosis pathway in a case control experiment of colorectal cancer? The figure shows the gene up-regulations (red) and down- regulations (blue) The effects of changes in gene activity over pathway functionality are not obvious
  • 30. Apoptosis inhibition is not obvious from gene expression Two of the three possible sub- pathways leading to apoptosis are inhibited in colorectal cancer. Upper panel shows the inhibited sub-pathways in blue. Lower panel shows the actual gene up- regulations (red) and down- regulations (blue) that justify this change in the activity of the sub- pathways
  • 31. Different pathways cross-talk to deregulate programmed death in Fanconi anemia FA is a rare chromosome instability syndrome characterized by aplastic anemia and cancer and leukemia susceptibility. It has been proposed that disruption of the apoptotic control, a hallmark of FA, accounts for part of the phenotype of the disease. No proliferation No degradation Survival No degradation No apoptosis Activation apoptosis pathway
  • 32. Sensitivity and specificity of model results Simulated datasets with equal random gene expression values with noise added Real dataset of pediatric acute myeloid leukemia with gene expression microarray data of 237 samples Dataset KNN RF SVM Accuracy 0.86-0.89 0.90-0.92 0.99 Breast cancer MCC 0.74-0.79 0.81-0.83 0.98 RMSC 0.29-0.32 0.31 0.04 AUC 0.97-0.98 0.98 0.99 Accuracy 0.88-0.89 0.92-0.95 0.96 AML MCC 0.76-0.78 0.84-0.90 0.92-0.93 RMSC 0.27-0.30 0.31 0.10-0.11 AUC 0.94 0.98 0.96 Circuit activities used for disease class prediction. Accuracy is evaluated by 10-fold cross-validation. A good prediction is a proxy for sensitivity (low type II error) Absence of false positives (high specificity low type I error)
  • 33. From gene-based to function-based biomarkers Mutations, gene expression, etc. GO Protein interaction networks Models of cellular functions Detection power Low (only very prevalent genes) High High Very high Information coverage Almost all Almost all Moderate (~10000 genes in human) Low (~6700 genes in human)* Finds Genes Illustrative, give hints Genes Genes that explain disease mechanism *Only ~800 genes in human signaling pathways
  • 34. Software development See interactive map of for the last 24h use http://bioinfo.cipf.es/toolsusage Babelomics is the third most cited tool for functional analysis. Includes more than 30 tools for advanced, systems-biology based data analysis More than 150.000 experiments were analyzed in our tools during the last year HPC on CPU, SSE4, GPUs on NGS data processing Speedups up to 40X Genome maps is now part of the ICGC data portal Ultrafast genome viewer with google technology Mapping Visualization Functional analysis Variant annotation CellBase Knowledge database Variant prioritization NGS panels Signaling network Regulatory network Interaction network Diagnostic
  • 35. The Computational Genomics Department at the Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain, and… ...the INB, National Institute of Bioinformatics (Functional Genomics Node) and the BiER (CIBERER Network of Centers for Rare Diseases) @xdopazo @bioinfocipf