SlideShare une entreprise Scribd logo
1  sur  65
Télécharger pour lire hors ligne
Overview on co-expression network analysis Case study 1 Case study 2 References
Mining co-expression networks
Nathalie Villa-Vialaneix
http://www.nathalievilla.org
INRA, Unité MIA-T, INRA, Toulouse (France)
School for advanced sciences of Luchon
Network analysis and applications
NV2 | Mining co-expression networks 1/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Outline
1 Overview on co-expression network analysis
2 Case study 1: gene network analysis in relations with meat
quality
3 Case study 2: gene network analysis in LCD experiment
NV2 | Mining co-expression networks 2/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Outline
1 Overview on co-expression network analysis
2 Case study 1: gene network analysis in relations with meat
quality
3 Case study 2: gene network analysis in LCD experiment
NV2 | Mining co-expression networks 3/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Transcriptomic data
DNA transcripted into mRNA
to produce proteins
NV2 | Mining co-expression networks 4/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Transcriptomic data
DNA transcripted into mRNA
to produce proteins
transcriptomic data: measure
of the quantity of mRNA
corresponding to a given
gene in given cells (blood,
muscle...) of a living organism
NV2 | Mining co-expression networks 4/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Systems biology
Some genes’ expressions activate or repress other genes’
expressions ⇒ understanding the whole cascade helps to
comprehend the global functioning of living organisms1
1
Picture taken from: Abdollahi A et al., PNAS 2007, 104:12890-12895. c 2007
by National Academy of Sciences
NV2 | Mining co-expression networks 5/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Standard issues in network analysis
Inference
Giving expression data, how to build a graph whose edges
represent the direct links between genes?
Example: co-expression networks built from microarray/RNAseq data (nodes =
genes; edges = significant “direct links” between expressions of two genes)
NV2 | Mining co-expression networks 6/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Standard issues in network analysis
Inference
Giving expression data, how to build a graph whose edges
represent the direct links between genes?
Graph mining (examples)
1 Network visualization: nodes are not a priori given a position.
Random positions
Positions aiming at representing connected
nodes closer
NV2 | Mining co-expression networks 6/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Standard issues in network analysis
Inference
Giving expression data, how to build a graph whose edges
represent the direct links between genes?
Graph mining (examples)
1 Network visualization: nodes are not a priori given a position.
2 Important node extraction (high degree, high centrality...)
NV2 | Mining co-expression networks 6/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Standard issues in network analysis
Inference
Giving expression data, how to build a graph whose edges
represent the direct links between genes?
Graph mining (examples)
1 Network visualization: nodes are not a priori given a position.
2 Important node extraction (high degree, high centrality...)
3 Network clustering: identify “communities”
NV2 | Mining co-expression networks 6/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Network inference
Data: large scale gene expression data
individuals
n 30/50



X =


. . . . . .
. . X
j
i
. . .
. . . . . .


variables (genes expression), p 103/4
What we want to obtain: a graph/network with
nodes: genes (a selected sublist of interest2
; usually, DE
genes);
edges: “strong relations” between gene expressions.
2
See [Verzelen, 2012] for conditions on respective n/p suited for inference.
NV2 | Mining co-expression networks 7/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Advantages of this network model
1 over raw data: focuses on the strongest direct relationships:
irrelevant or indirect relations are removed (more robust) and
the data are easier to visualize and understand (track
transcription relations).
NV2 | Mining co-expression networks 8/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Advantages of this network model
1 over raw data: focuses on the strongest direct relationships:
irrelevant or indirect relations are removed (more robust) and
the data are easier to visualize and understand (track
transcription relations).
Expression data are analyzed all together and not by pairs
(systems model).
NV2 | Mining co-expression networks 8/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Advantages of this network model
1 over raw data: focuses on the strongest direct relationships:
irrelevant or indirect relations are removed (more robust) and
the data are easier to visualize and understand (track
transcription relations).
Expression data are analyzed all together and not by pairs
(systems model).
2 over bibliographic network: can handle interactions with yet
unknown (not annotated) genes and deal with data collected
in a particular condition.
NV2 | Mining co-expression networks 8/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Using correlations: relevance
network [Butte and Kohane, 1999, Butte and Kohane, 2000]
First (naive) approach: calculate correlations between expressions
for all pairs of genes, threshold the smallest ones and build the
network.
Correlations Thresholding Graph
NV2 | Mining co-expression networks 9/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Using partial correlations
strong indirect correlation
y z
x
NV2 | Mining co-expression networks 10/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Using partial correlations
strong indirect correlation
y z
x
set.seed(2807); x <- rnorm(100)
y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826
z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751
cor(y,z) [1] 0.9971105
NV2 | Mining co-expression networks 10/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Using partial correlations
strong indirect correlation
y z
x
set.seed(2807); x <- rnorm(100)
y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826
z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751
cor(y,z) [1] 0.9971105
Partial correlation
cor(lm(x∼z)$residuals,lm(y∼z)$residuals) [1] 0.7801174
cor(lm(x∼y)$residuals,lm(z∼y)$residuals) [1] 0.7639094
cor(lm(y∼x)$residuals,lm(z∼x)$residuals) [1] -0.1933699
NV2 | Mining co-expression networks 10/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Partial correlation and GGM
Gaussian Graphical Model framework:
(Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene
expression); then
j ←→ j (genes j and j are linked) ⇔ Cor Xj
, Xj
|(Xk
)k j,j 0
NV2 | Mining co-expression networks 11/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Partial correlation and GGM
Gaussian Graphical Model framework:
(Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene
expression); then
j ←→ j (genes j and j are linked) ⇔ Cor Xj
, Xj
|(Xk
)k j,j 0
If (concentration matrix) S = Σ−1
,
Cor Xj
, Xj
|(Xk
)k j,j = −
Sjj
SjjSj j
⇒ Estimate Σ−1
to unravel the graph structure
NV2 | Mining co-expression networks 11/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Partial correlation and GGM
Gaussian Graphical Model framework:
(Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene
expression); then
j ←→ j (genes j and j are linked) ⇔ Cor Xj
, Xj
|(Xk
)k j,j 0
If (concentration matrix) S = Σ−1
,
Cor Xj
, Xj
|(Xk
)k j,j = −
Sjj
SjjSj j
⇒ Estimate Σ−1
to unravel the graph structure
Problem: Σ: p-dimensional matrix and n p ⇒ (Σn
)−1
is a poor
estimate of S)!
NV2 | Mining co-expression networks 11/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Estimation in GGM
Graphical Gaussian Model estimation
seminal work:
[Schäfer and Strimmer, 2005a, Schäfer and Strimmer, 2005b] (with
shrinkage and a proposal for a Bayesian test of significance)
estimate Σ−1
by (Σn
+ λI)−1
use a Bayesian test to test which coefficients are significantly
non zero.
NV2 | Mining co-expression networks 12/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Estimation in GGM
Graphical Gaussian Model estimation
seminal work:
[Schäfer and Strimmer, 2005a, Schäfer and Strimmer, 2005b] (with
shrinkage and a proposal for a Bayesian test of significance)
estimate Σ−1
by (Σn
+ λI)−1
use a Bayesian test to test which coefficients are significantly
non zero.
sparse approaches:
[Meinshausen and Bühlmann, 2006, Friedman et al., 2008]: ∀ j,
estimate the linear models Xj
= βT
j
X−j
+ by penalized ML
arg min(βjj )j
n
i=1 Xij − βT
j
X
−j
i
2
+ λ βj L1 , with
βj L1 = j |βjj |, L1
penalty yields to βjj = 0 for most j
(variable selection)
NV2 | Mining co-expression networks 12/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Visualization
Purpose: How to display the nodes in a meaningful and aesthetic
way?
NV2 | Mining co-expression networks 13/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Visualization
Purpose: How to display the nodes in a meaningful and aesthetic
way?
Standard approach: force directed placement algorithms (FDP)
(e.g., [Fruchterman and Reingold, 1991])
NV2 | Mining co-expression networks 13/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Visualization
Purpose: How to display the nodes in a meaningful and aesthetic
way?
Standard approach: force directed placement algorithms (FDP)
(e.g., [Fruchterman and Reingold, 1991])
attractive forces: similar to springs along the edges
NV2 | Mining co-expression networks 13/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Visualization
Purpose: How to display the nodes in a meaningful and aesthetic
way?
Standard approach: force directed placement algorithms (FDP)
(e.g., [Fruchterman and Reingold, 1991])
attractive forces: similar to springs along the edges
repulsive forces: similar to electric forces between all pairs of
vertices
NV2 | Mining co-expression networks 13/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Visualization
Purpose: How to display the nodes in a meaningful and aesthetic
way?
Standard approach: force directed placement algorithms (FDP)
(e.g., [Fruchterman and Reingold, 1991])
iterative algorithm until stabilization of the vertex positions.
NV2 | Mining co-expression networks 13/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Important node extraction
1 vertex degree: number of edges adjacent to a given vertex.
Vertices with a high degree are called hubs: measure of the
vertex popularity.
NV2 | Mining co-expression networks 14/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Important node extraction
1 vertex degree: number of edges adjacent to a given vertex.
Vertices with a high degree are called hubs: measure of the
vertex popularity.
2 vertex betweenness: number of shortest paths between all
pairs of vertices that pass through the vertex. Betweenness is
a centrality measure (vertices with a large betweenness that are the
most likely to disconnect the network if removed).
The orange node’s degree is equal to 2, its betweenness to 4.
NV2 | Mining co-expression networks 14/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Vertex clustering
Cluster vertexes into groups that are densely connected and share
a few links (comparatively) with the other groups. Clusters are
often called communities (social sciences) or modules (biology).
NV2 | Mining co-expression networks 15/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Vertex clustering
Cluster vertexes into groups that are densely connected and share
a few links (comparatively) with the other groups. Clusters are
often called communities (social sciences) or modules (biology).
Several clustering methods:
min cut minimization minimizes the number of edges between clusters;
spectral clustering [von Luxburg, 2007] and kernel clustering
uses eigen-decomposition of the Laplacian
Lij =
−wij if i j
di otherwise
(matrix strongly related to the graph structure);
Generative (Bayesian) models [Zanghi et al., 2008];
Markov clustering simulate a flow on the graph;
modularity maximization
... (clustering jungle... see e.g., [Fortunato and Barthélémy, 2007,
Schaeffer, 2007, Brohée and van Helden, 2006])
NV2 | Mining co-expression networks 15/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Modularity optimization
The modularity [Newman and Girvan, 2004] of the partition
(C1, . . . , CK ) is equal to:
Q(C1, . . . , CK ) =
1
2m
K
k=1 xi,xj∈Ck
(Wij − Pij)
with Pij: weight of a “null model” (graph with the same degree
distribution but no preferential attachment):
Pij =
didj
2m
with di = 1
2 j i Wij.
NV2 | Mining co-expression networks 16/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Interpretation
A good clustering should maximize the modularity:
Q when (xi, xj) are in the same cluster and Wij Pij
Q when (xi, xj) are in two different clusters and Wij Pij
(m = 20)
Pij = 7.5
Wij = 5 ⇒ Wij − Pij = −2.5
di = 15 dj = 20
i and j in the same cluster decreases the modularity
NV2 | Mining co-expression networks 17/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Interpretation
A good clustering should maximize the modularity:
Q when (xi, xj) are in the same cluster and Wij Pij
Q when (xi, xj) are in two different clusters and Wij Pij
(m = 20)
Pij = 0.05
Wij = 5 ⇒ Wij − Pij = 4.95
di = 1 dj = 2
i and j in the same cluster increases the modularity
NV2 | Mining co-expression networks 17/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Interpretation
A good clustering should maximize the modularity:
Q when (xi, xj) are in the same cluster and Wij Pij
Q when (xi, xj) are in two different clusters and Wij Pij
Modularity
helps separate hubs ( spectral clustering or min cut criterion);
is not an increasing function of the number of clusters: useful
to choose the relevant number of clusters (with a grid search:
several values are tested, the clustering with the highest
modularity is kept) but modularity has a small resolution
default (see [Fortunato and Barthélémy, 2007])
NV2 | Mining co-expression networks 17/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Interpretation
A good clustering should maximize the modularity:
Q when (xi, xj) are in the same cluster and Wij Pij
Q when (xi, xj) are in two different clusters and Wij Pij
Modularity
helps separate hubs ( spectral clustering or min cut criterion);
is not an increasing function of the number of clusters: useful
to choose the relevant number of clusters (with a grid search:
several values are tested, the clustering with the highest
modularity is kept) but modularity has a small resolution
default (see [Fortunato and Barthélémy, 2007])
Main issue: Optimization = NP-complete problem (exhaustive
search is not not usable)
Different solutions are provided in [Newman and Girvan, 2004,
Blondel et al., 2008, Noack and Rotta, 2009, Rossi and Villa-Vialaneix, 2011]
(among others) and some of them are implemented in the R
package igraph.
NV2 | Mining co-expression networks 17/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Outline
1 Overview on co-expression network analysis
2 Case study 1: gene network analysis in relations with meat
quality
3 Case study 2: gene network analysis in LCD experiment
NV2 | Mining co-expression networks 18/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Dataset description
[Villa-Vialaneix et al., 2013]
F2: 1200 animals
muscle sampling
phenotypic measures (30)
(pH ...)
NV2 | Mining co-expression networks 19/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Dataset description
[Villa-Vialaneix et al., 2013]
F2: 1200 animals
muscle sampling
phenotypic measures (30)
(pH ...)
Used data: 57 F2 pigs (largest variability for PH); transcriptomic
data for 272 genes regulated by an eQTL
Problems with these particular data:
how to understand the relationships between these genes’
expression as their co-expression is weaker than between
other kind of genes (TF/genes, for instance)?
how to relate gene expression with a phenotype of interest
(muscle pH)?
NV2 | Mining co-expression networks 19/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inferred network description
Use of [Schäfer and Strimmer, 2005a]
Obtained network: 272 nodes (connected); Density: 6,4%;
Transitivity: 25,4%
NV2 | Mining co-expression networks 20/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inferred network description
Use of [Schäfer and Strimmer, 2005a]
Obtained network: 272 nodes (connected); Density: 6,4%;
Transitivity: 25,4%
degree distribution
Degrés
Frequency
5 10 15 20 25 30
010203040
NV2 | Mining co-expression networks 20/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inferred network description
Use of [Schäfer and Strimmer, 2005a]
Obtained network: 272 nodes (connected); Density: 6,4%;
Transitivity: 25,4%
8 genes both have high degree and high betweenness
BX921641; FTH1; TRIAP1; SLC9A14; GPI; SUZ12; MGP; PRDX4
and several have been identified by the biologist as relevant to
meat quality.
NV2 | Mining co-expression networks 20/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Clustering
clustering with modularity optimization: 7 clusters
for each cluster, annotated genes submitted to IPA3
(bibliographic network database): from 71% to 94% of the
genes of a single cluster belong to the same IPA network with
a biological function associated
3
https://analysis.ingenuity.com/pa
NV2 | Mining co-expression networks 21/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Relation to muscle pH
model: label each node of the network with its partial correlation to
the muscle pH.
Questions: is there a relation between muscle pH and network
structure? is there a relation between clustering and muscle pH?
NV2 | Mining co-expression networks 22/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Relation to muscle pH
model: label each node of the network with its partial correlation to
the muscle pH.
Moran’s I (used in spatial statistics): I =
1
2m i j wij¯ci¯cj
1
n i ¯c2
i
, where
m = 1
2 i j Wij and ci is the partial correlation with pH, ¯ci = ci − ¯c
with ¯c = 1
n i ci. Using a MC simulation (edge permutations):
Moran's I
Frequency
−0.05 0.00 0.05 0.10 0.15 0.20
050100150
Moran’s I is
significantly
larger than
expected: genes
tend to be linked
to genes which
have a similar
correlation to
muscle pH.
NV2 | Mining co-expression networks 22/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Relation to muscle pH
model: label each node of the network with its partial correlation to
the muscle pH.
qq
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
1 2 3 4 5 6 7
−0.02−0.010.000.010.020.03
Cluster
PartialcorrelationwithPH
Significant Student test for
cluster 4: its correlation with pH is larger than for the other clusters
NV2 | Mining co-expression networks 22/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Moran’s plot
Moran’s plot help to emphasize influential points: WC vs C
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
−0.02 −0.01 0.00 0.01 0.02 0.03
−0.0050.0000.0050.010
CorPH
AxCorPH
H−H
H−L
L−L
L−H
NV2 | Mining co-expression networks 23/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Moran’s plot
Moran’s plot help to emphasize influential points: WC vs C
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qqq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
qq
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q q
q
q
q
−0.02 −0.01 0.00 0.01 0.02 0.03
−0.0050.0000.0050.010
CorPH
AxCorPH
H−H
H−L
L−L
L−H
Associated influential measures and tests for finding influential
points.
NV2 | Mining co-expression networks 23/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Influential points: example of cluster
4
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
BX919092
PSMC3IP
THRB
XIAP
ARHGAP8
X91721
BX917912
EAPP
LSM2
BX922053
BX922491
H2AFY
ENH_RAT
LMF1
FTCD
BX925690
B2M
GPI
BX667979
BX920538
BX671131
RNF2
BX673501
KPNA1
BX674063
BX918923
RPS11
UBE2H
Cluster 4
NV2 | Mining co-expression networks 24/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Outline
1 Overview on co-expression network analysis
2 Case study 1: gene network analysis in relations with meat
quality
3 Case study 2: gene network analysis in LCD experiment
NV2 | Mining co-expression networks 25/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Data: DIOGENES project
Experimental protocol
135 obese women and 3 times: before LCD, after a 2-month LCD
and 6 months later (between the end of LCD and the last
measurement, women are randomized into one of 5 recommended
diet groups).
At every time step, 221 gene expressions, 28 fatty acids and 15
clinical variables (i.e., weight, HDL, ...)
NV2 | Mining co-expression networks 26/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Data: DIOGENES project
Experimental protocol
135 obese women and 3 times: before LCD, after a 2-month LCD
and 6 months later (between the end of LCD and the last
measurement, women are randomized into one of 5 recommended
diet groups).
At every time step, 221 gene expressions, 28 fatty acids and 15
clinical variables (i.e., weight, HDL, ...)
Correlations between gene expressions and between a gene
expression and a fatty acid levels are not of the same order:
inference method must be different inside the groups and between
two groups.
NV2 | Mining co-expression networks 26/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Data: DIOGENES project
Data pre-processing
At CID3, individuals are split into three groups: weight loss, weight
regain and stable weight (groups are not correlated to the diet
group according to χ2
-test).
NV2 | Mining co-expression networks 26/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Method for CID1, CID2 and 3× CID3
Network inference Clustering Mining
3 inter-dataset networks
rCCA
merge into one
network
3 intra-dataset networks
sparse partial correlation
5 networks
CID1 CID2
3×CID3
Extract important nodes
Study/Compare clusters
NV2 | Mining co-expression networks 27/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inference
Intra-level networks: use of partial correlations and a sparse
approach (graphical Lasso as in the R package gLasso) to select
edges [Friedman et al., 2008]
NV2 | Mining co-expression networks 28/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inference
Intra-level networks: use of partial correlations and a sparse
approach (graphical Lasso as in the R package gLasso) to select
edges [Friedman et al., 2008]
Inter-levels networks: use of regularized CCA (as in the R package
mixOmics) to evaluate strength of the correlations
[Lê Cao et al., 2009]
NV2 | Mining co-expression networks 28/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Inference
Intra-level networks: use of partial correlations and a sparse
approach (graphical Lasso as in the R package gLasso) to select
edges [Friedman et al., 2008]
Inter-levels networks: use of regularized CCA (as in the R package
mixOmics) to evaluate strength of the correlations
[Lê Cao et al., 2009]
Combination of the 6 informations: tune the number of edges intra
or inter-levels so that it is of the order of the number of nodes in the
corresponding level(s)
NV2 | Mining co-expression networks 28/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Brief overview on results
5 networks inferred with 264 nodes each:
CID1 CID2 CID3g1 CID3g2 CID3g3
size LCC 244 251 240 259 258
density 2.3% 2.3% 2.3% 2.3% 2.3%
transitivity 17.2% 11.9% 21.6% 10.6% 10.4%
nb clusters 14 (2-52) 10 (4-52) 11 (2-46) 12 (2-51) 12 (3-54)
clusters were visualized and analyzed for important node extraction
NV2 | Mining co-expression networks 29/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Findings
CID1: clusters were found to be associated to biological
functions (fatty acids biosynthesis, adhesion and
diapedesis...)
CID3: for people with weight loss, an unexpected fatty acid
was found to be an important node (high betweenness) in a
cluster linked to fatty acids biosynthesis
NV2 | Mining co-expression networks 30/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Conclusion
biological network mining can help the biologist comprehend
the complex biological system in its whole
groups of genes are more robust models, often linked to a
biological function, than pairwise relations between genes
simple tools, such as, numeric characteristics are useful to
extract important nodes
NV2 | Mining co-expression networks 31/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Thank you for your attention...
... questions?
NV2 | Mining co-expression networks 32/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Blondel, V., Guillaume, J., Lambiotte, R., and Lefebvre, E. (2008).
Fast unfolding of communites in large networks.
Journal of Statistical Mechanics: Theory and Experiment, P10008:1742–5468.
Brohée, S. and van Helden, J. (2006).
Evaluation of clustering algorithms for protein-protein interaction networks.
BMC Bioinformatics, 7(488).
Butte, A. and Kohane, I. (1999).
Unsupervised knowledge discovery in medical databases using relevance networks.
In Proceedings of the AMIA Symposium, pages 711–715.
Butte, A. and Kohane, I. (2000).
Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements.
In Proceedings of the Pacific Symposium on Biocomputing, pages 418–429.
Fortunato, S. and Barthélémy, M. (2007).
Resolution limit in community detection.
In Proceedings of the National Academy of Sciences, volume 104, pages 36–41.
doi:10.1073/pnas.0605965104; URL: http://www.pnas.org/content/104/1/36.abstract.
Friedman, J., Hastie, T., and Tibshirani, R. (2008).
Sparse inverse covariance estimation with the graphical lasso.
Biostatistics, 9(3):432–441.
Fruchterman, T. and Reingold, B. (1991).
Graph drawing by force-directed placement.
Software, Practice and Experience, 21:1129–1164.
NV2 | Mining co-expression networks 32/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Lê Cao, K., González, I., and Déjean, S. (2009).
*****Omics: an R package to unravel relationships between two omics data sets.
Bioinformatics, 25(21):2855–2856.
Meinshausen, N. and Bühlmann, P. (2006).
High dimensional graphs and variable selection with the lasso.
Annals of Statistic, 34(3):1436–1462.
Newman, M. and Girvan, M. (2004).
Finding and evaluating community structure in networks.
Physical Review, E, 69:026113.
Noack, A. and Rotta, R. (2009).
Multi-level algorithms for modularity clustering.
In SEA ’09: Proceedings of the 8th International Symposium on Experimental Algorithms, pages 257–268, Berlin,
Heidelberg. Springer-Verlag.
Rossi, F. and Villa-Vialaneix, N. (2011).
Représentation d’un grand réseau à partir d’une classification hiérarchique de ses sommets.
Journal de la Société Française de Statistique, 152(3):34–65.
Schaeffer, S. (2007).
Graph clustering.
Computer Science Review, 1(1):27–64.
Schäfer, J. and Strimmer, K. (2005a).
An empirical bayes approach to inferring large-scale gene association networks.
Bioinformatics, 21(6):754–764.
NV2 | Mining co-expression networks 32/32
Overview on co-expression network analysis Case study 1 Case study 2 References
Schäfer, J. and Strimmer, K. (2005b).
A shrinkage approach to large-scale covariance matrix estimation and implication for functional genomics.
Statistical Applications in Genetics and Molecular Biology, 4:1–32.
Verzelen, N. (2012).
Minimax risks for sparse regressions: ultra-high-dimensional phenomenons.
Electronic Journal of Statistics, 6:38–90.
Villa-Vialaneix, N., Liaubet, L., Laurent, T., Cherel, P., Gamot, A., and San Cristobal, M. (2013).
The structure of a gene co-expression network reveals biological functions underlying eQTLs.
PLoS ONE, 8(4):e60045.
von Luxburg, U. (2007).
A tutorial on spectral clustering.
Statistics and Computing, 17(4):395–416.
Zanghi, H., Ambroise, C., and Miele, V. (2008).
Fast online graph clustering via erdös-rényi mixture.
Pattern Recognition, 41:3592–3599.
NV2 | Mining co-expression networks 32/32

Contenu connexe

Tendances

RFNM-Aranda-Final.PDF
RFNM-Aranda-Final.PDFRFNM-Aranda-Final.PDF
RFNM-Aranda-Final.PDFThomas Aranda
 
Sparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmSparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmManjari Narayan
 
Analytic tools for higher-order data
Analytic tools for higher-order dataAnalytic tools for higher-order data
Analytic tools for higher-order dataAustin Benson
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIAustin Benson
 
Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...Martin Pelikan
 
IEEE Fuzzy system Title and Abstract 2016
IEEE Fuzzy system Title and Abstract 2016 IEEE Fuzzy system Title and Abstract 2016
IEEE Fuzzy system Title and Abstract 2016 tsysglobalsolutions
 
Geometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustGeometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustKolja Kleineberg
 
Inference in HMM and Bayesian Models
Inference in HMM and Bayesian ModelsInference in HMM and Bayesian Models
Inference in HMM and Bayesian ModelsMinakshi Atre
 
Learning from (dis)similarity data
Learning from (dis)similarity dataLearning from (dis)similarity data
Learning from (dis)similarity datatuxette
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics Kolja Kleineberg
 
Artificial neural networks and its application
Artificial neural networks and its applicationArtificial neural networks and its application
Artificial neural networks and its applicationHưng Đặng
 
A short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studiesA short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studiestuxette
 
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms ijcseit
 
CoCoWa A Collaborative Contact-Based
CoCoWa  A Collaborative Contact-BasedCoCoWa  A Collaborative Contact-Based
CoCoWa A Collaborative Contact-BasedBen Paul
 
Combining co-expression and co-location for gene network inference in porcine...
Combining co-expression and co-location for gene network inference in porcine...Combining co-expression and co-location for gene network inference in porcine...
Combining co-expression and co-location for gene network inference in porcine...tuxette
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesKolja Kleineberg
 

Tendances (18)

RFNM-Aranda-Final.PDF
RFNM-Aranda-Final.PDFRFNM-Aranda-Final.PDF
RFNM-Aranda-Final.PDF
 
Sparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggmSparse inverse covariance estimation using skggm
Sparse inverse covariance estimation using skggm
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
 
Analytic tools for higher-order data
Analytic tools for higher-order dataAnalytic tools for higher-order data
Analytic tools for higher-order data
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoI
 
Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...Distance-based bias in model-directed optimization of additively decomposable...
Distance-based bias in model-directed optimization of additively decomposable...
 
Ijciet 10 01_153-2
Ijciet 10 01_153-2Ijciet 10 01_153-2
Ijciet 10 01_153-2
 
IEEE Fuzzy system Title and Abstract 2016
IEEE Fuzzy system Title and Abstract 2016 IEEE Fuzzy system Title and Abstract 2016
IEEE Fuzzy system Title and Abstract 2016
 
Geometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustGeometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robust
 
Inference in HMM and Bayesian Models
Inference in HMM and Bayesian ModelsInference in HMM and Bayesian Models
Inference in HMM and Bayesian Models
 
Learning from (dis)similarity data
Learning from (dis)similarity dataLearning from (dis)similarity data
Learning from (dis)similarity data
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
 
Artificial neural networks and its application
Artificial neural networks and its applicationArtificial neural networks and its application
Artificial neural networks and its application
 
A short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studiesA short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studies
 
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
A Novel Dencos Model For High Dimensional Data Using Genetic Algorithms
 
CoCoWa A Collaborative Contact-Based
CoCoWa  A Collaborative Contact-BasedCoCoWa  A Collaborative Contact-Based
CoCoWa A Collaborative Contact-Based
 
Combining co-expression and co-location for gene network inference in porcine...
Combining co-expression and co-location for gene network inference in porcine...Combining co-expression and co-location for gene network inference in porcine...
Combining co-expression and co-location for gene network inference in porcine...
 
Spatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexesSpatial patterns in evolutionary games on scale-free networks and multiplexes
Spatial patterns in evolutionary games on scale-free networks and multiplexes
 

En vedette

Visualiser et fouiller des réseaux - Méthodes et exemples dans R
Visualiser et fouiller des réseaux - Méthodes et exemples dans RVisualiser et fouiller des réseaux - Méthodes et exemples dans R
Visualiser et fouiller des réseaux - Méthodes et exemples dans Rtuxette
 
Interpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional dataInterpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional datatuxette
 
Slides Lycée Jules Fil 2014
Slides Lycée Jules Fil 2014Slides Lycée Jules Fil 2014
Slides Lycée Jules Fil 2014tuxette
 
Graph mining 2: Statistical approaches for graph mining
Graph mining 2: Statistical approaches for graph miningGraph mining 2: Statistical approaches for graph mining
Graph mining 2: Statistical approaches for graph miningtuxette
 
Random Forest for Big Data
Random Forest for Big DataRandom Forest for Big Data
Random Forest for Big Datatuxette
 
A short introduction to statistical learning
A short introduction to statistical learningA short introduction to statistical learning
A short introduction to statistical learningtuxette
 
Integrating Tara Oceans datasets using unsupervised multiple kernel learning
Integrating Tara Oceans datasets using unsupervised multiple kernel learningIntegrating Tara Oceans datasets using unsupervised multiple kernel learning
Integrating Tara Oceans datasets using unsupervised multiple kernel learningtuxette
 
Multiple kernel learning applied to the integration of Tara oceans datasets
Multiple kernel learning applied to the integration of Tara oceans datasetsMultiple kernel learning applied to the integration of Tara oceans datasets
Multiple kernel learning applied to the integration of Tara oceans datasetstuxette
 

En vedette (8)

Visualiser et fouiller des réseaux - Méthodes et exemples dans R
Visualiser et fouiller des réseaux - Méthodes et exemples dans RVisualiser et fouiller des réseaux - Méthodes et exemples dans R
Visualiser et fouiller des réseaux - Méthodes et exemples dans R
 
Interpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional dataInterpretable Sparse Sliced Inverse Regression for digitized functional data
Interpretable Sparse Sliced Inverse Regression for digitized functional data
 
Slides Lycée Jules Fil 2014
Slides Lycée Jules Fil 2014Slides Lycée Jules Fil 2014
Slides Lycée Jules Fil 2014
 
Graph mining 2: Statistical approaches for graph mining
Graph mining 2: Statistical approaches for graph miningGraph mining 2: Statistical approaches for graph mining
Graph mining 2: Statistical approaches for graph mining
 
Random Forest for Big Data
Random Forest for Big DataRandom Forest for Big Data
Random Forest for Big Data
 
A short introduction to statistical learning
A short introduction to statistical learningA short introduction to statistical learning
A short introduction to statistical learning
 
Integrating Tara Oceans datasets using unsupervised multiple kernel learning
Integrating Tara Oceans datasets using unsupervised multiple kernel learningIntegrating Tara Oceans datasets using unsupervised multiple kernel learning
Integrating Tara Oceans datasets using unsupervised multiple kernel learning
 
Multiple kernel learning applied to the integration of Tara oceans datasets
Multiple kernel learning applied to the integration of Tara oceans datasetsMultiple kernel learning applied to the integration of Tara oceans datasets
Multiple kernel learning applied to the integration of Tara oceans datasets
 

Similaire à Mining co-expression network

Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsAustin Benson
 
08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)dnac
 
Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...N. Sukumar
 
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...ssuser4b1f48
 
Multivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological dataMultivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological dataDmitry Grapov
 
Spectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresSpectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresDavid Gleich
 
Geometric correlations mitigate the extreme vulnerability of multiplex networ...
Geometric correlations mitigate the extreme vulnerability of multiplex networ...Geometric correlations mitigate the extreme vulnerability of multiplex networ...
Geometric correlations mitigate the extreme vulnerability of multiplex networ...Kolja Kleineberg
 
Chapter 10 link prediction
Chapter 10 link predictionChapter 10 link prediction
Chapter 10 link predictionAbanobZakaria1
 
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...BigMine
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degreeIJCNCJournal
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Devansh16
 
Foundation and Synchronization of the Dynamic Output Dual Systems
Foundation and Synchronization of the Dynamic Output Dual SystemsFoundation and Synchronization of the Dynamic Output Dual Systems
Foundation and Synchronization of the Dynamic Output Dual Systemsijtsrd
 
Self-organizing Network for Variable Clustering and Predictive Modeling
Self-organizing Network for Variable Clustering and Predictive ModelingSelf-organizing Network for Variable Clustering and Predictive Modeling
Self-organizing Network for Variable Clustering and Predictive ModelingHui Yang
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)CSCJournals
 
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...Förderverein Technische Fakultät
 
To the numerical modeling of self similar solutions of
To the numerical modeling of self similar solutions ofTo the numerical modeling of self similar solutions of
To the numerical modeling of self similar solutions ofeSAT Publishing House
 
Graph Representation Learning
Graph Representation LearningGraph Representation Learning
Graph Representation LearningJure Leskovec
 
Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsAustin Benson
 
Mining group correlations over data streams
Mining group correlations over data streamsMining group correlations over data streams
Mining group correlations over data streamsyuanchung
 

Similaire à Mining co-expression network (20)

Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifs
 
08 Exponential Random Graph Models (2016)
08 Exponential Random Graph Models (2016)08 Exponential Random Graph Models (2016)
08 Exponential Random Graph Models (2016)
 
08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)
 
Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...
 
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...
 
Multivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological dataMultivariate data analysis and visualization tools for biological data
Multivariate data analysis and visualization tools for biological data
 
Spectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresSpectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structures
 
Geometric correlations mitigate the extreme vulnerability of multiplex networ...
Geometric correlations mitigate the extreme vulnerability of multiplex networ...Geometric correlations mitigate the extreme vulnerability of multiplex networ...
Geometric correlations mitigate the extreme vulnerability of multiplex networ...
 
Chapter 10 link prediction
Chapter 10 link predictionChapter 10 link prediction
Chapter 10 link prediction
 
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degree
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...
 
Foundation and Synchronization of the Dynamic Output Dual Systems
Foundation and Synchronization of the Dynamic Output Dual SystemsFoundation and Synchronization of the Dynamic Output Dual Systems
Foundation and Synchronization of the Dynamic Output Dual Systems
 
Self-organizing Network for Variable Clustering and Predictive Modeling
Self-organizing Network for Variable Clustering and Predictive ModelingSelf-organizing Network for Variable Clustering and Predictive Modeling
Self-organizing Network for Variable Clustering and Predictive Modeling
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)
 
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...
Nonequilibrium Network Dynamics_Inference, Fluctuation-Respones & Tipping Poi...
 
To the numerical modeling of self similar solutions of
To the numerical modeling of self similar solutions ofTo the numerical modeling of self similar solutions of
To the numerical modeling of self similar solutions of
 
Graph Representation Learning
Graph Representation LearningGraph Representation Learning
Graph Representation Learning
 
Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifs
 
Mining group correlations over data streams
Mining group correlations over data streamsMining group correlations over data streams
Mining group correlations over data streams
 

Plus de tuxette

Racines en haut et feuilles en bas : les arbres en maths
Racines en haut et feuilles en bas : les arbres en mathsRacines en haut et feuilles en bas : les arbres en maths
Racines en haut et feuilles en bas : les arbres en mathstuxette
 
Méthodes à noyaux pour l’intégration de données hétérogènes
Méthodes à noyaux pour l’intégration de données hétérogènesMéthodes à noyaux pour l’intégration de données hétérogènes
Méthodes à noyaux pour l’intégration de données hétérogènestuxette
 
Méthodologies d'intégration de données omiques
Méthodologies d'intégration de données omiquesMéthodologies d'intégration de données omiques
Méthodologies d'intégration de données omiquestuxette
 
Projets autour de l'Hi-C
Projets autour de l'Hi-CProjets autour de l'Hi-C
Projets autour de l'Hi-Ctuxette
 
Can deep learning learn chromatin structure from sequence?
Can deep learning learn chromatin structure from sequence?Can deep learning learn chromatin structure from sequence?
Can deep learning learn chromatin structure from sequence?tuxette
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...tuxette
 
ASTERICS : une application pour intégrer des données omiques
ASTERICS : une application pour intégrer des données omiquesASTERICS : une application pour intégrer des données omiques
ASTERICS : une application pour intégrer des données omiquestuxette
 
Autour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeanAutour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeantuxette
 
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...tuxette
 
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquesApprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquestuxette
 
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...tuxette
 
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...tuxette
 
Journal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation dataJournal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation datatuxette
 
Overfitting or overparametrization?
Overfitting or overparametrization?Overfitting or overparametrization?
Overfitting or overparametrization?tuxette
 
Selective inference and single-cell differential analysis
Selective inference and single-cell differential analysisSelective inference and single-cell differential analysis
Selective inference and single-cell differential analysistuxette
 
SOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricesSOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricestuxette
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Predictiontuxette
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelstuxette
 
Explanable models for time series with random forest
Explanable models for time series with random forestExplanable models for time series with random forest
Explanable models for time series with random foresttuxette
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICStuxette
 

Plus de tuxette (20)

Racines en haut et feuilles en bas : les arbres en maths
Racines en haut et feuilles en bas : les arbres en mathsRacines en haut et feuilles en bas : les arbres en maths
Racines en haut et feuilles en bas : les arbres en maths
 
Méthodes à noyaux pour l’intégration de données hétérogènes
Méthodes à noyaux pour l’intégration de données hétérogènesMéthodes à noyaux pour l’intégration de données hétérogènes
Méthodes à noyaux pour l’intégration de données hétérogènes
 
Méthodologies d'intégration de données omiques
Méthodologies d'intégration de données omiquesMéthodologies d'intégration de données omiques
Méthodologies d'intégration de données omiques
 
Projets autour de l'Hi-C
Projets autour de l'Hi-CProjets autour de l'Hi-C
Projets autour de l'Hi-C
 
Can deep learning learn chromatin structure from sequence?
Can deep learning learn chromatin structure from sequence?Can deep learning learn chromatin structure from sequence?
Can deep learning learn chromatin structure from sequence?
 
Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...Multi-omics data integration methods: kernel and other machine learning appro...
Multi-omics data integration methods: kernel and other machine learning appro...
 
ASTERICS : une application pour intégrer des données omiques
ASTERICS : une application pour intégrer des données omiquesASTERICS : une application pour intégrer des données omiques
ASTERICS : une application pour intégrer des données omiques
 
Autour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeanAutour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWean
 
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
 
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquesApprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
 
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
 
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
 
Journal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation dataJournal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation data
 
Overfitting or overparametrization?
Overfitting or overparametrization?Overfitting or overparametrization?
Overfitting or overparametrization?
 
Selective inference and single-cell differential analysis
Selective inference and single-cell differential analysisSelective inference and single-cell differential analysis
Selective inference and single-cell differential analysis
 
SOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricesSOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatrices
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Prediction
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction models
 
Explanable models for time series with random forest
Explanable models for time series with random forestExplanable models for time series with random forest
Explanable models for time series with random forest
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICS
 

Dernier

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 

Dernier (20)

GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 

Mining co-expression network

  • 1. Overview on co-expression network analysis Case study 1 Case study 2 References Mining co-expression networks Nathalie Villa-Vialaneix http://www.nathalievilla.org INRA, Unité MIA-T, INRA, Toulouse (France) School for advanced sciences of Luchon Network analysis and applications NV2 | Mining co-expression networks 1/32
  • 2. Overview on co-expression network analysis Case study 1 Case study 2 References Outline 1 Overview on co-expression network analysis 2 Case study 1: gene network analysis in relations with meat quality 3 Case study 2: gene network analysis in LCD experiment NV2 | Mining co-expression networks 2/32
  • 3. Overview on co-expression network analysis Case study 1 Case study 2 References Outline 1 Overview on co-expression network analysis 2 Case study 1: gene network analysis in relations with meat quality 3 Case study 2: gene network analysis in LCD experiment NV2 | Mining co-expression networks 3/32
  • 4. Overview on co-expression network analysis Case study 1 Case study 2 References Transcriptomic data DNA transcripted into mRNA to produce proteins NV2 | Mining co-expression networks 4/32
  • 5. Overview on co-expression network analysis Case study 1 Case study 2 References Transcriptomic data DNA transcripted into mRNA to produce proteins transcriptomic data: measure of the quantity of mRNA corresponding to a given gene in given cells (blood, muscle...) of a living organism NV2 | Mining co-expression networks 4/32
  • 6. Overview on co-expression network analysis Case study 1 Case study 2 References Systems biology Some genes’ expressions activate or repress other genes’ expressions ⇒ understanding the whole cascade helps to comprehend the global functioning of living organisms1 1 Picture taken from: Abdollahi A et al., PNAS 2007, 104:12890-12895. c 2007 by National Academy of Sciences NV2 | Mining co-expression networks 5/32
  • 7. Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Example: co-expression networks built from microarray/RNAseq data (nodes = genes; edges = significant “direct links” between expressions of two genes) NV2 | Mining co-expression networks 6/32
  • 8. Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) 1 Network visualization: nodes are not a priori given a position. Random positions Positions aiming at representing connected nodes closer NV2 | Mining co-expression networks 6/32
  • 9. Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) 1 Network visualization: nodes are not a priori given a position. 2 Important node extraction (high degree, high centrality...) NV2 | Mining co-expression networks 6/32
  • 10. Overview on co-expression network analysis Case study 1 Case study 2 References Standard issues in network analysis Inference Giving expression data, how to build a graph whose edges represent the direct links between genes? Graph mining (examples) 1 Network visualization: nodes are not a priori given a position. 2 Important node extraction (high degree, high centrality...) 3 Network clustering: identify “communities” NV2 | Mining co-expression networks 6/32
  • 11. Overview on co-expression network analysis Case study 1 Case study 2 References Network inference Data: large scale gene expression data individuals n 30/50    X =   . . . . . . . . X j i . . . . . . . . .   variables (genes expression), p 103/4 What we want to obtain: a graph/network with nodes: genes (a selected sublist of interest2 ; usually, DE genes); edges: “strong relations” between gene expressions. 2 See [Verzelen, 2012] for conditions on respective n/p suited for inference. NV2 | Mining co-expression networks 7/32
  • 12. Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model 1 over raw data: focuses on the strongest direct relationships: irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). NV2 | Mining co-expression networks 8/32
  • 13. Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model 1 over raw data: focuses on the strongest direct relationships: irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). Expression data are analyzed all together and not by pairs (systems model). NV2 | Mining co-expression networks 8/32
  • 14. Overview on co-expression network analysis Case study 1 Case study 2 References Advantages of this network model 1 over raw data: focuses on the strongest direct relationships: irrelevant or indirect relations are removed (more robust) and the data are easier to visualize and understand (track transcription relations). Expression data are analyzed all together and not by pairs (systems model). 2 over bibliographic network: can handle interactions with yet unknown (not annotated) genes and deal with data collected in a particular condition. NV2 | Mining co-expression networks 8/32
  • 15. Overview on co-expression network analysis Case study 1 Case study 2 References Using correlations: relevance network [Butte and Kohane, 1999, Butte and Kohane, 2000] First (naive) approach: calculate correlations between expressions for all pairs of genes, threshold the smallest ones and build the network. Correlations Thresholding Graph NV2 | Mining co-expression networks 9/32
  • 16. Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations strong indirect correlation y z x NV2 | Mining co-expression networks 10/32
  • 17. Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations strong indirect correlation y z x set.seed(2807); x <- rnorm(100) y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826 z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751 cor(y,z) [1] 0.9971105 NV2 | Mining co-expression networks 10/32
  • 18. Overview on co-expression network analysis Case study 1 Case study 2 References Using partial correlations strong indirect correlation y z x set.seed(2807); x <- rnorm(100) y <- 2*x+1+rnorm(100,0,0.1); cor(x,y) [1] 0.998826 z <- 2*x+1+rnorm(100,0,0.1); cor(x,z) [1] 0.998751 cor(y,z) [1] 0.9971105 Partial correlation cor(lm(x∼z)$residuals,lm(y∼z)$residuals) [1] 0.7801174 cor(lm(x∼y)$residuals,lm(z∼y)$residuals) [1] 0.7639094 cor(lm(y∼x)$residuals,lm(z∼x)$residuals) [1] -0.1933699 NV2 | Mining co-expression networks 10/32
  • 19. Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: (Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene expression); then j ←→ j (genes j and j are linked) ⇔ Cor Xj , Xj |(Xk )k j,j 0 NV2 | Mining co-expression networks 11/32
  • 20. Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: (Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene expression); then j ←→ j (genes j and j are linked) ⇔ Cor Xj , Xj |(Xk )k j,j 0 If (concentration matrix) S = Σ−1 , Cor Xj , Xj |(Xk )k j,j = − Sjj SjjSj j ⇒ Estimate Σ−1 to unravel the graph structure NV2 | Mining co-expression networks 11/32
  • 21. Overview on co-expression network analysis Case study 1 Case study 2 References Partial correlation and GGM Gaussian Graphical Model framework: (Xi)i=1,...,n are i.i.d. Gaussian random variables N(0, Σ) (gene expression); then j ←→ j (genes j and j are linked) ⇔ Cor Xj , Xj |(Xk )k j,j 0 If (concentration matrix) S = Σ−1 , Cor Xj , Xj |(Xk )k j,j = − Sjj SjjSj j ⇒ Estimate Σ−1 to unravel the graph structure Problem: Σ: p-dimensional matrix and n p ⇒ (Σn )−1 is a poor estimate of S)! NV2 | Mining co-expression networks 11/32
  • 22. Overview on co-expression network analysis Case study 1 Case study 2 References Estimation in GGM Graphical Gaussian Model estimation seminal work: [Schäfer and Strimmer, 2005a, Schäfer and Strimmer, 2005b] (with shrinkage and a proposal for a Bayesian test of significance) estimate Σ−1 by (Σn + λI)−1 use a Bayesian test to test which coefficients are significantly non zero. NV2 | Mining co-expression networks 12/32
  • 23. Overview on co-expression network analysis Case study 1 Case study 2 References Estimation in GGM Graphical Gaussian Model estimation seminal work: [Schäfer and Strimmer, 2005a, Schäfer and Strimmer, 2005b] (with shrinkage and a proposal for a Bayesian test of significance) estimate Σ−1 by (Σn + λI)−1 use a Bayesian test to test which coefficients are significantly non zero. sparse approaches: [Meinshausen and Bühlmann, 2006, Friedman et al., 2008]: ∀ j, estimate the linear models Xj = βT j X−j + by penalized ML arg min(βjj )j n i=1 Xij − βT j X −j i 2 + λ βj L1 , with βj L1 = j |βjj |, L1 penalty yields to βjj = 0 for most j (variable selection) NV2 | Mining co-expression networks 12/32
  • 24. Overview on co-expression network analysis Case study 1 Case study 2 References Visualization Purpose: How to display the nodes in a meaningful and aesthetic way? NV2 | Mining co-expression networks 13/32
  • 25. Overview on co-expression network analysis Case study 1 Case study 2 References Visualization Purpose: How to display the nodes in a meaningful and aesthetic way? Standard approach: force directed placement algorithms (FDP) (e.g., [Fruchterman and Reingold, 1991]) NV2 | Mining co-expression networks 13/32
  • 26. Overview on co-expression network analysis Case study 1 Case study 2 References Visualization Purpose: How to display the nodes in a meaningful and aesthetic way? Standard approach: force directed placement algorithms (FDP) (e.g., [Fruchterman and Reingold, 1991]) attractive forces: similar to springs along the edges NV2 | Mining co-expression networks 13/32
  • 27. Overview on co-expression network analysis Case study 1 Case study 2 References Visualization Purpose: How to display the nodes in a meaningful and aesthetic way? Standard approach: force directed placement algorithms (FDP) (e.g., [Fruchterman and Reingold, 1991]) attractive forces: similar to springs along the edges repulsive forces: similar to electric forces between all pairs of vertices NV2 | Mining co-expression networks 13/32
  • 28. Overview on co-expression network analysis Case study 1 Case study 2 References Visualization Purpose: How to display the nodes in a meaningful and aesthetic way? Standard approach: force directed placement algorithms (FDP) (e.g., [Fruchterman and Reingold, 1991]) iterative algorithm until stabilization of the vertex positions. NV2 | Mining co-expression networks 13/32
  • 29. Overview on co-expression network analysis Case study 1 Case study 2 References Important node extraction 1 vertex degree: number of edges adjacent to a given vertex. Vertices with a high degree are called hubs: measure of the vertex popularity. NV2 | Mining co-expression networks 14/32
  • 30. Overview on co-expression network analysis Case study 1 Case study 2 References Important node extraction 1 vertex degree: number of edges adjacent to a given vertex. Vertices with a high degree are called hubs: measure of the vertex popularity. 2 vertex betweenness: number of shortest paths between all pairs of vertices that pass through the vertex. Betweenness is a centrality measure (vertices with a large betweenness that are the most likely to disconnect the network if removed). The orange node’s degree is equal to 2, its betweenness to 4. NV2 | Mining co-expression networks 14/32
  • 31. Overview on co-expression network analysis Case study 1 Case study 2 References Vertex clustering Cluster vertexes into groups that are densely connected and share a few links (comparatively) with the other groups. Clusters are often called communities (social sciences) or modules (biology). NV2 | Mining co-expression networks 15/32
  • 32. Overview on co-expression network analysis Case study 1 Case study 2 References Vertex clustering Cluster vertexes into groups that are densely connected and share a few links (comparatively) with the other groups. Clusters are often called communities (social sciences) or modules (biology). Several clustering methods: min cut minimization minimizes the number of edges between clusters; spectral clustering [von Luxburg, 2007] and kernel clustering uses eigen-decomposition of the Laplacian Lij = −wij if i j di otherwise (matrix strongly related to the graph structure); Generative (Bayesian) models [Zanghi et al., 2008]; Markov clustering simulate a flow on the graph; modularity maximization ... (clustering jungle... see e.g., [Fortunato and Barthélémy, 2007, Schaeffer, 2007, Brohée and van Helden, 2006]) NV2 | Mining co-expression networks 15/32
  • 33. Overview on co-expression network analysis Case study 1 Case study 2 References Modularity optimization The modularity [Newman and Girvan, 2004] of the partition (C1, . . . , CK ) is equal to: Q(C1, . . . , CK ) = 1 2m K k=1 xi,xj∈Ck (Wij − Pij) with Pij: weight of a “null model” (graph with the same degree distribution but no preferential attachment): Pij = didj 2m with di = 1 2 j i Wij. NV2 | Mining co-expression networks 16/32
  • 34. Overview on co-expression network analysis Case study 1 Case study 2 References Interpretation A good clustering should maximize the modularity: Q when (xi, xj) are in the same cluster and Wij Pij Q when (xi, xj) are in two different clusters and Wij Pij (m = 20) Pij = 7.5 Wij = 5 ⇒ Wij − Pij = −2.5 di = 15 dj = 20 i and j in the same cluster decreases the modularity NV2 | Mining co-expression networks 17/32
  • 35. Overview on co-expression network analysis Case study 1 Case study 2 References Interpretation A good clustering should maximize the modularity: Q when (xi, xj) are in the same cluster and Wij Pij Q when (xi, xj) are in two different clusters and Wij Pij (m = 20) Pij = 0.05 Wij = 5 ⇒ Wij − Pij = 4.95 di = 1 dj = 2 i and j in the same cluster increases the modularity NV2 | Mining co-expression networks 17/32
  • 36. Overview on co-expression network analysis Case study 1 Case study 2 References Interpretation A good clustering should maximize the modularity: Q when (xi, xj) are in the same cluster and Wij Pij Q when (xi, xj) are in two different clusters and Wij Pij Modularity helps separate hubs ( spectral clustering or min cut criterion); is not an increasing function of the number of clusters: useful to choose the relevant number of clusters (with a grid search: several values are tested, the clustering with the highest modularity is kept) but modularity has a small resolution default (see [Fortunato and Barthélémy, 2007]) NV2 | Mining co-expression networks 17/32
  • 37. Overview on co-expression network analysis Case study 1 Case study 2 References Interpretation A good clustering should maximize the modularity: Q when (xi, xj) are in the same cluster and Wij Pij Q when (xi, xj) are in two different clusters and Wij Pij Modularity helps separate hubs ( spectral clustering or min cut criterion); is not an increasing function of the number of clusters: useful to choose the relevant number of clusters (with a grid search: several values are tested, the clustering with the highest modularity is kept) but modularity has a small resolution default (see [Fortunato and Barthélémy, 2007]) Main issue: Optimization = NP-complete problem (exhaustive search is not not usable) Different solutions are provided in [Newman and Girvan, 2004, Blondel et al., 2008, Noack and Rotta, 2009, Rossi and Villa-Vialaneix, 2011] (among others) and some of them are implemented in the R package igraph. NV2 | Mining co-expression networks 17/32
  • 38. Overview on co-expression network analysis Case study 1 Case study 2 References Outline 1 Overview on co-expression network analysis 2 Case study 1: gene network analysis in relations with meat quality 3 Case study 2: gene network analysis in LCD experiment NV2 | Mining co-expression networks 18/32
  • 39. Overview on co-expression network analysis Case study 1 Case study 2 References Dataset description [Villa-Vialaneix et al., 2013] F2: 1200 animals muscle sampling phenotypic measures (30) (pH ...) NV2 | Mining co-expression networks 19/32
  • 40. Overview on co-expression network analysis Case study 1 Case study 2 References Dataset description [Villa-Vialaneix et al., 2013] F2: 1200 animals muscle sampling phenotypic measures (30) (pH ...) Used data: 57 F2 pigs (largest variability for PH); transcriptomic data for 272 genes regulated by an eQTL Problems with these particular data: how to understand the relationships between these genes’ expression as their co-expression is weaker than between other kind of genes (TF/genes, for instance)? how to relate gene expression with a phenotype of interest (muscle pH)? NV2 | Mining co-expression networks 19/32
  • 41. Overview on co-expression network analysis Case study 1 Case study 2 References Inferred network description Use of [Schäfer and Strimmer, 2005a] Obtained network: 272 nodes (connected); Density: 6,4%; Transitivity: 25,4% NV2 | Mining co-expression networks 20/32
  • 42. Overview on co-expression network analysis Case study 1 Case study 2 References Inferred network description Use of [Schäfer and Strimmer, 2005a] Obtained network: 272 nodes (connected); Density: 6,4%; Transitivity: 25,4% degree distribution Degrés Frequency 5 10 15 20 25 30 010203040 NV2 | Mining co-expression networks 20/32
  • 43. Overview on co-expression network analysis Case study 1 Case study 2 References Inferred network description Use of [Schäfer and Strimmer, 2005a] Obtained network: 272 nodes (connected); Density: 6,4%; Transitivity: 25,4% 8 genes both have high degree and high betweenness BX921641; FTH1; TRIAP1; SLC9A14; GPI; SUZ12; MGP; PRDX4 and several have been identified by the biologist as relevant to meat quality. NV2 | Mining co-expression networks 20/32
  • 44. Overview on co-expression network analysis Case study 1 Case study 2 References Clustering clustering with modularity optimization: 7 clusters for each cluster, annotated genes submitted to IPA3 (bibliographic network database): from 71% to 94% of the genes of a single cluster belong to the same IPA network with a biological function associated 3 https://analysis.ingenuity.com/pa NV2 | Mining co-expression networks 21/32
  • 45. Overview on co-expression network analysis Case study 1 Case study 2 References Relation to muscle pH model: label each node of the network with its partial correlation to the muscle pH. Questions: is there a relation between muscle pH and network structure? is there a relation between clustering and muscle pH? NV2 | Mining co-expression networks 22/32
  • 46. Overview on co-expression network analysis Case study 1 Case study 2 References Relation to muscle pH model: label each node of the network with its partial correlation to the muscle pH. Moran’s I (used in spatial statistics): I = 1 2m i j wij¯ci¯cj 1 n i ¯c2 i , where m = 1 2 i j Wij and ci is the partial correlation with pH, ¯ci = ci − ¯c with ¯c = 1 n i ci. Using a MC simulation (edge permutations): Moran's I Frequency −0.05 0.00 0.05 0.10 0.15 0.20 050100150 Moran’s I is significantly larger than expected: genes tend to be linked to genes which have a similar correlation to muscle pH. NV2 | Mining co-expression networks 22/32
  • 47. Overview on co-expression network analysis Case study 1 Case study 2 References Relation to muscle pH model: label each node of the network with its partial correlation to the muscle pH. qq q q q q q q q qq q q q q q q 1 2 3 4 5 6 7 −0.02−0.010.000.010.020.03 Cluster PartialcorrelationwithPH Significant Student test for cluster 4: its correlation with pH is larger than for the other clusters NV2 | Mining co-expression networks 22/32
  • 48. Overview on co-expression network analysis Case study 1 Case study 2 References Moran’s plot Moran’s plot help to emphasize influential points: WC vs C q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q q q q q q q q q q qq q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q −0.02 −0.01 0.00 0.01 0.02 0.03 −0.0050.0000.0050.010 CorPH AxCorPH H−H H−L L−L L−H NV2 | Mining co-expression networks 23/32
  • 49. Overview on co-expression network analysis Case study 1 Case study 2 References Moran’s plot Moran’s plot help to emphasize influential points: WC vs C q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q qqq q q q q q q q q q q q q q q q q q q q q q q qq q q q qq q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q −0.02 −0.01 0.00 0.01 0.02 0.03 −0.0050.0000.0050.010 CorPH AxCorPH H−H H−L L−L L−H Associated influential measures and tests for finding influential points. NV2 | Mining co-expression networks 23/32
  • 50. Overview on co-expression network analysis Case study 1 Case study 2 References Influential points: example of cluster 4 q q q q q q q q q q q q q q q q q q q q q q q q q q q q BX919092 PSMC3IP THRB XIAP ARHGAP8 X91721 BX917912 EAPP LSM2 BX922053 BX922491 H2AFY ENH_RAT LMF1 FTCD BX925690 B2M GPI BX667979 BX920538 BX671131 RNF2 BX673501 KPNA1 BX674063 BX918923 RPS11 UBE2H Cluster 4 NV2 | Mining co-expression networks 24/32
  • 51. Overview on co-expression network analysis Case study 1 Case study 2 References Outline 1 Overview on co-expression network analysis 2 Case study 1: gene network analysis in relations with meat quality 3 Case study 2: gene network analysis in LCD experiment NV2 | Mining co-expression networks 25/32
  • 52. Overview on co-expression network analysis Case study 1 Case study 2 References Data: DIOGENES project Experimental protocol 135 obese women and 3 times: before LCD, after a 2-month LCD and 6 months later (between the end of LCD and the last measurement, women are randomized into one of 5 recommended diet groups). At every time step, 221 gene expressions, 28 fatty acids and 15 clinical variables (i.e., weight, HDL, ...) NV2 | Mining co-expression networks 26/32
  • 53. Overview on co-expression network analysis Case study 1 Case study 2 References Data: DIOGENES project Experimental protocol 135 obese women and 3 times: before LCD, after a 2-month LCD and 6 months later (between the end of LCD and the last measurement, women are randomized into one of 5 recommended diet groups). At every time step, 221 gene expressions, 28 fatty acids and 15 clinical variables (i.e., weight, HDL, ...) Correlations between gene expressions and between a gene expression and a fatty acid levels are not of the same order: inference method must be different inside the groups and between two groups. NV2 | Mining co-expression networks 26/32
  • 54. Overview on co-expression network analysis Case study 1 Case study 2 References Data: DIOGENES project Data pre-processing At CID3, individuals are split into three groups: weight loss, weight regain and stable weight (groups are not correlated to the diet group according to χ2 -test). NV2 | Mining co-expression networks 26/32
  • 55. Overview on co-expression network analysis Case study 1 Case study 2 References Method for CID1, CID2 and 3× CID3 Network inference Clustering Mining 3 inter-dataset networks rCCA merge into one network 3 intra-dataset networks sparse partial correlation 5 networks CID1 CID2 3×CID3 Extract important nodes Study/Compare clusters NV2 | Mining co-expression networks 27/32
  • 56. Overview on co-expression network analysis Case study 1 Case study 2 References Inference Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R package gLasso) to select edges [Friedman et al., 2008] NV2 | Mining co-expression networks 28/32
  • 57. Overview on co-expression network analysis Case study 1 Case study 2 References Inference Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R package gLasso) to select edges [Friedman et al., 2008] Inter-levels networks: use of regularized CCA (as in the R package mixOmics) to evaluate strength of the correlations [Lê Cao et al., 2009] NV2 | Mining co-expression networks 28/32
  • 58. Overview on co-expression network analysis Case study 1 Case study 2 References Inference Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R package gLasso) to select edges [Friedman et al., 2008] Inter-levels networks: use of regularized CCA (as in the R package mixOmics) to evaluate strength of the correlations [Lê Cao et al., 2009] Combination of the 6 informations: tune the number of edges intra or inter-levels so that it is of the order of the number of nodes in the corresponding level(s) NV2 | Mining co-expression networks 28/32
  • 59. Overview on co-expression network analysis Case study 1 Case study 2 References Brief overview on results 5 networks inferred with 264 nodes each: CID1 CID2 CID3g1 CID3g2 CID3g3 size LCC 244 251 240 259 258 density 2.3% 2.3% 2.3% 2.3% 2.3% transitivity 17.2% 11.9% 21.6% 10.6% 10.4% nb clusters 14 (2-52) 10 (4-52) 11 (2-46) 12 (2-51) 12 (3-54) clusters were visualized and analyzed for important node extraction NV2 | Mining co-expression networks 29/32
  • 60. Overview on co-expression network analysis Case study 1 Case study 2 References Findings CID1: clusters were found to be associated to biological functions (fatty acids biosynthesis, adhesion and diapedesis...) CID3: for people with weight loss, an unexpected fatty acid was found to be an important node (high betweenness) in a cluster linked to fatty acids biosynthesis NV2 | Mining co-expression networks 30/32
  • 61. Overview on co-expression network analysis Case study 1 Case study 2 References Conclusion biological network mining can help the biologist comprehend the complex biological system in its whole groups of genes are more robust models, often linked to a biological function, than pairwise relations between genes simple tools, such as, numeric characteristics are useful to extract important nodes NV2 | Mining co-expression networks 31/32
  • 62. Overview on co-expression network analysis Case study 1 Case study 2 References Thank you for your attention... ... questions? NV2 | Mining co-expression networks 32/32
  • 63. Overview on co-expression network analysis Case study 1 Case study 2 References Blondel, V., Guillaume, J., Lambiotte, R., and Lefebvre, E. (2008). Fast unfolding of communites in large networks. Journal of Statistical Mechanics: Theory and Experiment, P10008:1742–5468. Brohée, S. and van Helden, J. (2006). Evaluation of clustering algorithms for protein-protein interaction networks. BMC Bioinformatics, 7(488). Butte, A. and Kohane, I. (1999). Unsupervised knowledge discovery in medical databases using relevance networks. In Proceedings of the AMIA Symposium, pages 711–715. Butte, A. and Kohane, I. (2000). Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. In Proceedings of the Pacific Symposium on Biocomputing, pages 418–429. Fortunato, S. and Barthélémy, M. (2007). Resolution limit in community detection. In Proceedings of the National Academy of Sciences, volume 104, pages 36–41. doi:10.1073/pnas.0605965104; URL: http://www.pnas.org/content/104/1/36.abstract. Friedman, J., Hastie, T., and Tibshirani, R. (2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9(3):432–441. Fruchterman, T. and Reingold, B. (1991). Graph drawing by force-directed placement. Software, Practice and Experience, 21:1129–1164. NV2 | Mining co-expression networks 32/32
  • 64. Overview on co-expression network analysis Case study 1 Case study 2 References Lê Cao, K., González, I., and Déjean, S. (2009). *****Omics: an R package to unravel relationships between two omics data sets. Bioinformatics, 25(21):2855–2856. Meinshausen, N. and Bühlmann, P. (2006). High dimensional graphs and variable selection with the lasso. Annals of Statistic, 34(3):1436–1462. Newman, M. and Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review, E, 69:026113. Noack, A. and Rotta, R. (2009). Multi-level algorithms for modularity clustering. In SEA ’09: Proceedings of the 8th International Symposium on Experimental Algorithms, pages 257–268, Berlin, Heidelberg. Springer-Verlag. Rossi, F. and Villa-Vialaneix, N. (2011). Représentation d’un grand réseau à partir d’une classification hiérarchique de ses sommets. Journal de la Société Française de Statistique, 152(3):34–65. Schaeffer, S. (2007). Graph clustering. Computer Science Review, 1(1):27–64. Schäfer, J. and Strimmer, K. (2005a). An empirical bayes approach to inferring large-scale gene association networks. Bioinformatics, 21(6):754–764. NV2 | Mining co-expression networks 32/32
  • 65. Overview on co-expression network analysis Case study 1 Case study 2 References Schäfer, J. and Strimmer, K. (2005b). A shrinkage approach to large-scale covariance matrix estimation and implication for functional genomics. Statistical Applications in Genetics and Molecular Biology, 4:1–32. Verzelen, N. (2012). Minimax risks for sparse regressions: ultra-high-dimensional phenomenons. Electronic Journal of Statistics, 6:38–90. Villa-Vialaneix, N., Liaubet, L., Laurent, T., Cherel, P., Gamot, A., and San Cristobal, M. (2013). The structure of a gene co-expression network reveals biological functions underlying eQTLs. PLoS ONE, 8(4):e60045. von Luxburg, U. (2007). A tutorial on spectral clustering. Statistics and Computing, 17(4):395–416. Zanghi, H., Ambroise, C., and Miele, V. (2008). Fast online graph clustering via erdös-rényi mixture. Pattern Recognition, 41:3592–3599. NV2 | Mining co-expression networks 32/32