The discovery of novel gene regulatory processes improves the understanding of cell phenotypic responses to external stimuli for many biological applications, such as medicine, environment or biotechnologies. To this purpose, transcriptomic data are generated and analyzed from DNA microarrays or more recently RNAseq experiments. They consist in genetic expression level sequences obtained for all genes of a studied organism placed in different living conditions. From these data, gene regulation mechanisms can be recovered by revealing topological links encoded in graphs. In regulatory graphs, nodes correspond to genes. A link between two nodes is identified if a regulation relationship exists between the two corresponding genes. Such networks are called Gene Regulatory Networks (GRNs). Their construction as well as their analysis remain challenging despite the large number of available inference methods.
In this thesis, we propose to address this network inference problem with recently developed techniques pertaining to graph optimization. Given all the pairwise gene regulation information available, we propose to determine the presence of edges in the final GRN by adopting an energy optimization formulation integrating additional constraints. Either biological (information about gene interactions) or structural (information about node connectivity) a priori have been considered to restrict the space of possible solutions. Different priors lead to different properties of the global cost function, for which various optimization strategies, either discrete and continuous, can be applied. The post-processing network refinements we designed led to computational approaches named BRANE for "Biologically-Related A priori for Network Enhancement". For each of the proposed methods — BRANE Cut, BRANE Relax and BRANE Clust — our contributions are threefold: a priori-based formulation, design of the optimization strategy and validation (numerical and/or biological) on benchmark datasets from DREAM4 and DREAM5 challenges showing numerical improvement reaching 20%.
In a ramification of this thesis, we slide from graph inference to more generic data processing such as inverse problems. We notably invest in HOGMep, a Bayesian-based approach using a Variational Bayesian Approximation framework for its resolution. This approach allows to jointly perform reconstruction and clustering/segmentation tasks on multi-component data (for instance signals or images). Its performance in a color image deconvolution context demonstrates both quality of reconstruction and segmentation. A preliminary study in a medical data classification context linking genotype and phenotype yields promising results for forthcoming bioinformatics adaptations.
Formation of low mass protostars and their circumstellar disks
Reconstruction and Clustering with Graph optimization and Priors on Gene networks and Images
1. Reconstruction and Clustering with Graph optimization
and Priors on Gene Networks and Images
Aurélie Pirayre
PhD supervisors:
Frédérique BIDARD-MICHELOT IFP Energies nouvelles
Camille COUPRIE Facebook A.I. Research
Laurent DUVAL IFP Energies nouvelles
Jean-Christophe PESQUET CentraleSupélec
July 3th
, 2017
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 1 / 45
2. INTRODUCTION
An overview
Gene regulatory networks Signals and images
Reconstruction
Clustering
Our framework Variational Bayes variational
Method BRANE HOGMep
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 2 / 45
3. INTRODUCTION Context
Biological motivation
Second generation bio-fuel production
Cellulases from
Trichoderma reesei
Lignocellulosic
Biomass
Cellulose
Hemi-cellulose
Sugar
Ethanol
Bio-fuels
Enzymatic
Hydrolysis
Pre-treatment
Fermentation Mixing
with fuels
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 3 / 45
4. INTRODUCTION Context
Biological motivation
Second generation bio-fuel production
Cellulases from
Trichoderma reesei
Lignocellulosic
Biomass
Cellulose
Hemi-cellulose
Sugar
Ethanol
Bio-fuels
Enzymatic
Hydrolysis
Pre-treatment
Fermentation Mixing
with fuels
Improve Trichoderma reseei cellulase production
Understand cellulase production mechanisms
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 3 / 45
5. INTRODUCTION Context
Biological motivation
Second generation bio-fuel production
Cellulases from
Trichoderma reesei
Lignocellulosic
Biomass
Cellulose
Hemi-cellulose
Sugar
Ethanol
Bio-fuels
Enzymatic
Hydrolysis
Pre-treatment
Fermentation Mixing
with fuels
Improve Trichoderma reseei cellulase production
Understand cellulase production mechanisms
⇒ Use of Gene Regulatory Network (GRN)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 3 / 45
6. INTRODUCTION GRN overview
What is a Gene Regulatory Network (GRN)?
GRN: a graph G(V, E)
V = {v1, . . . , vG}: a set of G nodes (corresponding to genes)
E: a set of edges (corresponding to interactions between genes)
v1
v2
v3
vX
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 4 / 45
7. INTRODUCTION GRN overview
What is a Gene Regulatory Network (GRN)?
GRN: a graph G(V, E)
V = {v1, . . . , vG}: a set of G nodes (corresponding to genes)
E: a set of edges (corresponding to interactions between genes)
A gene regulatory network...
v1
v2
v3
vX
... models biological gene regulatory mechanisms
DNA
Gene 1 Gene 2 Gene 3 Gene X
mRNA
Protein
TF1 TF2 TF3 TFX⊕
⊕⊕
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 4 / 45
8. INTRODUCTION Data overview
What biological data can be used?
For a given experimental condition, transcriptomic data answer to:
which genes are expressed? in which amount?
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 5 / 45
9. INTRODUCTION Data overview
What biological data can be used?
For a given experimental condition, transcriptomic data answer to:
which genes are expressed? in which amount?
How to obtain transcriptomic data?
Microarray and RNAseq experiments
What do transcriptomic data look like?
Gene expression data (GED): G genes × S conditions
M =
S conditions
−0.948 −0.013 . . . −1.308 −0.977
0.737 0.619 . . . −0.141 −0.803
−0.253 −0.175 . . . −0.859 −0.595
3.747 1.115 . . . −0.418 −0.084
1.383 1.184 . . . −0.493 −0.562
Ggenes
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 5 / 45
10. INTRODUCTION Links between data and GRNs
How to use GED to produce a GRN ?
From gene expression data... M =
sj
.
.
.
gi · · · mi,j
,
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 6 / 45
11. INTRODUCTION Links between data and GRNs
How to use GED to produce a GRN ?
From gene expression data... M =
sj
.
.
.
gi · · · mi,j
,
V = {v1, · · · , vG} a set of vertices (genes) and E a set of edges
leading to a complete
graph...
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 6 / 45
12. INTRODUCTION Links between data and GRNs
How to use GED to produce a GRN ?
From gene expression data... M =
sj
.
.
.
gi · · · mi,j
,
V = {v1, · · · , vG} a set of vertices (genes) and E a set of edges
Each edge ei,j is weighted by ωi,j
leading to a complete weighted
graph... W =
gj
.
.
.
gi · · · ωi,j
,
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 6 / 45
13. INTRODUCTION Links between data and GRNs
How to use GED to produce a GRN ?
From gene expression data... M =
sj
.
.
.
gi · · · mi,j
,
V = {v1, · · · , vG} a set of vertices (genes) and E a set of edges
Each edge ei,j is weighted by ωi,j
leading to a complete weighted
graph... W =
gj
.
.
.
gi · · · ωi,j
,
We look for a subset of edges E∗
reflecting regulatory links between genes
to infer a meaningful gene
network. Wi,j =
1 if ei,j ∈ E∗
0 otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 6 / 45
14. INTRODUCTION
Difficulties in GRN inference
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 7 / 45
15. INTRODUCTION
Difficulties in GRN inference
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 7 / 45
16. INTRODUCTION
Difficulties in GRN inference
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 7 / 45
17. INTRODUCTION
Difficulties in GRN inference
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 7 / 45
18. INTRODUCTION
Our BRANE strategy
What is the subset of edges E∗ reflecting real regulatory links between genes?
⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G?
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 8 / 45
19. INTRODUCTION
Our BRANE strategy
What is the subset of edges E∗ reflecting real regulatory links between genes?
⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G?
We note xi,j the binary label of edge presence: xi,j =
1 if ei,j ∈ E∗,
0 otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 8 / 45
20. INTRODUCTION
Our BRANE strategy
What is the subset of edges E∗ reflecting real regulatory links between genes?
⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G?
We note xi,j the binary label of edge presence: xi,j =
1 if ei,j ∈ E∗,
0 otherwise.
Classical thresholding: x∗
i,j =
1 if ωi,j > λ,
0 otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 8 / 45
21. INTRODUCTION
Our BRANE strategy
What is the subset of edges E∗ reflecting real regulatory links between genes?
⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G?
We note xi,j the binary label of edge presence: xi,j =
1 if ei,j ∈ E∗,
0 otherwise.
Classical thresholding: x∗
i,j =
1 if ωi,j > λ,
0 otherwise.
Given by a cost function for given weights ω:
maximize
x∈{0,1}E
(i,j)∈V2
ωi,j xi,j + λ(1 − xi,j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 8 / 45
22. INTRODUCTION
Our BRANE strategy
What is the subset of edges E∗ reflecting real regulatory links between genes?
⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G?
We note xi,j the binary label of edge presence: xi,j =
1 if ei,j ∈ E∗,
0 otherwise.
Classical thresholding: x∗
i,j =
1 if ωi,j > λ,
0 otherwise.
Given by a cost function for given weights ω:
maximize
x∈{0,1}E
(i,j)∈V2
ωi,j xi,j + λ(1 − xi,j) ⇔ minimize
x∈{0,1}E
(i,j)∈V2
ωi,j (1 − xi,j) + λ xi,j
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 8 / 45
23. INTRODUCTION
Our BRANE strategy
BRANE: Biologically Related A priori Network Enhancement
Extend classical thresholding
Integrate biological priors into the functional to be optimized
Enforce modular networks
Additional knowledge:
Transcription factors (TFs): regulators
Non transcription factors (TFs): targets
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 9 / 45
24. INTRODUCTION
Our BRANE strategy
BRANE: Biologically Related A priori Network Enhancement
Extend classical thresholding
Integrate biological priors into the functional to be optimized
Enforce modular networks
Additional knowledge:
Transcription factors (TFs): regulators
Non transcription factors (TFs): targets
Method a priori Formulation Algorithm
Inference
BRANE Cut Gene co-regulatiton Discrete Maximal flow
BRANE Relax TF-connectivity Continuous Proximal method
Joint inference
and clustering
BRANE Clust Gene grouping Mixed Alternating scheme
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 9 / 45
25. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS
A discrete method: BRANE Cut
We look for a discrete solution for x ⇔ x ∈ {0, 1}E
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 10 / 45
26. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 11 / 45
27. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Modular network: favors links between TFs and TFs
λi,j =
2η if (i, j) /∈ T2
2λTF if (i, j) ∈ T2
λTF + λTF otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 11 / 45
28. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Modular network: favors links between TFs and TFs
λi,j =
2η if (i, j) /∈ T2
2λTF if (i, j) ∈ T2
λTF + λTF otherwise.
with:
T: the set of TF indices
η > max {ωi,j | (i, j) ∈ V2
}
λTF > λTF
A linear relation is sufficient: λTF = βλTF with β = |V|
|T |
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 11 / 45
29. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 12 / 45
30. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Gene co-regulation: favors edge coupling
Ψ(xi,j) =
(j,j )∈T2
i∈VT
ρi,j,j |xi,j − xi,j |
ρi,j,j : co-regulation probability
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 12 / 45
31. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS A priori
A priori: modular structure and gene co-regulation
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Gene co-regulation: favors edge coupling
Ψ(xi,j) =
(j,j )∈T2
i∈VT
ρi,j,j |xi,j − xi,j |
ρi,j,j : co-regulation probability
with
ρi,j,j =
k∈V(T ∪{i})
1(min{ωj,j , ωj,k, ωj ,k} > γ)
|VT |−1
γ: the (|V| − 1)th
of the normalized weights ω
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 12 / 45
32. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
33. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
34. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
35. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
36. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
8
5
10
5
5
1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
37. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
38. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
∞
∞
∞
∞
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
39. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
40. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
3
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
41. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s t
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
3
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
42. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s = 1 t = 0
x1,2
x1,3
x1,4
x2,3
x2.4
x3,4
v1
v2
v3
v4
8
5
10
5
5
1
3
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
43. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s = 1 t = 0
1
1
0
1
0
0
v1
v2
v3
v4
8
5
10
5
5
1
3
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
44. BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS Formulation and resolution
A maximal flow for a minimum cut formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j|xi,j − 1| + λi,j xi,j +
i∈VT
(j,j )∈T2, j >j
ρi,j,j |xi,j − xi,j |
s = 1 t = 0
1
1
0
1
0
0
v1
v2
v3
v4
8
5
10
5
5
1
3
∞
∞
∞
∞
η = 6
λTF = 3
λTF = 1
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 13 / 45
45. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM
A continuous method: BRANE Relax
We look for a continuous solution for x ⇔ x ∈ [0, 1]E
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 14 / 45
46. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM A priori
A priori: modular structure and TF connectivity
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 15 / 45
47. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM A priori
A priori: modular structure and TF connectivity
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Modular network: favors links between TFs and TFs
λi,j =
2η if (i, j) /∈ T2
2λTF if (i, j) ∈ T2
λTF + λTF otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 15 / 45
48. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM A priori
A priori: modular structure and TF connectivity
minimize
x∈{0,1}E
(i,j)∈V2
ωi,jϕ(xi,j − 1) + λi,jϕ(xi,j) + µΨ(xi,j)
Modular network: favors links between TFs and TFs
λi,j =
2η if (i, j) /∈ T2
2λTF if (i, j) ∈ T2
λTF + λTF otherwise.
TF connectivity: constraint TF node degree
Ψ(xi,j) =
i∈VT
φ
j∈V
xi,j − d
φ(·): a convex distance function with
β-Lipschitz continuous gradient
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 15 / 45
49. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Formulation
A convex relaxation for a continuous formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j(1 − xi,j) + λi,j xi,j + µ
i∈VT
φ
j∈V
xi,j − d
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 16 / 45
50. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Formulation
A convex relaxation for a continuous formulation
minimize
x∈{0,1}E
(i,j)∈V2
j>i
ωi,j(1 − xi,j) + λi,j xi,j + µ
i∈VT
φ
j∈V
xi,j − d
Relaxation and vectorization:
minimize
x∈[0,1]E
E
l=1
ωl(1 − xl) + λl xl + µ
P
i=1
φ
E
k=1
Ωi,kxk − d ,
where Ω ∈ {0, 1}P×E
encodes the degree of the P TFs nodes in the complete graph.
Ωi,j =
1 if j is the index of an edge linking the TF node vi in the complete graph,
0 otherwise.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 16 / 45
51. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Formulation
Distance function in BRANE Relax
minimize
x∈[0,1]E
E
l=1
ωl(1 − xl) + λl xl + µ
P
i=1
φ
E
k=1
Ωi,kxk − d
Choice of φ: node degree distance function, with respect to d
zi =
E
k=1
Ωi,kxk − d
squared 2 norm: φ(z) = ||z||2
Huber function: φ(zi) =
z2
i if |zi| ≤ δ
2δ(|zi| − 1
2 δ) otherwise
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 17 / 45
52. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Resolution
Optimization strategy via proximal methods
Splitting
minimize
x∈RE
ω (1E − x) + λ x + µΦ(Ωx − d)
f2
+ ι[0,1]E (x)
f1
f1 ∈ Γ0(RE): proper, convex, and lower semi-continuous
f2: convex, differentiable with an L−Lipschitz continuous gradient
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 18 / 45
53. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Resolution
Optimization strategy via proximal methods
Splitting
minimize
x∈RE
ω (1E − x) + λ x + µΦ(Ωx − d)
f2
+ ι[0,1]E (x)
f1
f1 ∈ Γ0(RE): proper, convex, and lower semi-continuous
f2: convex, differentiable with an L−Lipschitz continuous gradient
Algorithm 1: Forward-Backward
Fix x0 ∈ RE
for k = 0, 1, . . . do
Select the index jk ∈ {1, . . . , J} of a block of variables
z
(jk)
k = x
(jk)
k − γkA−1
jk jk f2(xk)
x
(jk)
k+1 = proxγk
−1,Ajk
f
(jk)
1
(z
(jk)
k )
x
(¯jk)
k+1 = x
(¯jk)
k , ¯jk = {1, . . . , J}{jk}
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 18 / 45
54. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Resolution
Optimization strategy via proximal methods
Splitting
minimize
x∈RE
ω (1E − x) + λ x + µΦ(Ωx − d)
f2
+ ι[0,1]E (x)
f1
f1 ∈ Γ0(RE): proper, convex, and lower semi-continuous
f2: convex, differentiable with an L−Lipschitz continuous gradient
Algorithm 2: Preconditioned Forward-Backward
Fix x0 ∈ RE
for k = 0, 1, . . . do
Select the index jk ∈ {1, . . . , J} of a block of variables
z
(jk)
k = x
(jk)
k − γkA−1
jk jk f2(xk)
x
(jk)
k+1 = proxγk
−1,Ajk
,f
(jk)
1
(z
(jk)
k )
x
(¯jk)
k+1 = x
(¯jk)
k , ¯jk = {1, . . . , J}{jk}
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 18 / 45
55. BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM Resolution
Optimization strategy via proximal methods
Splitting
minimize
x∈RE
ω (1E − x) + λ x + µΦ(Ωx − d)
f2
+ ι[0,1]E (x)
f1
f1 ∈ Γ0(RE): proper, convex, and lower semi-continuous
f2: convex, differentiable with an L−Lipschitz continuous gradient
Algorithm 3: Block Coordinate + Preconditioned Forward-Backward
Fix x0 ∈ RE
for k = 0, 1, . . . do
Select the index jk ∈ {1, . . . , J} of a block of variables
z
(jk)
k = x
(jk)
k − γkA−1
jk jk f2(xk)
x
(jk)
k+1 = proxγk
−1,Ajk
,f
(jk)
1
(z
(jk)
k )
x
(¯jk)
k+1 = x
(¯jk)
k , ¯jk = {1, . . . , J}{jk}
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 18 / 45
56. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING
A mixed method: BRANE Clust
We look for a discrete solution for x and a continuous one for y
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 19 / 45
57. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING A priori
A priori: gene grouping and modular structure
maximize
x∈{0,1}E
y∈NG
(i,j)∈V2
f(yi, yj)ωi,jxi,j + λ(1 − xi,j) + Ψ(yi)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 20 / 45
58. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING A priori
A priori: gene grouping and modular structure
maximize
x∈{0,1}E
y∈NG
(i,j)∈V2
f(yi, yj)ωi,jxi,j + λ(1 − xi,j) + Ψ(yi)
Clustering-assisted inference
Node labeling y ∈ NG
Weight ωi,j reduction if nodes vi and vj belong to distinct clusters
Cost function:
f(yi, yj) =
β − 1(yi = yj)
β
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 20 / 45
59. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING A priori
A priori: gene grouping and modular structure
maximize
x∈{0,1}E
y∈NG
(i,j)∈V2
f(yi, yj)ωi,jxi,j + λ(1 − xi,j) + Ψ(yi)
Clustering-assisted inference
Node labeling y ∈ NG
Weight ωi,j reduction if nodes vi and vj belong to distinct clusters
Cost function:
f(yi, yj) =
β − 1(yi = yj)
β
TF-driven clustering promoting modular structure
Ψ(yi) =
i∈V
j∈T
µi,j1(yi = j)
µi,j: modular structure controlling
parameter
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 20 / 45
60. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Alternating optimization strategy
maximize
x∈{0,1}n
y∈NG
(i,j)∈V2
β−1(yi=yj)
β ωi,jxi,j + λ(1 − xi,j) +
i∈V
j∈T
µi,j1(yi = j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 21 / 45
61. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Alternating optimization strategy
Alternating optimization
maximize
x∈{0,1}n
y∈NG
(i,j)∈V2
β−1(yi=yj)
β ωi,jxi,j + λ(1 − xi,j) +
i∈V
j∈T
µi,j1(yi = j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 21 / 45
62. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Alternating optimization strategy
Alternating optimization
maximize
x∈{0,1}n
y∈NG
(i,j)∈V2
β−1(yi=yj)
β ωi,jxi,j + λ(1 − xi,j) +
i∈V
j∈T
µi,j1(yi = j)
At y fixed and x variable:
maximize
x∈{0,1}n
(i,j)∈V2
β − 1(yi = yj)
β
ωi,j xi,j + λ(1 − xi,j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 21 / 45
63. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Alternating optimization strategy
Alternating optimization
maximize
x∈{0,1}n
y∈NG
(i,j)∈V2
β−1(yi=yj)
β ωi,jxi,j + λ(1 − xi,j) +
i∈V
j∈T
µi,j1(yi = j)
At y fixed and x variable:
maximize
x∈{0,1}n
(i,j)∈V2
β − 1(yi = yj)
β
ωi,j xi,j + λ(1 − xi,j)
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j 1(yi = j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 21 / 45
64. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Alternating optimization strategy
Alternating optimization
maximize
x∈{0,1}n
y∈NG
(i,j)∈V2
β−1(yi=yj)
β ωi,jxi,j + λ(1 − xi,j) +
i∈V
j∈T
µi,j1(yi = j)
At y fixed and x variable:
maximize
x∈{0,1}n
(i,j)∈V2
β − 1(yi = yj)
β
ωi,j xi,j + λ(1 − xi,j)
Explicit form: x∗
i,j =
1 if ωi,j > λβ
β−1(yi=yj)
0 otherwise.
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j 1(yi = j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 21 / 45
65. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j1(yi = j)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 22 / 45
66. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j1(yi = j) (NP)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 22 / 45
67. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j1(yi = j) (NP)
discrete problem ⇒ quadratic relaxation
T-class problem ⇒ T binary sub-problems
label restriction to T: {s(1)
, . . . , s(T)
} such that s
(t)
j = 1 if j = t and 0 otherwise.
Y = {y(1)
, . . . , y(T)
} such that y(t)
∈ [0, 1]G
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 22 / 45
68. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
At x fixed and y variable:
minimize
y∈NG
(i,j)∈V2
ωi,j xi,j
β
1(yi = yj) +
i∈V, j∈T
µi,j1(yi = j) (NP)
discrete problem ⇒ quadratic relaxation
T-class problem ⇒ T binary sub-problems
label restriction to T: {s(1)
, . . . , s(T)
} such that s
(t)
j = 1 if j = t and 0 otherwise.
Y = {y(1)
, . . . , y(T)
} such that y(t)
∈ [0, 1]G
Problem re-expressed as:
minimize
Y
T
t=1
(i,j)∈V2
ωi,j xi,j
β
y
(t)
i − y
(t)
j
2
+
i∈V, j∈T
µi,j y
(t)
i − s
(t)
j
2
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 22 / 45
69. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
minimize
Y
T
t=1
(i,j)∈V2
ωi,j xi,j
β
y
(t)
i − y
(t)
j
2
+
i∈V, j∈T
µi,j y
(t)
i − s
(t)
j
2
This is the Combinatorial Dirichlet problem
Minimization via solving a linear system of equations [Grady, 2006]
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 23 / 45
70. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Clustering optimization strategy
minimize
Y
T
t=1
(i,j)∈V2
ωi,j xi,j
β
y
(t)
i − y
(t)
j
2
+
i∈V, j∈T
µi,j y
(t)
i − s
(t)
j
2
This is the Combinatorial Dirichlet problem
Minimization via solving a linear system of equations [Grady, 2006]
Final labeling: node i is assigned to label t for which y
(t)
i is maximal
y∗
i = argmax
t∈T
y
(t)
i
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 23 / 45
71. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
12310
7
6
5
5
9
3
10
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
72. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
1
23
12310
7
6
5
5
9
3
10
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
73. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
1
23
12310
7
6
5
5
9
3
10
0.95
0.46
0.010.03
0.35
1
00
y(1)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
74. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
1
23
12310
7
6
5
5
9
3
10
0.95
0.46
0.010.03
0.35
1
00
0.01
0.28
0.970.02
0.19
0
10
y(1) y(2)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
75. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
1
23
12310
7
6
5
5
9
3
10
0.95
0.46
0.010.03
0.35
1
00
0.01
0.28
0.970.02
0.19
0
10
0.03
0.26
0.020.95
0.46
0
01
y(1) y(2) y(3)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
76. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
Random walker in graphs
We want to obtain the optimal labeling y∗ based on
a weighted graph ⇒ Random Walker algorithm
y1
y4
y2y3
y5
1
23
12310
7
6
5
5
9
3
10
0.95
0.46
0.95
0.46
0.010.03
0.35
1
00
0.01
0.28
0.970.970.02
0.19
0
10
0.03
0.26
0.020.95
0.46
0.95
0.46
0
01
1
1
23
3
y(1) y(2) y(3) y∗ = {1, 1, 2, 3, 3}
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 24 / 45
77. BRANE Clust — NETWORK INFERENCE WITH CLUSTERING Formulation and resolution
hard- vs soft- clustering in BRANE Clust
minimize
Y
T
t=1 (i,j)∈V2
ωi,j xi,j
β y
(t)
i − y
(t)
j
2
+
i∈V, j∈T
µi,j y
(t)
i − s
(t)
j
2
hard-clustering soft-clustering
# clusters = # TF # clusters ≤ # TF
µi,j =
→ ∞ if i = j
0 otherwise.
µi,j =
α if i = j
α1(ωi,j > τ) if i = j and i ∈ T
ωi,j1(ωi,j > τ) if i = j and i /∈ T
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 25 / 45
78. BRANE RESULTS
It’s time to test the BRANE philosophy...
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 26 / 45
79. BRANE RESULTS Methodology
Numerical evaluation strategy
Gene expression data
DREAM4 or DREAM5 challenges
Gene-gene interaction scores
(ND)-CLR or (ND)-GENIE3
Classical thresholding BRANE edge selection
P = |TP|
|TP|+|FP|
R = |TP|
|TP|+|FN|
Reference
Precision-Recall curve
BRANE
Precision-Recall curve
AUPR
Ref
AUPR
BRANE
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 27 / 45
80. BRANE RESULTS DREAM4 synthetic results
BRANE performance on in-silico data
DREAM4 [Marbach et al., 2010]
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 28 / 45
82. BRANE RESULTS DREAM4 synthetic results
BRANE performance on in-silico data
DREAM4 [Marbach et al., 2010]
CLR GENIE3 ND-CLR ND-GENIE3
BRANE Cut 10.9 % 8.4 % 5.9 % 7.2 %
BRANE Relax 7.8 % 8.5 % 3.1 % 7.3 %
BRANE Clust 12.2 % 12.8 % 2.5 % 8.1 %
BRANE approaches validated on small synthetic data
BRANE methodologies outperform classical thresholding
First and second best performers: BRANE Clust and BRANE Cut
⇒ Validation on more realistic synthetic data
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 29 / 45
83. BRANE RESULTS DREAM5 synthetic results
BRANE performance on in-silico data
DREAM5 [Marbach et al., 2012]
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 30 / 45
84. BRANE RESULTS DREAM5 synthetic results
BRANE performance on in-silico data
DREAM5 [Marbach et al., 2012]
AUPR Gain AUPR Gain
CLR 0.252 GENIE3 0.283
BRANE Cut 0.268 6.3 % BRANE Cut 0.295 4.2 %
BRANE Relax 0.272 7.9 % BRANE Relax 0.294 3.8 %
BRANE Clust 0.301 19.4 % BRANE Clust 0.336 18.6 %
AUPR Gain AUPR Gain
ND-CLR 0.272 ND-GENIE3 0.313
BRANE Cut 0.277 1.9 % BRANE Cut 0.317 1.1 %
BRANE Relax 0.274 0.6 % BRANE Relax 0.314 0.3 %
BRANE Clust 0.289 6.2 % BRANE Clust 0.345 10.2 %
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 30 / 45
85. BRANE RESULTS DREAM5 synthetic results
BRANE performance on in-silico data
DREAM5 [Marbach et al., 2012]
AUPR Gain AUPR Gain
CLR 0.252 GENIE3 0.283
BRANE Cut 0.268 6.3 % BRANE Cut 0.295 4.2 %
BRANE Relax 0.272 7.9 % BRANE Relax 0.294 3.8 %
BRANE Clust 0.301 19.4 % BRANE Clust 0.336 18.6 %
AUPR Gain AUPR Gain
ND-CLR 0.272 ND-GENIE3 0.313
BRANE Cut 0.277 1.9 % BRANE Cut 0.317 1.1 %
BRANE Relax 0.274 0.6 % BRANE Relax 0.314 0.3 %
BRANE Clust 0.289 6.2 % BRANE Clust 0.345 10.2 %
BRANE approaches validated on realistic synthetic data and outperform
classical thresholding
First and second best performer: BRANE Clust and BRANE Cut
⇒ Validation of BRANE Cut and BRANE Clust on real data
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 30 / 45
86. BRANE RESULTS Escherichia coli results
BRANE Clust performance on real data
Escherichia coli dataset
AUPR Gain AUPR Gain
CLR 0.0378
5.5 %
GENIE3 0.0488
9.8 %
BRANE Clust 0.0399 BRANE Clust 0.0536
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 31 / 45
87. BRANE RESULTS Escherichia coli results
BRANE Clust performance on real data
Escherichia coli dataset
AUPR Gain AUPR Gain
CLR 0.0378
5.5 %
GENIE3 0.0488
9.8 %
BRANE Clust 0.0399 BRANE Clust 0.0536
BRANE Clust predictions using GENIE3 weights
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 31 / 45
88. BRANE RESULTS Escherichia coli results
BRANE Clust performance on real data
Escherichia coli dataset
AUPR Gain AUPR Gain
CLR 0.0378
5.5 %
GENIE3 0.0488
9.8 %
BRANE Clust 0.0399 BRANE Clust 0.0536
BRANE Clust predictions using GENIE3 weights
BRANE Clust validated on real dataset
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 31 / 45
89. BRANE RESULTS Trichoderma results results
BRANE Cut in the real life
GRN of T. reesei obtained with BRANE Cut using CLR weights
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 32 / 45
90. BRANE RESULTS Trichoderma results results
BRANE Cut in the real life
GRN of T. reesei obtained with BRANE Cut using CLR weights
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 32 / 45
91. BRANE RESULTS Trichoderma results results
BRANE Cut in the real life
GRN of T. reesei obtained with BRANE Cut using CLR weights
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 32 / 45
92. BRANE RESULTS Trichoderma results results
BRANE Cut in the real life
GRN of T. reesei obtained with BRANE Cut using CLR weights
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 32 / 45
93. CONCLUSIONS
It’s time to conclude...
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 33 / 45
94. CONCLUSIONS
Conclusions
Inference: BRANE Cut and BRANE Relax
Joint inference and clustering: BRANE Clust
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 34 / 45
95. CONCLUSIONS
Conclusions
Inference: BRANE Cut and BRANE Relax
Joint inference and clustering: BRANE Clust
The BRA- in BRANE: integrating biological a priori constrains the search of
relevant edges
The -NE in BRANE: proposed graph inference methods lead to promising
results and outperforms state-of-the-art methods
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 34 / 45
96. CONCLUSIONS
Conclusions
Inference: BRANE Cut and BRANE Relax
Joint inference and clustering: BRANE Clust
The BRA- in BRANE: integrating biological a priori constrains the search of
relevant edges
The -NE in BRANE: proposed graph inference methods lead to promising
results and outperforms state-of-the-art methods
⇒ Average improvements around 10 %
⇒ Biological relevant inferred networks
⇒ Negligible time complexity with respect to graph weight computation
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 34 / 45
97. CONCLUSIONS
Conclusions
Inference: BRANE Cut and BRANE Relax
Joint inference and clustering: BRANE Clust
The BRA- in BRANE: integrating biological a priori constrains the search of
relevant edges
The -NE in BRANE: proposed graph inference methods lead to promising
results and outperforms state-of-the-art methods
⇒ Average improvements around 10 %
⇒ Biological relevant inferred networks
⇒ Negligible time complexity with respect to graph weight computation
Biological a priori relevance for network inference
BRANE Clust BRANE Cut BRANE Relax
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 34 / 45
104. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Gene-gene scores
Extend TF-based a priori for
GRN, clustering
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
105. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Gene-gene scores
Normalized gene
expression data
Extend TF-based a priori for
GRN, clustering
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
106. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Normalized gene
expression data
Gene-gene scores
Extend TF-based a priori for
GRN, clustering , graph
weighting,
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
107. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Gene-gene scores
Normalized gene
expression data
Extend TF-based a priori for
GRN, clustering , graph
weighting, data normalization...
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
108. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Gene-gene scores
Normalized gene
expression data
Extend TF-based a priori for
GRN, clustering , graph
weighting, data normalization...
Integrate transcriptomic data
treatment
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
109. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Gene-gene scores
Normalized gene
expression data
Extend TF-based a priori for
GRN, clustering , graph
weighting, data normalization...
Integrate transcriptomic data
treatment
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
110. CONCLUSIONS
Perspectives
From biological graphs...
TF-based a priori
GRN Clustering
Normalized gene
expression data
Extend TF-based a priori for
GRN, clustering , graph
weighting, data normalization...
Integrate transcriptomic data
treatment
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
111. CONCLUSIONS
Perspectives
From biological graphs...
Omics data
TF-based a priori
GRN Clustering
Omics data
Extend TF-based a priori for
GRN, clustering , graph
weighting, data normalization...
Integrate transcriptomic data
treatment
Integrate a priori, omics- data
and treatments
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 35 / 45
112. CONCLUSIONS
Perspectives
... to general graphs
BRANE-like applications for non biological graphs
Coupled edge inference: social networks
Node-degree constraint: telecommunication
Coupling between inference and clustering: temperature networks, brain
networks
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 36 / 45
113. CONCLUSIONS
Perspectives
... to general graphs
BRANE-like applications for non biological graphs
Coupled edge inference: social networks
Node-degree constraint: telecommunication
Coupling between inference and clustering: temperature networks, brain
networks
Topological constraint in graph inference
Expected node degree distribution
Scale-free networks: webgraphs, financial networks, social networks...
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 36 / 45
114. CONCLUSIONS
Perspectives
... to general graphs
BRANE-like applications for non biological graphs
Coupled edge inference: social networks
Node-degree constraint: telecommunication
Coupling between inference and clustering: temperature networks, brain
networks
Topological constraint in graph inference
Expected node degree distribution
Scale-free networks: webgraphs, financial networks, social networks...
Laplacian-based approach for graph comparison
Spectral view of the graph
Modularity
Local and topological-based criteria
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 36 / 45
115. CONCLUSIONS
Publications
Journal papers — published
D. Poggi-Parodi, F. Bidard, A. Pirayre, T. Portnoy. C. Blugeon, B. Seiboth, C.P. Kubicek, S. Le Crom and A. Margeot
Kinetic transcriptome reveals an essentially intact induction system in a cellu- lase hyper-producer Trichoderma reesei strain
Biotechnology for Biofuels, 7:173, Dec. 2014
A. Pirayre, C. Couprie, F. Bidard, L. Duval, and J.-C. Pesquet.
BRANE Cut: biologically-related a priori network enhancement with graph cuts for gene regulatory network inference
BMC Bioinformatics, 16(1):369, Dec. 2015.
A. Pirayre, C. Couprie, L. Duval, and J.-C. Pesquet.
BRANE Clust: Cluster-Assisted Gene Regulatory Network Inference Refinement
IEEE/ACM Transactions on Computational Biology and Bioinformatics, Mar. 2017.
Journal papers — in preparation
Y. Zheng, A. Pirayre, L. Duval and J.-C. Pesquet
Joint restoration/segmentation of multicomponent images with variational Bayes and higher-order graphical models (HOGMep)
To be submitted to IEEE Transactions on Computational Imaging, Jul. 2017.
A. Pirayre, D. Ivanoff, L. Duval, C. Blugeon, C. Firmo, S. Perrin, E. Jourdier, A. Margeot and F. Bidard
Growing Trichoderma reesei on a mix of carbon sources suggests links between development and cellulase production
To be submitted to BMC Genomics, Jul. 2017.
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 37 / 45
117. HOGMep
HOGMep for non-blind inverse problems
y = H x + n
x: unknown signal to be recovered
H: known degradation operator
n: additive noise
y: observations
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 39 / 45
118. HOGMep
HOGMep — Bayesian framework
Estimation of x from the knowledge of the posterior pdf p(x|y)
p(x|y) =
p(x)p(y|x)
p(y)
p(x): the marginal pdf encoding information about x
p(y|x): the likelihood highlighting the uncertainty in y
p(y): the marginal pdf of y
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 40 / 45
119. HOGMep
HOGMep — Variational Bayesian Approximation
q(x): approximation of p(x|y)
qopt
(x) = argmin
q(x)
KL(q(x) || p(x | y))
Separable distribution:
q(x) =
J
j=1
qj(xj),
with
qopt
j (xj) ∝ exp ln p(y, x) i=j qi(xi)
Estimation of the distributions in an iterative manner
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 41 / 45
120. HOGMep
HOGMep — Bayesian formulation
Likelihood prior: p(y | x, γ) = N(Hx, γ−1
I)
p(z): prior on hidden variables z ⇒
generalized Potts model
p(x|z): prior on x conditionally to z ⇒ MEP
distribution restricted to Gaussian Scale
Mixtures GSM(m, Ω, β)
Hyperpriors: p(γ), p(ml) and p(Ωl)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 42 / 45
121. HOGMep
HOGMep — Bayesian formulation
Likelihood prior: p(y | x, γ) = N(Hx, γ−1
I)
p(z): prior on hidden variables z ⇒
generalized Potts model
p(x|z): prior on x conditionally to z ⇒ MEP
distribution restricted to Gaussian Scale
Mixtures GSM(m, Ω, β)
Hyperpriors: p(γ), p(ml) and p(Ωl)
Joint posterior distribution
p(y | x, γ)
N
i=1
p(xi | zi, ui, m, Ω)p(ui | β) p(z)p(γ)
L
l=1
p(ml)p(Ωl)
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 42 / 45
122. HOGMep
HOGMep — VBA strategy
Separable form for the approximation:
q(Θ) =
N
i=1
(q(xi, zi)q(ui)) q(γ)
L
l=1
(q(ml)q(Ωl))
with
q(xi|zi = l) = N(ηi,l, Ξi,l),
q(zi = l) = πi,l,
q(ml) = N(µl, Λl),
q(Ωl) = W(Γl, νl),
q(γ) = G(a, b).
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 43 / 45
123. HOGMep
HOGMep — Some restoration results
Restoration
Original Degraded DR 3MG VB-MIG HOGMep
SNR 6.655 9.467 6.744 12.737 12.895
Original Degraded DR 3MG VB-MIG HOGMep
SNR 19.659 18.728 17.188 15.486 19.555
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 44 / 45
124. HOGMep
HOGMep — Some segmentation results
Segmentation
ICM ICM SC SC VB-MIG HOGMep
ICM ICM SC SC VB-MIG HOGMep
July 3th
, 2017 Recons. and Clust. with Graph Optim. and Priors on GRN and Images 45 / 45