SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
Clustered graph, visualization, hierarchical
visualization
Nathalie Villa-Vialaneix
http://www.nathalievilla.org
SAMM (Université Paris 1)
2012/02/21 - Dagstuhl seminar 12081
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 1 / 4
Full vs simplified visualization
Framework: Static graph visualization.
Standard (FDP) approach: visualize the whole graph
aims at being aesthetic ⇒ tends to place the hubs in the center of the
figure (edges with uniform length); does not emphasize dense groups
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
Full vs simplified visualization
Framework: Static graph visualization.
Simplified approach: find communities and represent each one by a
glyph
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
Full vs simplified visualization
Framework: Static graph visualization.
Simplified approach: find communities and represent each one by a
glyph and investigate sub-structure by a hierarchical clustering
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
Is the clustering relevant / significant?
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
Is the clustering relevant / significant?
Possible answer: generate N random graphs with the same degree
distribution and compare the observed optimal modularity to the
optimal modularity distribution among the N random graphs
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
2 Iterate the clustering in each class in a hierarchical way.
When to stop the process? Is the clustering relevant /
significant?
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
2 Iterate the clustering in each class in a hierarchical way.
3 Visualize the graph (in a simplified way) at various levels of the
clustering hierarchy.
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
2 Iterate the clustering in each class in a hierarchical way.
3 Visualize the graph (in a simplified way) at various levels of the
clustering hierarchy.
How to have consistent representations? (a cluster and its
subclusters are approximately displayed at the same place) How to
take into account the space needed for a cluster of the last level of the
hierarchy in any representation (at any level)?
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
2 Iterate the clustering in each class in a hierarchical way.
3 Visualize the graph (in a simplified way) at various levels of the
clustering hierarchy.
How to have consistent representations? (a cluster and its
subclusters are approximately displayed at the same place) How to
take into account the space needed for a cluster of the last level of the
hierarchy in any representation (at any level)?
Possible solution: Recursively estimate the place needed for each
cluster in the hierarchy (by a circle encompassing the visualization of
all sub-clusters) ⇒ over-estimation
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Basic description
1 Search for communities: node clustering (e.g., modularity
optimization)
2 Iterate the clustering in each class in a hierarchical way.
3 Visualize the graph (in a simplified way) at various levels of the
clustering hierarchy.
Include information about the quality of the clustering in the
representation? (user warning)
Example: Color and weight edges
between clusters according to their
contribution to the modularity
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
Open issues
• Clustering: what is a meaningful clustering? When to stop the
hierarchy?
• Clustering hierarchy representation: how to anticipate, at a given
level, the place needed for the representation of the finest levels?
• Including estimation about the clustering quality in the
representation: at the node level (“quality” of the clustering for the
cluster? What does that mean?) or at the edge level (contribution to
the modularity between clusters?)
Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 4 / 4

Contenu connexe

Plus de tuxette

Autour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeanAutour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeantuxette
 
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...tuxette
 
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquesApprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquestuxette
 
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...tuxette
 
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...tuxette
 
Journal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation dataJournal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation datatuxette
 
Overfitting or overparametrization?
Overfitting or overparametrization?Overfitting or overparametrization?
Overfitting or overparametrization?tuxette
 
Selective inference and single-cell differential analysis
Selective inference and single-cell differential analysisSelective inference and single-cell differential analysis
Selective inference and single-cell differential analysistuxette
 
SOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricesSOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricestuxette
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Predictiontuxette
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelstuxette
 
Explanable models for time series with random forest
Explanable models for time series with random forestExplanable models for time series with random forest
Explanable models for time series with random foresttuxette
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICStuxette
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICStuxette
 
Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...tuxette
 
A review on structure learning in GNN
A review on structure learning in GNNA review on structure learning in GNN
A review on structure learning in GNNtuxette
 
La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...tuxette
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practicetuxette
 
La famille *down
La famille *downLa famille *down
La famille *downtuxette
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC datatuxette
 

Plus de tuxette (20)

Autour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWeanAutour des projets Idefics et MetaboWean
Autour des projets Idefics et MetaboWean
 
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
Rserve, renv, flask, Vue.js dans un docker pour intégrer des données omiques ...
 
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiquesApprentissage pour la biologie moléculaire et l’analyse de données omiques
Apprentissage pour la biologie moléculaire et l’analyse de données omiques
 
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
Quelques résultats préliminaires de l'évaluation de méthodes d'inférence de r...
 
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
Intégration de données omiques multi-échelles : méthodes à noyau et autres ap...
 
Journal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation dataJournal club: Validation of cluster analysis results on validation data
Journal club: Validation of cluster analysis results on validation data
 
Overfitting or overparametrization?
Overfitting or overparametrization?Overfitting or overparametrization?
Overfitting or overparametrization?
 
Selective inference and single-cell differential analysis
Selective inference and single-cell differential analysisSelective inference and single-cell differential analysis
Selective inference and single-cell differential analysis
 
SOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatricesSOMbrero : un package R pour les cartes auto-organisatrices
SOMbrero : un package R pour les cartes auto-organisatrices
 
Graph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype PredictionGraph Neural Network for Phenotype Prediction
Graph Neural Network for Phenotype Prediction
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction models
 
Explanable models for time series with random forest
Explanable models for time series with random forestExplanable models for time series with random forest
Explanable models for time series with random forest
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICS
 
Présentation du projet ASTERICS
Présentation du projet ASTERICSPrésentation du projet ASTERICS
Présentation du projet ASTERICS
 
Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...Kernel methods and variable selection for exploratory analysis and multi-omic...
Kernel methods and variable selection for exploratory analysis and multi-omic...
 
A review on structure learning in GNN
A review on structure learning in GNNA review on structure learning in GNN
A review on structure learning in GNN
 
La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...La statistique et le machine learning pour l'intégration de données de la bio...
La statistique et le machine learning pour l'intégration de données de la bio...
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practice
 
La famille *down
La famille *downLa famille *down
La famille *down
 
Differential analyses of structures in HiC data
Differential analyses of structures in HiC dataDifferential analyses of structures in HiC data
Differential analyses of structures in HiC data
 

Clustered graph, visualization, hierarchical visualization

  • 1. Clustered graph, visualization, hierarchical visualization Nathalie Villa-Vialaneix http://www.nathalievilla.org SAMM (Université Paris 1) 2012/02/21 - Dagstuhl seminar 12081 Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 1 / 4
  • 2. Full vs simplified visualization Framework: Static graph visualization. Standard (FDP) approach: visualize the whole graph aims at being aesthetic ⇒ tends to place the hubs in the center of the figure (edges with uniform length); does not emphasize dense groups Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
  • 3. Full vs simplified visualization Framework: Static graph visualization. Simplified approach: find communities and represent each one by a glyph Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
  • 4. Full vs simplified visualization Framework: Static graph visualization. Simplified approach: find communities and represent each one by a glyph and investigate sub-structure by a hierarchical clustering Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 2 / 4
  • 5. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 6. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) Is the clustering relevant / significant? Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 7. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) Is the clustering relevant / significant? Possible answer: generate N random graphs with the same degree distribution and compare the observed optimal modularity to the optimal modularity distribution among the N random graphs Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 8. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) 2 Iterate the clustering in each class in a hierarchical way. When to stop the process? Is the clustering relevant / significant? Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 9. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) 2 Iterate the clustering in each class in a hierarchical way. 3 Visualize the graph (in a simplified way) at various levels of the clustering hierarchy. Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 10. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) 2 Iterate the clustering in each class in a hierarchical way. 3 Visualize the graph (in a simplified way) at various levels of the clustering hierarchy. How to have consistent representations? (a cluster and its subclusters are approximately displayed at the same place) How to take into account the space needed for a cluster of the last level of the hierarchy in any representation (at any level)? Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 11. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) 2 Iterate the clustering in each class in a hierarchical way. 3 Visualize the graph (in a simplified way) at various levels of the clustering hierarchy. How to have consistent representations? (a cluster and its subclusters are approximately displayed at the same place) How to take into account the space needed for a cluster of the last level of the hierarchy in any representation (at any level)? Possible solution: Recursively estimate the place needed for each cluster in the hierarchy (by a circle encompassing the visualization of all sub-clusters) ⇒ over-estimation Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 12. Basic description 1 Search for communities: node clustering (e.g., modularity optimization) 2 Iterate the clustering in each class in a hierarchical way. 3 Visualize the graph (in a simplified way) at various levels of the clustering hierarchy. Include information about the quality of the clustering in the representation? (user warning) Example: Color and weight edges between clusters according to their contribution to the modularity Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 3 / 4
  • 13. Open issues • Clustering: what is a meaningful clustering? When to stop the hierarchy? • Clustering hierarchy representation: how to anticipate, at a given level, the place needed for the representation of the finest levels? • Including estimation about the clustering quality in the representation: at the node level (“quality” of the clustering for the cluster? What does that mean?) or at the edge level (contribution to the modularity between clusters?) Dagstuhl Seminar 12081 (2012/02/21) Graph visualization & clustering Nathalie Villa-Vialaneix 4 / 4