Visual Network Analysis
Tommaso Venturini
tommaso.venturini@sciences-po.org
Today’s menu
1. The complexity of complex networks
2. Beheading the complexity of networks
3. Visual network analysis with Gephi
Deploying
innovation
networks
Part I:
The complexity of complex networks
The bad news:
networks are complex
The power law
(Pareto’s law)
characteristic
scale distribution
scale-free
distribution
The power law Barabási, Albert-László (2002)
Linked: The New Science of Networks
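A minimal sketch of how a power-law-like degree distribution can emerge, assuming the preferential-attachment mechanism popularized by Barabási (new nodes link to existing nodes with a probability proportional to their degree). The function name, the network size and the seed are illustrative assumptions, not part of the slides.

```python
import random
from collections import Counter

def preferential_attachment(n_nodes, seed=0):
    """Grow a network one node at a time; each new node links to one existing node."""
    rng = random.Random(seed)
    edges = [(0, 1)]
    # Every node appears in `endpoints` once per link it takes part in,
    # so a uniform draw from this list picks nodes proportionally to their degree.
    endpoints = [0, 1]
    for new in range(2, n_nodes):
        target = rng.choice(endpoints)
        edges.append((new, target))
        endpoints.extend([new, target])
    return edges

edges = preferential_attachment(2000)
degree = Counter()
for a, b in edges:
    degree[a] += 1
    degree[b] += 1

values = sorted(degree.values())
median_degree = values[len(values) // 2]
max_degree = values[-1]
print("median degree:", median_degree, "max degree:", max_degree)
```

Most nodes keep a single link while a handful accumulate many, which is why no "typical" degree characterizes such a network.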
Networks of scientific
papers D. J. de Solla Price, 1965
Science, 149(3683) : 510-515
The Kevin Bacon number
http://oracleofbacon.org/
The Paul Erdős
number http://www.ams.org/mathscinet/collaborationDistance.html
The Erdős-Bacon
number http://en.wikipedia.org/wiki/Erd%C5%91s%E2%80%93Bacon_number
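Both numbers are shortest-path distances in a collaboration network, and the Erdős-Bacon number is their sum. A minimal sketch using breadth-first search follows; the two tiny networks and the names "A", "B", "C" and "X" are hypothetical placeholders, not real collaboration data.

```python
from collections import deque

def distance(links, start, goal):
    """Breadth-first search: number of links on a shortest path, or None if unreachable."""
    neighbours = {}
    for a, b in links:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    seen = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            return seen[node]
        for nxt in neighbours.get(node, ()):
            if nxt not in seen:
                seen[nxt] = seen[node] + 1
                queue.append(nxt)
    return None

# Hypothetical toy data: "X" co-authored with "A", who co-authored with Erdős;
# "X" acted with "C", who acted with "B", who acted with Bacon.
coauthorship = [("Erdős", "A"), ("A", "X")]
costarring = [("Bacon", "B"), ("B", "C"), ("C", "X")]

erdos_number = distance(coauthorship, "X", "Erdős")
bacon_number = distance(costarring, "X", "Bacon")
erdos_bacon_number = erdos_number + bacon_number
print(erdos_number, bacon_number, erdos_bacon_number)
```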
The complexity of complex
networks
Complex networks as
rhizomes
“Unlike trees or their roots, the rhizome connects any point to any other point”
Gilles Deleuze & Felix Guattari “A Thousand Plateaus”, 1980
“The main feature of a net is that every point can be connected with every other
point, and where the connections are not yet designed, they are, however,
conceivable and designable.
A net is an unlimited territory”
Umberto Eco, “Semiotics and the Philosophy of Language”, 1986
Part II:
Beheading network
complexity
Carving more than assembling
For example:
two types of network-maps
the good ones
the (pseudo-) exhaustive ones
A (pseudo-)exhaustive map of
the Web http://internet-map.net
A good
map of the Web politicosphere.blog.lemonde.fr
Making networks
readable
A small complex network
Exploiting the power law to make
things readable
The layers of the Web
The layers of complex networks
(visibility)
impossible to miss
more or less visible
higher layer
lower layer
middle layer
almost invisible
The layers of complex networks
(connectivity)
Highly linked
locally and globally
Highly linked locally
Scarcely linked globally
higher layer
lower layer
middle layer
Scarcely linked
locally and globally
The reverse gravity of networks
(many ascending links)
higher layer
lower layer
middle layer
The reverse gravity of networks
(few descending links)
higher layer
lower layer
middle layer
Cutting above
and below
cutting
arbitrary choice
easy separation
Everywhere
Nowhere
Somewhere
Ripping the sides
Ripping
constrained choice
difficult separation
Anatomy
of a corpus
Core
Tendrils
Nebula
higher layer
lower layer
middle layer
Part III:
Visual network analysis with Gephi
Learn how
to use Gephi http://gephi.org/users/
Overview window
Gephi.org
Data laboratory window
Gephi.org
Preview window
Gephi.org
3 visual variables of analysis
1. node position – layout
2. node size – ranking
3. node color – partitions
Gephi.org
Network analysis in 6 questions
Applying a force-vector spatialization
1. What are the debates/discursive communities?
(identification of the clusters of nodes)
2. Which sites are at the center of the debates/communities?
(identification of the central nodes in the network and in the clusters)
3. Which sites connect the debates/communities?
(identification of the bridges between the different clusters)
Applying a ranking by in-degree/out-degree
4. Which sites are the opinion leaders of the online debate?
(identification of the authorities of the graph)
5. Which sites bring the online debate together?
(identification of the hubs of the graph)
Applying a coloring by partition
6. How are the different categories of sites distributed?
(evaluation of the consistency between topology and categorization)
Applying a force-vector spatialization
(ForceAtlas 2)
• LinLog mode
(maximizes the legibility of clusters)
• Prevent overlap
(enhances legibility, but distorts spatialization)
• Scaling
(increases/decreases all distances proportionally)
• Gravity
(pulls everything towards the center, prevents
dispersion, but distorts spatialization)
• Approximate repulsion
(accelerates spatialization on large graphs, but
distorts spatialization)
What are the debates/discursive communities?
(identification of the clusters of nodes)
HeatGraph
(use Google Chrome) http://tools.medialab.sciences-po.fr/heatgraph/
Which sites are at the center of the debates/communities?
(identification of the central nodes in the network and in the clusters)
Which sites connect the debates/communities?
(identification of the bridges between the different clusters)
Applying a ranking by in-degree/out-degree
Which sites are the opinion leaders of the online debate?
(identification of the authorities of the graph)
Which sites bring the online debate together?
(identification of the hubs of the graph)
Applying a coloring
by partitions
How are the different categories of sites distributed?
(evaluation of the consistency between topology and categorization)

Editor's notes

  1. 23/03/12
  2. 27/08/12
  3. 27/08/12
  4. 27/08/12
  5. And even from both of them.
  6. 27/08/12
  7. 27/08/12
  8. 27/08/12
  9. Because of the monstrous size of the Web, there are two types of maps of it. The maps that try to be exhaustive and to trace the entire Web or most of it (and fail)…
  10. … and the good ones.
  11. 27/08/12
  12. A good map of the Web is always limited in its ambition: it tries to represent a limited portion of the Web, and the better this portion is delimited, the better the map. The example is a very interesting map of the French political blogosphere, produced by Linkfluence (a research partner of the médialab).
  13. Indeed, the carving process that we just described is precisely what allows going from a pseudo-exhaustive (in fact, poorly delimited) network to a legible one.
  14. This is a model of a tiny web corpus. It has only some 80 nodes, and yet it looks like a plate of spaghetti (or a hairball).
  15. Now that we know about the power law, however, we can try to de-spaghetticize this graph. To do so, we will first change the size of the nodes according to their in-degree (the number of hyperlinks that they receive).
  16. Secondly, we will re-order the nodes on the Y axis, again according to their in-degree.
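The two steps above can be sketched in a few lines, assuming the corpus is available as a list of (source, target) hyperlinks. The size range, the 0-to-1 vertical scale and the node names are illustrative assumptions, and the function expects at least one node.

```python
from collections import Counter

def in_degree(nodes, links):
    """Number of incoming hyperlinks for every node (0 if it receives none)."""
    counts = Counter(target for _source, target in links)
    return {n: counts.get(n, 0) for n in nodes}

def size_and_height(nodes, links, min_size=4.0, max_size=40.0):
    """Map in-degree to a node size and to a vertical position between 0 and 1."""
    indeg = in_degree(nodes, links)
    top = max(indeg.values()) or 1  # avoid dividing by zero when there are no links
    visual = {}
    for n in nodes:
        share = indeg[n] / top
        visual[n] = {"size": min_size + share * (max_size - min_size), "y": share}
    return visual

# Hypothetical four-site corpus: "d" receives three hyperlinks, "c" one.
nodes = ["a", "b", "c", "d"]
links = [("a", "d"), ("b", "d"), ("c", "d"), ("a", "c")]
visual = size_and_height(nodes, links)
ranked = sorted(nodes, key=lambda n: visual[n]["y"], reverse=True)
print(ranked)
```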
  17. Focusing on visibility, the higher layer contains the websites that are highly visible, appear on the first page of search engines’ results and can be easily found by anyone; the middle layer contains the websites that are less visible, appear on the following pages of search engines’ results and can only be found by experts; the lower layer contains the websites that are almost invisible, don’t show up in search engines and are almost impossible to find.
  18. Focusing on connectivity, the higher layer contains the websites that are highly connected both locally and globally; the middle layer contains the websites that are highly connected locally but poorly connected globally; the lower layer contains the websites that are poorly connected both locally and globally.
  19. The three layers can also be distinguished by looking at the direction of the links. The World Wide Web is characterized by a very peculiar reverse gravity: the less visible websites point toward the more visible ones (thereby making them even more visible)…
  20. … but not the other way around.
  21. We want to exclude these websites because it is impossible to define where they are located. The websites that are too high in the in-degree hierarchy are connected to everyone and are therefore everywhere. The websites that are too low in the in-degree hierarchy are connected to no one and are therefore nowhere. Only the websites in the middle are somewhere, because they are connected only to some. Of course, where exactly this first cut is made depends entirely on the level of specificity of your research. This is why this cut is arbitrary and relatively easy.
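A minimal sketch of this first cut, assuming the in-degree of every website is already known. The two thresholds are arbitrary, as the note stresses, and the site names and values are hypothetical.

```python
def split_layers(indeg, low, high):
    """Sort websites into three layers using two arbitrary in-degree thresholds."""
    layers = {"higher": [], "middle": [], "lower": []}
    for site, k in indeg.items():
        if k >= high:
            layers["higher"].append(site)  # linked by everyone: "everywhere"
        elif k <= low:
            layers["lower"].append(site)   # linked by almost no one: "nowhere"
        else:
            layers["middle"].append(site)  # linked by some: "somewhere"
    return layers

# Hypothetical in-degrees; the thresholds depend on the specificity of the research.
indeg = {"portal": 50, "blog": 6, "forum": 9, "orphan": 0}
layers = split_layers(indeg, low=1, high=20)
print(layers["middle"])
```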
  22. This cut is more difficult because thematic separation on the Web is, as we said, a question of density and rarefaction, and separating one theme from another is more a question of ripping than of cutting.
  23. Through these two operations it is possible to delimit a thematic corpus. This corpus is composed of websites of the intermediary layer, but also of the upper and lower layers. In particular, the websites of the higher layer constitute the core of the corpus, which is surrounded by a nebula of websites of the middle layer and several tendrils in the lower layer.
  24. Now that we have extracted our scientometrics network from Scopus, we can analyse it with Gephi.
  25. Gephi is a very complex piece of software and here I will only have time for a quick introduction. However, if you want to know more about Gephi and its usage, I strongly encourage you to have a look at the documentation on the Gephi website ( http://gephi.org/users/ ), which is extremely well done.
  26. Very quickly, Gephi has three main windows. The first is the ‘Overview’, where you can manipulate and analyze your graph (and where you’ll spend most of your time).
  27. The second window is the ‘Data Laboratory’ where you have a table view of the nodes and the edges of your graph and their attributes.
  28. Finally, the ‘Preview’ window allows you to tweak the visualization parameters of your graph and to export the result of your work as a static image (pdf, png, svg).
  29. Back to the ‘Overview’ window, there are three main palettes that we will employ in the analysis: 1. The ‘Layout’ palette, to change the position of the nodes 2. The ‘Ranking’ palette, to change the size of the nodes 3. The ‘Partitions’ palette, to change the color of the nodes
  30. As you see, the cells of the table are colored with four different colors that indicate the four steps of the analysis: 1. Identification of clusters (layout) 2. Characterization of clusters (layout) 3. Remarkable nodes (layout & ranking) 4. Categories projection (partitions)
  31. To identify the clusters, therefore, the first thing to do is to spatialize the network using a force-vector algorithm. The first action that we will perform on our graph is to spatialize it with the ForceAtlas 2 layout. This algorithm can be tweaked by changing several parameters, the most important of which are: - LinLog mode (maximizes the legibility of clusters) - Prevent overlap (enhances legibility, but distorts spatialization) - Scaling (increases/decreases all distances proportionally) - Gravity (pulls everything towards the center, prevents dispersion, but distorts spatialization) - Approximate repulsion (reduces the time required to spatialize large graphs, but distorts spatialization)
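To make the force-vector idea concrete, here is a toy layout in plain Python. It is not ForceAtlas 2 and does not use Gephi: it only assumes the general principle (all nodes repel each other, linked nodes attract each other, a weak gravity pulls towards the center), and the `scaling` and `gravity` arguments merely mimic the role of the parameters listed above. All names, default values and the test network are illustrative assumptions.

```python
import math
import random

def force_layout(nodes, links, iterations=500, scaling=1.0, gravity=0.05,
                 step=0.05, max_move=0.1, seed=0):
    """Toy force-directed layout; `nodes` must be a list."""
    rng = random.Random(seed)
    pos = {n: [rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)] for n in nodes}
    for _ in range(iterations):
        force = {n: [0.0, 0.0] for n in nodes}
        # Repulsion: every pair of nodes pushes apart, more strongly when close.
        for i, a in enumerate(nodes):
            for b in nodes[i + 1:]:
                dx = pos[a][0] - pos[b][0]
                dy = pos[a][1] - pos[b][1]
                dist = math.hypot(dx, dy) or 1e-9
                push = scaling / (dist * dist)  # magnitude scaling/dist along the unit vector
                force[a][0] += dx * push
                force[a][1] += dy * push
                force[b][0] -= dx * push
                force[b][1] -= dy * push
        # Attraction: linked nodes pull together, proportionally to their distance.
        for a, b in links:
            dx = pos[b][0] - pos[a][0]
            dy = pos[b][1] - pos[a][1]
            force[a][0] += dx
            force[a][1] += dy
            force[b][0] -= dx
            force[b][1] -= dy
        # Gravity: a weak pull of every node towards the center (0, 0).
        for n in nodes:
            force[n][0] -= gravity * pos[n][0]
            force[n][1] -= gravity * pos[n][1]
        # Move each node along its net force, capping the displacement per step.
        for n in nodes:
            fx, fy = force[n]
            magnitude = math.hypot(fx, fy)
            if magnitude > 0.0:
                move = min(step * magnitude, max_move)
                pos[n][0] += fx / magnitude * move
                pos[n][1] += fy / magnitude * move
    return {n: (x, y) for n, (x, y) in pos.items()}

def mean_distance(pairs, layout):
    return sum(math.dist(layout[a], layout[b]) for a, b in pairs) / len(pairs)

# Hypothetical network: two fully linked groups joined by a single bridge.
group_a = ["a0", "a1", "a2", "a3"]
group_b = ["b0", "b1", "b2", "b3"]
inside = [(x, y) for g in (group_a, group_b) for i, x in enumerate(g) for y in g[i + 1:]]
across = [(x, y) for x in group_a for y in group_b]
layout = force_layout(group_a + group_b, inside + [("a0", "b0")])
print(mean_distance(inside, layout), mean_distance(across, layout))
```

In such a layout densely linked nodes end up close together and sparsely linked groups drift apart, which is what makes clusters visible to the eye.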
  32. … it is easy to identify the areas which contain no or few nodes, also called structural holes …
  33. - Central clusters (located in the middle of the network), because centrality in a spatialized graph is a sign of high and highly diverse connectivity. - Bridging clusters (located in-between two clusters), because these clusters play a crucial role in allowing the circulation of things in the network.
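In the slides, bridges are read visually from the layout. As a complementary sketch, and assuming each node has already been assigned to a cluster (the `cluster` mapping and all names below are hypothetical), the nodes that link different clusters can also be listed programmatically.

```python
def bridging_nodes(links, cluster):
    """Map each node to the set of other clusters it is directly linked to."""
    reached = {}
    for a, b in links:
        if cluster[a] != cluster[b]:
            reached.setdefault(a, set()).add(cluster[b])
            reached.setdefault(b, set()).add(cluster[a])
    return reached

# Hypothetical clusters: "m" sits between the "left" and "right" debates.
cluster = {"l1": "left", "l2": "left", "r1": "right", "r2": "right", "m": "middle"}
links = [("l1", "l2"), ("r1", "r2"), ("m", "l1"), ("m", "r1")]
bridges = bridging_nodes(links, cluster)
print(sorted(bridges["m"]))
```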
  36. - The in-degree, corresponding to the number of incoming edges (the number of connections pointing toward the node). The in-degree of a node is also called its ‘authority score’, because receiving many connections is generally correlated with the fact that the node is considered ‘important’ or ‘remarkable’ by the other nodes of the network.
  37. - The out-degree, corresponding to the number of outgoing edges (the number of connections starting from the node). The out-degree of a node is also called its ‘hub score’. Hubs are important in networks because they play a crucial role in the circulation of information. Of course, in-degree and out-degree can only be computed in directed graphs (graphs in which the connections have a direction). In a non-directed graph (such as a graph of friendship, if we assume that friendship is always mutual), it is however possible to compute the degree of nodes (the number of edges connected to each node).
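A minimal sketch of these three measures, assuming the graph is given as a list of (source, target) pairs. It uses raw degree counts as the authority and hub scores, as the notes do; the site names are hypothetical.

```python
from collections import Counter

def directed_degrees(links):
    """In-degree and out-degree of every node of a directed graph."""
    in_degree = Counter(target for _source, target in links)
    out_degree = Counter(source for source, _target in links)
    return in_degree, out_degree

def undirected_degree(links):
    """Degree of every node when the direction of the links is ignored."""
    degree = Counter()
    for a, b in links:
        degree[a] += 1
        degree[b] += 1
    return degree

# Hypothetical web corpus: "portal" is cited by many sites, "directory" cites many sites.
links = [("blog1", "portal"), ("blog2", "portal"), ("blog3", "portal"),
         ("directory", "blog1"), ("directory", "blog2"), ("directory", "portal")]
in_degree, out_degree = directed_degrees(links)
degree = undirected_degree(links)
authority = in_degree.most_common(1)[0][0]
hub = out_degree.most_common(1)[0][0]
print(authority, hub)
```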
  39. But it is also interesting to observe if topology and classification are consistent (if most of the nodes of a given type are located within the same clusters and, conversely, if clusters are formed by nodes of the same type).
  40. If topology and classification are consistent, it is then interesting to zoom in on the exceptions and take a closer look at the nodes that have an unusual position compared to the other nodes of the same type.
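One simple way to put a number on this consistency, offered here as an assumption rather than as the method used in the slides, is the share of links that join two nodes of the same category. The exceptions are then the nodes whose neighbours mostly belong to another category. Site names and categories are hypothetical.

```python
def same_category_share(links, category):
    """Fraction of links whose two endpoints belong to the same category."""
    same = sum(1 for a, b in links if category[a] == category[b])
    return same / len(links)

def exceptions(links, category):
    """Nodes with fewer than half of their neighbours in their own category."""
    neighbours = {}
    for a, b in links:
        neighbours.setdefault(a, []).append(b)
        neighbours.setdefault(b, []).append(a)
    odd = []
    for node, around in neighbours.items():
        same = sum(1 for n in around if category[n] == category[node])
        if same * 2 < len(around):
            odd.append(node)
    return sorted(odd)

# Hypothetical corpus: "x" is categorized as media but only linked to blogs.
category = {"a1": "blog", "a2": "blog", "a3": "blog",
            "m1": "media", "m2": "media", "x": "media"}
links = [("a1", "a2"), ("a2", "a3"), ("a1", "a3"),
         ("m1", "m2"), ("x", "a1"), ("x", "a2")]
share = same_category_share(links, category)
odd_nodes = exceptions(links, category)
print(round(share, 2), odd_nodes)
```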