SlideShare une entreprise Scribd logo
1  sur  54
Interactive visualization and exploration
of network data with gephi
Bernhard Rieder
Universiteit van Amsterdam
Mediastudies Department
and some conceptual context
Context
Terms like "big data", "computational social science", "digital humanities",
"digital methods", etc. are receiving a lot of attention.
They point to a set of practices of knowledge production: data analysis,
visualization, modeling, etc.
Instead of a totalizing search for a "logic" of data analysis, we could
inquire into the vocabulary of concepts and analytical gestures that
constitute the practice of data analysis.
A twofold approach to methods:
☉ Engagement, development, application => digital methods
☉ Conceptual, historical, and political analysis and critique => software studies
This workshop
How do we talk about data? How do we analyze them? What is our frame
of thought? How do we go further in terms of imagination, expressivity?
☉ Introduction
☉ A bit of math
☉ Two kinds of mathematics
☉ Concepts and techniques from graph theory
☉ Working with gephi
Engage the theory of knowledge (epistemology) mobilized in data analysis,
but through the actual techniques and not generalizing concepts.
Basic ideas
Why?
Why do network analysis and visualization? Which arguments are put
forward?
☉ New media: technical and conceptual structures modeled as networks
☉ Calculative capacities: powerful techniques and tools
☉ Visualization: the network diagram, "visual analytics"
☉ Logistics: data and software are available
☉ Methodology: dissatisfaction with statistics (SNA)
☉ Society: diversification, problems with demographics / statistics / theory
Platforms like Twitter
boost opportunities for
connectivity between
various types of actors.
At the same time, they
produce detailed data
traces that are highly
centralized and searchable.
Much of these data can be
analyzed as graphs.
What styles of reasoning?
Hacking (1991) building the concept of "style of reasoning" on A. C.
Crombie’s (1994) "styles of scientific thinking":
☉ postulation and deduction
☉ experiment and empirical research
☉ reasoning by analogy
☉ ordering by comparison and taxonomy
☉ statistical analysis of regularities and probabilities
☉ genetic development
What kind of reasoning are we mobilizing in data analysis?
Is it one type of reasoning or many?
Are we "positivists" when we do data analysis? Reductionists?
Quality / quantity
"One of my favorite fantasies is a dialogue between Mills and Lazarsfeld in which the former
reads to the latter the first sentence of The Sociological Imagination: 'Nowadays men often
feel that their private lives are a series of traps.' Lazarsfeld immediately replies: 'How many
men, which men, how long have they felt this way, which aspects of their private lives
bother them, do their public lives bother them, when do they feel free rather than trapped,
what kinds of traps do they experience, etc., etc., etc.' If Mills succumbed, the two of them
would have to apply to the National Institute of Mental Health for a million-dollar grant to
check out and elaborate that first sentence. They would need a staff of hundreds, and when
finished they would have written Americans View Their Mental Health rather than The
Sociological Imagination, provided that they finished at all, and provided that either of them
cared enough at the end to bother writing anything." (Maurice Stein, cit. in Gitlin 1978)
Theory vs. empiricism, macro vs. micro, qualitative vs. quantitative, inductive vs.
deductive, associative vs. formalistic, etc.
The promise of data analysis tools, applied to exhaustive (and cheap) data, is to
bridge the gap, to allow zooming, "quali-quanti" (Latour 2010).
Two kinds of mathematics
Can there be data analysis without math? No.
Does this imply epistemological commitments? Yes.
But there are choices, e.g. between:
☉ Confirmatory data analysis => deductive
☉ Exploratory data analysis (Tukey 1962) => inductive
There is a fast growing variety of formal analytical gestures relying on
mathematical modeling and computation.
Two kinds of mathematics
Statistics
Observed: objects and properties
Inferred: social forces
Data representation: the table
Visual representation: quantity charts
Grouping: "class" (similar properties)
Graph-theory
Observed: objects and relations
Inferred: structure
Data representation: the matrix
Visual representation: network diagrams
Grouping: "clique" (dense relations)
Graph theory
Leonhard Euler, "Seven Bridges of Königsberg", 1735
Introducing the "point and line" model
Graph theory
Develops over the 20th century, in particular the second half.
Integrates branches of mathematics (topology, geometry, statistics, etc.).
Graph theory is "the mathematics of structure" (Harary 1965), "a
mathematical model for any system involving a binary relation" (Harary
1969); it makes relational structure calculable.
"Perhaps even more than to the contact between mankind and nature, graph theory owes to
the contact of human beings between each other." (König 1936)
Basic ideas
Moreno 1934
Graph theory developed in
exchange with sociometry,
small-group research and
(later) social exchange
theory.
Starting point:
"the sociometric test"
(experimental definition of
"relation")
Basic ideas
Forsythe and Katz, 1946, "adjacency matrix"
Harary, Graph Theory, 1969
Basic ideas
The late 1990s
The network "singularity":
☉ The network imaginary, a "new science of networks" (Watts 2005)
☉ Computational capacities (memory, speed, interfaces, etc.)
☉ New platforms and datasets
☉ Packaged tools
Different traditions conflate to form network analysis:
☉ Social network analysis and sociometrics
☉ Scientometrics / science and technology studies
☉ Mathematics / physics / computer science
☉ Information and data visualization
☉ Digital sociology / new media studies
Basic ideas
Adamic and Glance, "Divided They Blog", 2005
Formalization
"As we have seen, the basic terms of digraph theory are point and line. Thus, if an
appropriate coordination is made so that each entity of an empirical system is identified
with a point and each relationship is identified with a line, then for all true statements
about structural properties of the obtained digraph there are corresponding true statements
about structural properties of the empirical system." (Harary et al. 1965)
There is always an epistemological commitment!
=> What can "carry" the reductionism and formalization?
=> What types of analytical gestures?
Facebook Page "ElShaheeed", June 2010 – June 2011, (Poell / Rieder, forthcoming)
7K posts, 700K users, 3.6M comments, 10M likes (tool: netvizz), work in progress!
Facebook Page "ElShaheeed", June 2010 – June 2011:
comment timescatter, log10 y scale, likes on
Facebook Page "ElShaheeed", June 2010 – June 2011:
scatterplot comments / likes, per post type
Facebook Page "ElShaheeed"
700K nodes, 11M connections
Color: type
Facebook Page "ElShaheeed"
700K nodes, 11M connections
Color: outdegree
Basic ideas
What Kind of Phenomena/Data?
Interactive networks (Watts 2004): link encodes tangible interaction
☉ social network
☉ citation networks
☉ hypertext networks
Symbolic networks (Watts 2004): link is conceptual
☉ co-presence (Tracker Tracker, IMDB, etc.)
☉ co-word
☉ any kind of "structure" that can be as point and line
=> do all kinds of analysis (SNA, transportation, text mining, etc.)
=> analyze structure in various ways
Basic ideas
What is a graph?
An abstract representation of nodes connected by links.
Two ways of dealing with graphs:
☉ mathematical analysis (graph statistics, structural measures, etc.)
☉ visualization (network diagram, matrix, arc diagram, etc.)
Three different force-based layouts of my FB profile
OpenOrd, ForceAtlas, Fruchterman-Reingold
Non force-based layouts
Circle diagram, parallel bubble lines, arc diagram
Network statistics
betweenness centrality
degree
Relational elements of graphs can
be represented as tables (nodes
have properties) and analyzed
through statistics.
Network statistics bridge the gap
between individual units and the
structural forms they are
embedded in.
This is currently an extremely
prolific field of research.
Basic ideas
Basic ideas
What is a graph?
Vertices and edges!
Nodes and lines!
Two main types:
Directed (e.g. Twitter)
Undirected (e.g. Facebook)
Properties of nodes:
degree, centrality, etc.
Properties of edges:
weight, direction, etc.
Properties of the graph:
averages, diameter, communities, etc.
Basic ideas
Basic ideas
Wikipedia: Glossary of graph theory
Tools are easy, concepts are hard
Basic ideas
Interactive visual analytics
Bringing structure to the surface (gephi panel: "layout")
☉ different spatializations (force, geometry, etc.)
Projecting variables into the diagram (gephi panel: "ranking")
☉ Size (nodes, edges, labels, etc.)
☉ Color (nodes, edges, labels, etc.)
Deriving measures (gephi panel: "statistics")
☉ Properties of nodes, edges, structure => new variables
Analysis: e.g. correlation between spatial layout and variables?
Basic ideashttp://courses.polsys.net/gephi/
Basic ideas
Basic ideas
Twitter #ows dataset, co-hashtag analysis
Strong topic clustering
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Size: frequency
Color: modularity
Size: frequency
Color: user diversity
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Size: frequency
Color: degree
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Twitter 1% sample
Co-hashtag analysis
Degree vs.
wordFrequency
Degree vs. userDiversity
Twitter 1% sample
Co-hashtag analysis
FB group "Islam is dangerous"
Friendship network, color: betweenness centrality
2.339 members
Average degree of 39.69
81.7% have at least one friend in the group
55.4% five or more
37.2% have 20 or more
founder and admin has 609 friends
FB group "Islam is dangerous"
Friendship network, color: Interface language
en_us, de, en_uk, it dominate
Mapping European Extremism
Friendship relations of 18 extreme-right groups
FB page "Educate children about the evils of Islam"
Links have more comments, photos more likes.
FB page "Stop the Islamization
of the World"
Number of posts and reactions
FB page "Stop the
Islamization of the World"
Basic ideas
Interactive visual analytics
Bringing structure to the surface (gephi panel: "layout")
☉ different spatializations (force, geometry, etc.)
Projecting variables into the diagram (gephi panel: "ranking")
☉ Size (nodes, edges, labels, etc.)
☉ Color (nodes, edges, labels, etc.)
Deriving measures (gephi panel: "statistics")
☉ Properties of nodes, edges, structure => new variables
Analysis: e.g. correlation between spatial layout and variables?
Basic ideashttp://courses.polsys.net/gephi/
Nine measures of centrality (Freeman 1979)
Label PR α=0.85 PR α=0.7 PR α=0.55 PR α=0.4 In-Degree Out-Degree Degree
n34 0.0944 0.0743 0.0584 0.0460 4 1 5
n1 0.0867 0.0617 0.0450 0.0345 1 2 3
n17 0.0668 0.0521 0.0423 0.0355 2 1 3
n39 0.0663 0.0541 0.0453 0.0388 5 1 6
n22 0.0619 0.0506 0.0441 0.0393 5 1 6
n27 0.0591 0.0451 0.0371 0.0318 1 0 1
n38 0.0522 0.0561 0.0542 0.0486 6 0 6
n11 0.0492 0.0372 0.0306 0.0274 3 1 4
Basic ideas
US Airports
Thank You
rieder@uva.nl
https://www.digitalmethods.net
http://thepoliticsofsystems.net
"Far better an approximate answer to the right question,
which is often vague, than an exact answer to the wrong
question, which can always be made precise. Data
analysis must progress by approximate answers, at best,
since its knowledge of what the problem really is will at
best be approximate." (Tukey 1962)

Contenu connexe

Tendances

On Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationOn Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationBernhard Rieder
 
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...On the Diversity of the Accountability Problem. Machine Learning and Knowing ...
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...Bernhard Rieder
 
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleTweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleBernhard Rieder
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Lauri Eloranta
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data AnalysisMason Porter
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Jonathan Stray
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Doug Needham
 
Centrality in Time- Dependent Networks
Centrality in Time- Dependent NetworksCentrality in Time- Dependent Networks
Centrality in Time- Dependent NetworksMason Porter
 
Beauty as a Bridge to NodeXL
Beauty as a Bridge to NodeXLBeauty as a Bridge to NodeXL
Beauty as a Bridge to NodeXLShalin Hai-Jew
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Lauri Eloranta
 
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...Lauri Eloranta
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Mason Porter
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...BAINIDA
 
Towards Contested Collective Intelligence
Towards Contested Collective IntelligenceTowards Contested Collective Intelligence
Towards Contested Collective IntelligenceSimon Buckingham Shum
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...Daniel Katz
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Vala Ali Rohani
 
Formations & Deformations of Social Network Graphs
Formations & Deformations of Social Network GraphsFormations & Deformations of Social Network Graphs
Formations & Deformations of Social Network GraphsShalin Hai-Jew
 

Tendances (20)

On Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationOn Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric Diversification
 
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...On the Diversity of the Accountability Problem. Machine Learning and Knowing ...
On the Diversity of the Accountability Problem. Machine Learning and Knowing ...
 
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleTweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
 
07 Network Visualization
07 Network Visualization07 Network Visualization
07 Network Visualization
 
Introduction to Topological Data Analysis
Introduction to Topological Data AnalysisIntroduction to Topological Data Analysis
Introduction to Topological Data Analysis
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
Centrality in Time- Dependent Networks
Centrality in Time- Dependent NetworksCentrality in Time- Dependent Networks
Centrality in Time- Dependent Networks
 
Beauty as a Bridge to NodeXL
Beauty as a Bridge to NodeXLBeauty as a Bridge to NodeXL
Beauty as a Bridge to NodeXL
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1
 
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...
Complex Social Systems - Lecture 5 in Introduction to Computational Social Sc...
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
 
Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)Paper Writing in Applied Mathematics (slightly updated slides)
Paper Writing in Applied Mathematics (slightly updated slides)
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
Towards Contested Collective Intelligence
Towards Contested Collective IntelligenceTowards Contested Collective Intelligence
Towards Contested Collective Intelligence
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
 
Social Network Analysis (Part 1)
Social Network Analysis (Part 1)Social Network Analysis (Part 1)
Social Network Analysis (Part 1)
 
Formations & Deformations of Social Network Graphs
Formations & Deformations of Social Network GraphsFormations & Deformations of Social Network Graphs
Formations & Deformations of Social Network Graphs
 

En vedette

Gephi : dynamic features
Gephi : dynamic featuresGephi : dynamic features
Gephi : dynamic featuresSébastien
 
Introduction to Biourbanism . Epistemology for a new Architecture, by Antonio...
Introduction to Biourbanism. Epistemology for a new Architecture, by Antonio...Introduction to Biourbanism. Epistemology for a new Architecture, by Antonio...
Introduction to Biourbanism . Epistemology for a new Architecture, by Antonio...International Society of Biourbanism
 
Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Goa App
 
Gephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Consortium
 
91 Free Twitter Tools and Apps to Fit Any Need
91 Free Twitter Tools and Apps to Fit Any Need91 Free Twitter Tools and Apps to Fit Any Need
91 Free Twitter Tools and Apps to Fit Any NeedBuffer
 

En vedette (9)

Gephi : dynamic features
Gephi : dynamic featuresGephi : dynamic features
Gephi : dynamic features
 
Gephi Tutorial Layouts
Gephi Tutorial LayoutsGephi Tutorial Layouts
Gephi Tutorial Layouts
 
Gephi Quick Start
Gephi Quick StartGephi Quick Start
Gephi Quick Start
 
Introduction to Biourbanism . Epistemology for a new Architecture, by Antonio...
Introduction to Biourbanism. Epistemology for a new Architecture, by Antonio...Introduction to Biourbanism. Epistemology for a new Architecture, by Antonio...
Introduction to Biourbanism . Epistemology for a new Architecture, by Antonio...
 
Data visualization 4 dummies
Data visualization 4 dummiesData visualization 4 dummies
Data visualization 4 dummies
 
Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Social Network Analysis Using Gephi
Social Network Analysis Using Gephi
 
Gephi Tutorial Visualization
Gephi Tutorial VisualizationGephi Tutorial Visualization
Gephi Tutorial Visualization
 
91 Free Twitter Tools and Apps to Fit Any Need
91 Free Twitter Tools and Apps to Fit Any Need91 Free Twitter Tools and Apps to Fit Any Need
91 Free Twitter Tools and Apps to Fit Any Need
 
Gephi Toolkit Tutorial
Gephi Toolkit TutorialGephi Toolkit Tutorial
Gephi Toolkit Tutorial
 

Similaire à Interactive visualization and exploration of network data with gephi

Map history-networks-shorter
Map history-networks-shorterMap history-networks-shorter
Map history-networks-shorterMason Porter
 
How and why study big cultural data
How and why study big cultural dataHow and why study big cultural data
How and why study big cultural dataLev Manovich
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Jonathan Stray
 
Making the invisible visible through SNA
Making the invisible visible through SNAMaking the invisible visible through SNA
Making the invisible visible through SNAMYRA School of Business
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsBeth Johnson
 
DIGH 5000: Data visualization & analysis Presentation
DIGH 5000: Data visualization & analysis PresentationDIGH 5000: Data visualization & analysis Presentation
DIGH 5000: Data visualization & analysis PresentationDanuta Sierhuis, M.A.
 
On relational sociology
On relational sociologyOn relational sociology
On relational sociologyNaoki Maejima
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Toby Burrows
 
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...Andrea Scharnhorst
 
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)Duke Network Analysis Center
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studiesdnac
 
Crafting poems for data analysis?
Crafting poems for data analysis?Crafting poems for data analysis?
Crafting poems for data analysis?Tünde Varga-Atkins
 
Mathematics and Social Networks
Mathematics and Social NetworksMathematics and Social Networks
Mathematics and Social NetworksMason Porter
 
Social Dynamics on Networks
Social Dynamics on NetworksSocial Dynamics on Networks
Social Dynamics on NetworksMason Porter
 
Steps Towards a History of Ethnomethodology in HCI
Steps Towards a History of Ethnomethodology in HCI Steps Towards a History of Ethnomethodology in HCI
Steps Towards a History of Ethnomethodology in HCI butest
 
b6928-cr-primer-pack-sept-2021.pptx
b6928-cr-primer-pack-sept-2021.pptxb6928-cr-primer-pack-sept-2021.pptx
b6928-cr-primer-pack-sept-2021.pptxpgupostgrad
 

Similaire à Interactive visualization and exploration of network data with gephi (20)

Map history-networks-shorter
Map history-networks-shorterMap history-networks-shorter
Map history-networks-shorter
 
How and why study big cultural data
How and why study big cultural dataHow and why study big cultural data
How and why study big cultural data
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
 
Making the invisible visible through SNA
Making the invisible visible through SNAMaking the invisible visible through SNA
Making the invisible visible through SNA
 
Objectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative ConnotationsObjectification Is A Word That Has Many Negative Connotations
Objectification Is A Word That Has Many Negative Connotations
 
DIGH 5000: Data visualization & analysis Presentation
DIGH 5000: Data visualization & analysis PresentationDIGH 5000: Data visualization & analysis Presentation
DIGH 5000: Data visualization & analysis Presentation
 
On relational sociology
On relational sociologyOn relational sociology
On relational sociology
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...
 
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...
Comparison of methods – an unloved duty? Examples from an ongoing bibliometri...
 
Scits 2014
Scits 2014Scits 2014
Scits 2014
 
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)
10 More than a Pretty Picture: Visual Thinking in Network Studies (2016)
 
10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies10 More than a Pretty Picture: Visual Thinking in Network Studies
10 More than a Pretty Picture: Visual Thinking in Network Studies
 
Crafting poems for data analysis?
Crafting poems for data analysis?Crafting poems for data analysis?
Crafting poems for data analysis?
 
Mathematics and Social Networks
Mathematics and Social NetworksMathematics and Social Networks
Mathematics and Social Networks
 
Computer model
Computer modelComputer model
Computer model
 
Social Dynamics on Networks
Social Dynamics on NetworksSocial Dynamics on Networks
Social Dynamics on Networks
 
Steps Towards a History of Ethnomethodology in HCI
Steps Towards a History of Ethnomethodology in HCI Steps Towards a History of Ethnomethodology in HCI
Steps Towards a History of Ethnomethodology in HCI
 
Social network
Social networkSocial network
Social network
 
b6928-cr-primer-pack-sept-2021.pptx
b6928-cr-primer-pack-sept-2021.pptxb6928-cr-primer-pack-sept-2021.pptx
b6928-cr-primer-pack-sept-2021.pptx
 

Dernier

A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 

Dernier (20)

A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 

Interactive visualization and exploration of network data with gephi

  • 1. Interactive visualization and exploration of network data with gephi Bernhard Rieder Universiteit van Amsterdam Mediastudies Department and some conceptual context
  • 2. Context Terms like "big data", "computational social science", "digital humanities", "digital methods", etc. are receiving a lot of attention. They point to a set of practices of knowledge production: data analysis, visualization, modeling, etc. Instead of a totalizing search for a "logic" of data analysis, we could inquire into the vocabulary of concepts and analytical gestures that constitute the practice of data analysis. A twofold approach to methods: ☉ Engagement, development, application => digital methods ☉ Conceptual, historical, and political analysis and critique => software studies
  • 3. This workshop How do we talk about data? How do we analyze them? What is our frame of thought? How do we go further in terms of imagination, expressivity? ☉ Introduction ☉ A bit of math ☉ Two kinds of mathematics ☉ Concepts and techniques from graph theory ☉ Working with gephi Engage the theory of knowledge (epistemology) mobilized in data analysis, but through the actual techniques and not generalizing concepts.
  • 4. Basic ideas Why? Why do network analysis and visualization? Which arguments are put forward? ☉ New media: technical and conceptual structures modeled as networks ☉ Calculative capacities: powerful techniques and tools ☉ Visualization: the network diagram, "visual analytics" ☉ Logistics: data and software are available ☉ Methodology: dissatisfaction with statistics (SNA) ☉ Society: diversification, problems with demographics / statistics / theory
  • 5. Platforms like Twitter boost opportunities for connectivity between various types of actors.
  • 6. At the same time, they produce detailed data traces that are highly centralized and searchable. Much of these data can be analyzed as graphs.
  • 7. What styles of reasoning? Hacking (1991) building the concept of "style of reasoning" on A. C. Crombie’s (1994) "styles of scientific thinking": ☉ postulation and deduction ☉ experiment and empirical research ☉ reasoning by analogy ☉ ordering by comparison and taxonomy ☉ statistical analysis of regularities and probabilities ☉ genetic development What kind of reasoning are we mobilizing in data analysis? Is it one type of reasoning or many? Are we "positivists" when we do data analysis? Reductionists?
  • 8. Quality / quantity "One of my favorite fantasies is a dialogue between Mills and Lazarsfeld in which the former reads to the latter the first sentence of The Sociological Imagination: 'Nowadays men often feel that their private lives are a series of traps.' Lazarsfeld immediately replies: 'How many men, which men, how long have they felt this way, which aspects of their private lives bother them, do their public lives bother them, when do they feel free rather than trapped, what kinds of traps do they experience, etc., etc., etc.' If Mills succumbed, the two of them would have to apply to the National Institute of Mental Health for a million-dollar grant to check out and elaborate that first sentence. They would need a staff of hundreds, and when finished they would have written Americans View Their Mental Health rather than The Sociological Imagination, provided that they finished at all, and provided that either of them cared enough at the end to bother writing anything." (Maurice Stein, cit. in Gitlin 1978) Theory vs. empiricism, macro vs. micro, qualitative vs. quantitative, inductive vs. deductive, associative vs. formalistic, etc. The promise of data analysis tools, applied to exhaustive (and cheap) data, is to bridge the gap, to allow zooming, "quali-quanti" (Latour 2010).
  • 9. Two kinds of mathematics Can there be data analysis without math? No. Does this imply epistemological commitments? Yes. But there are choices, e.g. between: ☉ Confirmatory data analysis => deductive ☉ Exploratory data analysis (Tukey 1962) => inductive There is a fast growing variety of formal analytical gestures relying on mathematical modeling and computation.
  • 10. Two kinds of mathematics Statistics Observed: objects and properties Inferred: social forces Data representation: the table Visual representation: quantity charts Grouping: "class" (similar properties) Graph-theory Observed: objects and relations Inferred: structure Data representation: the matrix Visual representation: network diagrams Grouping: "clique" (dense relations)
  • 11. Graph theory Leonhard Euler, "Seven Bridges of Königsberg", 1735 Introducing the "point and line" model
  • 12. Graph theory Develops over the 20th century, in particular the second half. Integrates branches of mathematics (topology, geometry, statistics, etc.). Graph theory is "the mathematics of structure" (Harary 1965), "a mathematical model for any system involving a binary relation" (Harary 1969); it makes relational structure calculable. "Perhaps even more than to the contact between mankind and nature, graph theory owes to the contact of human beings between each other." (König 1936)
  • 13. Basic ideas Moreno 1934 Graph theory developed in exchange with sociometry, small-group research and (later) social exchange theory. Starting point: "the sociometric test" (experimental definition of "relation")
  • 15. Forsythe and Katz, 1946, "adjacency matrix"
  • 17. Basic ideas The late 1990s The network "singularity": ☉ The network imaginary, a "new science of networks" (Watts 2005) ☉ Computational capacities (memory, speed, interfaces, etc.) ☉ New platforms and datasets ☉ Packaged tools Different traditions conflate to form network analysis: ☉ Social network analysis and sociometrics ☉ Scientometrics / science and technology studies ☉ Mathematics / physics / computer science ☉ Information and data visualization ☉ Digital sociology / new media studies
  • 18. Basic ideas Adamic and Glance, "Divided They Blog", 2005
  • 19. Formalization "As we have seen, the basic terms of digraph theory are point and line. Thus, if an appropriate coordination is made so that each entity of an empirical system is identified with a point and each relationship is identified with a line, then for all true statements about structural properties of the obtained digraph there are corresponding true statements about structural properties of the empirical system." (Harary et al. 1965) There is always an epistemological commitment! => What can "carry" the reductionism and formalization? => What types of analytical gestures?
  • 20. Facebook Page "ElShaheeed", June 2010 – June 2011, (Poell / Rieder, forthcoming) 7K posts, 700K users, 3.6M comments, 10M likes (tool: netvizz), work in progress!
  • 21. Facebook Page "ElShaheeed", June 2010 – June 2011: comment timescatter, log10 y scale, likes on
  • 22. Facebook Page "ElShaheeed", June 2010 – June 2011: scatterplot comments / likes, per post type
  • 23. Facebook Page "ElShaheeed" 700K nodes, 11M connections Color: type
  • 24. Facebook Page "ElShaheeed" 700K nodes, 11M connections Color: outdegree
  • 25. Basic ideas What Kind of Phenomena/Data? Interactive networks (Watts 2004): link encodes tangible interaction ☉ social network ☉ citation networks ☉ hypertext networks Symbolic networks (Watts 2004): link is conceptual ☉ co-presence (Tracker Tracker, IMDB, etc.) ☉ co-word ☉ any kind of "structure" that can be as point and line => do all kinds of analysis (SNA, transportation, text mining, etc.) => analyze structure in various ways
  • 26. Basic ideas What is a graph? An abstract representation of nodes connected by links. Two ways of dealing with graphs: ☉ mathematical analysis (graph statistics, structural measures, etc.) ☉ visualization (network diagram, matrix, arc diagram, etc.)
  • 27. Three different force-based layouts of my FB profile OpenOrd, ForceAtlas, Fruchterman-Reingold
  • 28. Non force-based layouts Circle diagram, parallel bubble lines, arc diagram
  • 29. Network statistics betweenness centrality degree Relational elements of graphs can be represented as tables (nodes have properties) and analyzed through statistics. Network statistics bridge the gap between individual units and the structural forms they are embedded in. This is currently an extremely prolific field of research.
  • 31. Basic ideas What is a graph? Vertices and edges! Nodes and lines! Two main types: Directed (e.g. Twitter) Undirected (e.g. Facebook) Properties of nodes: degree, centrality, etc. Properties of edges: weight, direction, etc. Properties of the graph: averages, diameter, communities, etc.
  • 33. Basic ideas Wikipedia: Glossary of graph theory Tools are easy, concepts are hard
  • 34. Basic ideas Interactive visual analytics Bringing structure to the surface (gephi panel: "layout") ☉ different spatializations (force, geometry, etc.) Projecting variables into the diagram (gephi panel: "ranking") ☉ Size (nodes, edges, labels, etc.) ☉ Color (nodes, edges, labels, etc.) Deriving measures (gephi panel: "statistics") ☉ Properties of nodes, edges, structure => new variables Analysis: e.g. correlation between spatial layout and variables?
  • 37. Basic ideas Twitter #ows dataset, co-hashtag analysis Strong topic clustering
  • 38. Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50) Size: frequency Color: modularity
  • 39. Size: frequency Color: user diversity Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50)
  • 40. Size: frequency Color: degree Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50)
  • 41. Twitter 1% sample Co-hashtag analysis Degree vs. wordFrequency
  • 42. Degree vs. userDiversity Twitter 1% sample Co-hashtag analysis
  • 43. FB group "Islam is dangerous" Friendship network, color: betweenness centrality 2.339 members Average degree of 39.69 81.7% have at least one friend in the group 55.4% five or more 37.2% have 20 or more founder and admin has 609 friends
  • 44. FB group "Islam is dangerous" Friendship network, color: Interface language en_us, de, en_uk, it dominate
  • 45. Mapping European Extremism Friendship relations of 18 extreme-right groups
  • 46. FB page "Educate children about the evils of Islam" Links have more comments, photos more likes.
  • 47. FB page "Stop the Islamization of the World" Number of posts and reactions
  • 48. FB page "Stop the Islamization of the World"
  • 49. Basic ideas Interactive visual analytics Bringing structure to the surface (gephi panel: "layout") ☉ different spatializations (force, geometry, etc.) Projecting variables into the diagram (gephi panel: "ranking") ☉ Size (nodes, edges, labels, etc.) ☉ Color (nodes, edges, labels, etc.) Deriving measures (gephi panel: "statistics") ☉ Properties of nodes, edges, structure => new variables Analysis: e.g. correlation between spatial layout and variables?
  • 51. Nine measures of centrality (Freeman 1979)
  • 52. Label PR α=0.85 PR α=0.7 PR α=0.55 PR α=0.4 In-Degree Out-Degree Degree n34 0.0944 0.0743 0.0584 0.0460 4 1 5 n1 0.0867 0.0617 0.0450 0.0345 1 2 3 n17 0.0668 0.0521 0.0423 0.0355 2 1 3 n39 0.0663 0.0541 0.0453 0.0388 5 1 6 n22 0.0619 0.0506 0.0441 0.0393 5 1 6 n27 0.0591 0.0451 0.0371 0.0318 1 0 1 n38 0.0522 0.0561 0.0542 0.0486 6 0 6 n11 0.0492 0.0372 0.0306 0.0274 3 1 4
  • 54. Thank You rieder@uva.nl https://www.digitalmethods.net http://thepoliticsofsystems.net "Far better an approximate answer to the right question, which is often vague, than an exact answer to the wrong question, which can always be made precise. Data analysis must progress by approximate answers, at best, since its knowledge of what the problem really is will at best be approximate." (Tukey 1962)

Notes de l'éditeur

  1. Anatomy of a tweet. https://twitter.com/ICIJorg/status/321585235491962880https://api.twitter.com/1/statuses/show/321585235491962880.json
  2. Very large scale systems on the one side, but highly concentrated data repositories on the other.The promise of data analysis is, of course, to use that data to make sense of all the complexity.
  3. Reduction to the point and line model requires purging of context, etc. but provides considerable computational capacities. No free lunch, as it were.
  4. Gitlin, The Dominant Paradigm, 1978Many people argue that we no longer need that grant, we already have the data.
  5. Tukey, The Future of Data Analysis, 1962
  6. Allows for all kinds of folding, combinations, etc. – Math is not homogeneous, but sprawling!Different forms of reasoning, different modes of aggregation.These are already analytical frameworks, different ways of formalizing.Statistics: atomism, structure is implicit ("hidden forces", "social forces" cf. Durhkeim) => groups are abstractions, constituted by socioeconomic similaritySocial Network Analysis: atomism, structure is explicit ("dyadic forces") => groups are concrete, constituted by social exchange
  7. Now we can calculate (in particular via matrix algebra).
  8. Handbooks on graph theory are full of exhaustive discussions of basic graph types. Loads of vocabulary and analytical approaches.
  9. http://www.facebook.com/ElShaheeed (Created by WaelGhonim, considered to be a central place for the sparking of the Egyptian Revolution)http://apps.facebook.com/netvizz/ (tool used for extraction)
  10. Classic, simple statistics are often very useful.
  11. It's easy and interesting to produce correlations.
  12. A very different concept of "relation" – no longer derived and probabilistic (correlation) but explicit.Formalization: here: posts and users, but we could do many other things, e.g. users linked by co-reaction, posts linked by co-reaction, we could look at relationships between words, etc.How do we interpret this: understand the platform, understand the context of the phenomenon, understand the algorithm, etc.
  13. How do we interpret something like this?
  14. Visualization is, again, one type of analysis.Which properties of the network are "made salient" by an algorithm?http://thepoliticsofsystems.net/2010/10/one-network-and-four-algorithms/Models behind: spring simulation, simulated annealing (http://wiki.cns.iu.edu/pages/viewpage.action?pageId=1704113)
  15. Non force-based layouts can be extremely useful. Gephi can produce those as well
  16. Extend word lists (what am I missing?), account for refraction. Rieder & Gerlitz 2013: http://journal.media-culture.org.au/index.php/mcjournal/article/viewArticle/620Rieder 2012: http://firstmonday.org/ojs/index.php/fm/article/view/4199/3359
  17. Project variables into the graph User diversity = no of unique users of a hashtag divided by hashtag frequency
  18. Larger roles of hashtags, not all are issue markers!
  19. There is no need to analyze and visualize a graph as a network.Characterize hashtags in relation to a whole. (their role beyond a particular topic sample), better understand our "fishing pole" (the sample technique) and the weight it carries.Tbt: throwback thursday
  20. From DMI workshop on anti-Islamism and right-wing extremism.We can also look at interaction patters: activity structure, held together by leaders?
  21. From DMI workshop on anti-Islamism and right-wing extremism.Netvizz also allows for looking into interaction patterns in groups..
  22. Unique user id allows for large scale analysis.The connectors are often the admins.See: https://wiki.digitalmethods.net/CounterJihadism/ProjectGroup7
  23. We can also look at interaction patters: activity structure, held together by leaders?
  24. Combination of methods is most interesting.The move to posting a larger number of photos is highly successful for this page.What is happening in April 2012?For more details see: https://wiki.digitalmethods.net/CounterJihadism/ProjectGroup3Network diagrams are not so good with time.
  25. Simply looking at "images" in quantitative terms is not enough, here the qualitative part begins and netvizz makes it easier to take that step:Extracting photo URLsExtracting commentsStudying most "successful" tropes in depth, etc.For more details see: https://wiki.digitalmethods.net/CounterJihadism/ProjectGroup3
  26. Network analysis has produced a large number of calculated metrics that take into account the structure of the network."All in all, this process resulted in the specification of nine centrality measures based on three conceptual foundations. Three are based on the degrees of points and are indexes of communication activity. Three are based on the betweenness of points and are indexes of potential for control of communication. And three are based on closeness and are indexes either of independence or efficiency." (Freeman 1979)What concepts are these metricsbased on?
  27. Network metrics are highly dependent on individual variables. Here: the same network with PageRank with four different values for the dampening parameter alpha. (red=highest PR value, yellow=second highest, turquoise=third highest)See Rieder 2012: http://computationalculture.net/article/what_is_in_pagerank