SlideShare a Scribd company logo
1 of 76
MAPPING THE WEB
Marta Severo – Université de Lille 3, Laboratoire Gériico
marta.severo@univ-lille3.fr
13 August 2013, University of Sao Paulo, Escola de
comunicaçoes et artes (ECA/USP)
MAPPING THE WEB
THE PRINCIPLE
The web mapping is based on the idea
that hyperlinks created on the web can
be used as a proxy of social ties
WEB MAPPING
THE PRACTICE
We generate a graph that traces the
network created by hyperlinks on a set of
web pages
MAPPING THE U.S. BLOGOSPHERE (2004)
 Méthodes numériques
Divided they Blog
Adamic & Glance, 2005
Govcom.org, 2008
FRENCH POLITICAL BLOGOSPHERE
http://politicosphere.blog.lemonde.fr/ (Linkfluence)
CAN WE MAP THE WEB?
WHAT IS IT THE WORLD WIDE WEB?
The World Wide Web (abbreviated as
WWW or W3, commonly known as the
web), is a system of interlinked hypertext
documents accessed via the Internet. With a
web browser, one can view web pages that
may contain text, images, videos, and other
multimedia, and navigate between them via
hyperlinks. (Wikipedia)
http://internet-map.net
THE RISK OF WEB MAPPING
HOW TO GET AN EFFECTIVE AND READABLE
MAP OF THE WEB?
TO BECOME A WEB MAPPER YOU HAVE TO...
Understand the
morphology of the Web
Know how to build a
corpus of web sites
Know how to represent a
corpus of web sites
THE MORPHOLOGY OF THE WEB:
THE POWER LAW Barabási, Albert-László (2002)
Linked: The New Science of Networks
NETWORKS OF SCIENTIFIC PAPERS
  D. De Solla Prince, 1965
Science, 149(3683) : 510-515
WHAT DOES THE WEB LOOK LIKE?
WHAT DOES THE WEB LOOK LIKE?
the higher layer
(Fishusa.com)
the top layer
(Wikipedia.org)
the lower layer
(Thediaryofalakenerd.blogspot.com)
the middle layer
(Icefishingtoday.com)
THE LAYERS OF THE WEB
First page of Google
(interested users find them)
First 2/3 results of Google
(everyone see them)
Not showing/indexed
(nowhere to be found)
First 10 pages of Google
(experts find them)
higher layer
top layer
lower layer
middle layer
THE LAYERS OF THE WEB
higher layer
top layer
lower layer
middle layer
THE LAYERS OF THE WEB
higher layer
top layer
lower layer
middle layer
THE LAYERS OF THE WEB
TO BECOME A WEB MAPPER YOU HAVE TO...
Understand the
morphology of the Web
Know how to build a
corpus of web sites
Know how to represent
a corpus of web sites
SOFTWARE: WEB CRAWLER
 Automatic crawler (Issuecrawler)
 Manual crawler (Navicrawler)
Automatic crawler
https://www.issuecrawler.net/
RISKS OF AUTOMATIC CRAWLER
RISKS OF AUTOMATIC CRAWLER
RISKS OF AUTOMATIC CRAWLER
RISKS OF AUTOMATIC CRAWLER
http://webatlas.fr/wp/navicrawler/
MANUAL CRAWLER: NAVICRAWLER
DEFINE THE CORPUS
Tear
Difficult choice
Cut
DEFINE THE CORPUS
THE CORPUS ON THE MAP
entering
the domain
1
excluding
top layer
including
the nebula
including
the core
exploring
the filaments
2
3
4
5
WEB MAPPING FROM THE PRATICAL VIEWPOINT
http://webatlas.fr/wp/navicrawler/
NAVICRAWLER
TURN ON NAVICRAWLER
http://webatlas.fr/wp/navicrawler/
information on the page
information on the site
information on the corpus
lists
sort and find
WINDOW « NAV »
What is a URL? What is a domain name?
WHAT IS A WEBSITE ?
IF WEBSITES ARE SMALLER OF A DOMAIN
NAME..
IF WEBSITES ARE BIGGER THAN A DOMAIN
NAME…
SOCIAL NETWORKS
 Facebook is just on node
https://www.facebook.com/pages/Atopos/
150996404962854
 Twitter accounts are separate nodes but
links in tweets can not be identified
https://twitter.com/atopos_usp
CONSTITUTION OF THE CORPUS
www.webatlas.fr
ATTENTION TO THE DEEPNESS
ATTENTION TO THE DISTANCE
TO BECOME A WEB MAPPER YOU HAVE TO...
Understand the
morphology of the Web
Know how to build a
corpus of web sites
Know how to represent
a corpus of web sites
GEPHI
https://gephi.org/
THE ANALYSIS OF THE GRAPH GIVES US
THREE TYPES OF INFORMATION
1. LAYOUT : Applying an algorithm force-
vector
2. RANKING : Applying a degree
classification
3. PARTITION : Applying partition by color
1. LAYOUT > PROXIMITY
 Two nodes are close if the sites they
represent are directly or indirectly linked.
 Questions :
  1.1 Which are the debates or
communities?
(identification of clusters of nodes)
  1.2. What are the sites that connect
debates / communities?
(identification of bridge between clusters)
ALGORITHM FORCE-VECTOR
1.1. WHICH ARE THE DEBATES OR
COMMUNITIES?
(IDENTIFICATION OF CLUSTERS)
1.1. WHICH ARE THE DEBATES OR
COMMUNITIES?
(IDENTIFICATION OF CLUSTERS)
1.2. WHAT ARE THE SITES THAT CONNECT
DEBATES / COMMUNITIES?
(IDENTIFICATION OF BRIDGE BETWEEN
CLUSTERS)
2. RANKING > AUTORITHIES AND HUBS
 The size of the nodes may be proportional to
the authority of the site (in-degree) or to its
role of information relay (out-degree).
 Questions:
  2.1. What sites are opinion leaders of online
debate?
(identification of graph autorities)
  2.2. What are the sites that bring together the
online debate?
(identification of graph hubs)
2.1. WHAT SITES ARE OPINION LEADERS OF ONLINE
DEBATE?
(IDENTIFICATION OF GRAPH AUTORITIES)
2.2. WHAT ARE THE SITES THAT BRING
TOGETHER THE ONLINE DEBATE?
(IDENTIFICATION OF GRAPH HUBS)
3. PARTITION > CATEGORIZATION
 The color of the nodes can be changed
to show different categories.
 Question :
  How are distributed the different types of
sites?
(evaluation of the topology)
3.1. HOW ARE DISTRIBUTED THE DIFFERENT TYPES
OF SITES?
(EVALUATION OF THE TOPOLOGY)
3.1. HOW ARE DISTRIBUTED THE DIFFERENT TYPES
OF SITES?
(EVALUATION OF THE TOPOLOGY)
EXAMPLES
CONTROVERSY MAPPING
http://controverses.sciences-po.fr/archive/decroissance/
Auteur : étudiants de Sciences Po – Paris (cours de cartographie des
controverses)
CONTROVERSY MAPPING
http://controverses.sciences-po.fr/archive/decroissance/
Auteur : étudiants de Sciences Po – Paris (cours de cartographie des
controverses)
PRACTICES IN BUSINESS
 Monitoring a sector or a product
 Studying a community and identifying
leaders
 Studying e-reputation on the social web
 Studying of spontaneous conversations
around a brand
 Studying the viral spread of content…..
MAPPING OF A COMMUNITY: GITHUB, SOCIAL
NETWORKS, OPEN SOURCE DEVELOPERS
Auteur : linkfluence.net
MAPPING A SECTOR:
ACTORS OF URANIUM MINING IN AFRICA
Auteur : Susana Nuneshttp://www.hellodata.eu/uranium/
Auteur : Susana Nunes
MAPPING A SECTOR:
INVESTORS IN NEW TECHNOLOGIES (2010)
Auteur : linkfluence.net
TEACHING WEB MAPPING
Exercice : « How to promote a web portal for illustrators to
develop this activity? »
Exercice : « Discursive communities related to hipsters in France»
TEACHING WEB MAPPING
EXEMPLES
Severo M. (2012), « Le patrimoine culturel immatériel sur la Toile. Comparaison
entre réseaux nationaux », in Culture et recherche, n. 127, p. 58-57
http://www.culturecommunication.gouv.fr/content/download/53634/415776/file/
Culture%20et%20recherche%20127_automne%202012.pdf
BAGUALA PROJECT
 http://baguala.hypotheses.org
 Project coordinated by Pierre Gautreau,
Université de Paris 1
See article :
 P. Gautreau, H. Hasenack, L. Lerch, G.
Merlinksy, M. Noucher, M. Severo,
« Comparison of the open environmental
data diffusion in Argentina, Bolivia and
Brazil »,
http://hal.archives-ouvertes.fr/hal-00744805
THE GOALS
 Studying the uses of open environmental
data in Latin America and France
 Understanding how Internet changes the
ways the Society represents and manages
its environment, through the supply of
information and data online
 Building an inventory of websites that
provide information or data about the
environment in Argentina, Bolivia and
Brazil
1. SELECTION OF WEBSITES
 275 requests to the Google search engine,
for each of the three countries. For each
country, the list of requests was
established combining the name of an
administrative unit, an environmental
keyword (environment, nature,
environmental education, biodiversity,
pollution, water, climatic change,
environnemental risk, waste, soil, forest)
 For each request, the first 50 answers
were examined, and the pertinent websites
were included to our corpus
2. CATEGORIZATION
  Each website has benne visited and described
through 30 categories, aiming to characterize its
author, the author of its content (frequently not the
same one as the site’s author), the objectives of the
site, its main theme, and the kind of data it supplied
3. MAPPING THE WEB
 we searched for the hyperlinks between
these sites through a “webcrawling”, and
represented the graph they formed
 ATTENTION: For a correct evaluation of
the results that will be discussed, it is
important to remember that this inventory
is only a picture of the web of the period
when it was performed (the mid-2012
year), that will be quickly outdated due to
the rapid changes of the environmental
sites.
ACTIVIST WEBSITES
2 TYPES OF ACTIVIST WEBSITES
 information hubs, their protest action is
aimed at spreading knowledge. In this
category we find primarily alternative
media (indymedia.argentina.org,
rebellion.org, www.adital.com.br) and
social movements
 sites of movements that focus on the
organization of activities in the physical
space. Their protest is geographically
and thematically focused

More Related Content

Similar to Mapping the web

Web20 Intro Naj Shaik
Web20 Intro Naj ShaikWeb20 Intro Naj Shaik
Web20 Intro Naj ShaikKaren Vignare
 
Web 2.0 Instructional Tools
Web 2.0 Instructional ToolsWeb 2.0 Instructional Tools
Web 2.0 Instructional ToolsAntwuan Stinson
 
Introduction to the Social Semantic Web
Introduction to the Social Semantic WebIntroduction to the Social Semantic Web
Introduction to the Social Semantic Webmdabrowski
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0Michael Habib
 
WEB 3 IS THE FILE UPLOADED IN THIS APPROACH
WEB 3 IS THE FILE UPLOADED IN THIS APPROACHWEB 3 IS THE FILE UPLOADED IN THIS APPROACH
WEB 3 IS THE FILE UPLOADED IN THIS APPROACHBalasundaramSr
 
Community Media 2.0:
Community Media 2.0:  Community Media 2.0:
Community Media 2.0: Felicia
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Carlo Vaccari
 
The Web, The User and the Library (and why to get in between)
The Web, The User and the Library (and why to get in between)The Web, The User and the Library (and why to get in between)
The Web, The User and the Library (and why to get in between)Guus van den Brekel
 
The web you were used to is gone. Architecture and strategy for your content.
The web you were used to is gone. Architecture and strategy for your content.The web you were used to is gone. Architecture and strategy for your content.
The web you were used to is gone. Architecture and strategy for your content.Alberta Soranzo
 
Learning as a Social Process
Learning as a Social ProcessLearning as a Social Process
Learning as a Social ProcessRobert Cormia
 
NIH Management Series Seminar - June 2008 - Jim Angus
NIH Management Series Seminar - June 2008 - Jim AngusNIH Management Series Seminar - June 2008 - Jim Angus
NIH Management Series Seminar - June 2008 - Jim AngusJim Angus
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?BO TRUE ACTIVITIES SL
 
Social Software in Education
Social Software in EducationSocial Software in Education
Social Software in EducationLaura Blankenship
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Digital Methods Initiative
 
Walking Our Way to the Web
Walking Our Way to the WebWalking Our Way to the Web
Walking Our Way to the WebFabien Gandon
 
Exploiting Semantic Web Techniques For Representing And Utilising
Exploiting Semantic Web Techniques For Representing And UtilisingExploiting Semantic Web Techniques For Representing And Utilising
Exploiting Semantic Web Techniques For Representing And UtilisingOwen Sacco
 

Similar to Mapping the web (20)

Web20 Intro Naj Shaik
Web20 Intro Naj ShaikWeb20 Intro Naj Shaik
Web20 Intro Naj Shaik
 
Web 2.0 Instructional Tools
Web 2.0 Instructional ToolsWeb 2.0 Instructional Tools
Web 2.0 Instructional Tools
 
Introduction to the Social Semantic Web
Introduction to the Social Semantic WebIntroduction to the Social Semantic Web
Introduction to the Social Semantic Web
 
From Academic Library 2.0 to (Literature) Research 2.0
From Academic Library 2.0  to (Literature) Research 2.0From Academic Library 2.0  to (Literature) Research 2.0
From Academic Library 2.0 to (Literature) Research 2.0
 
WEB 3 IS THE FILE UPLOADED IN THIS APPROACH
WEB 3 IS THE FILE UPLOADED IN THIS APPROACHWEB 3 IS THE FILE UPLOADED IN THIS APPROACH
WEB 3 IS THE FILE UPLOADED IN THIS APPROACH
 
Community Media 2.0:
Community Media 2.0:  Community Media 2.0:
Community Media 2.0:
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups
 
The Web, The User and the Library (and why to get in between)
The Web, The User and the Library (and why to get in between)The Web, The User and the Library (and why to get in between)
The Web, The User and the Library (and why to get in between)
 
The web you were used to is gone. Architecture and strategy for your content.
The web you were used to is gone. Architecture and strategy for your content.The web you were used to is gone. Architecture and strategy for your content.
The web you were used to is gone. Architecture and strategy for your content.
 
Learning as a Social Process
Learning as a Social ProcessLearning as a Social Process
Learning as a Social Process
 
NIH Management Series Seminar - June 2008 - Jim Angus
NIH Management Series Seminar - June 2008 - Jim AngusNIH Management Series Seminar - June 2008 - Jim Angus
NIH Management Series Seminar - June 2008 - Jim Angus
 
Web 30 and RSS
Web 30 and RSSWeb 30 and RSS
Web 30 and RSS
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?
Hyperlink Formation in Social Bookmarking Systems: Who is Who Online?
 
Webolution
WebolutionWebolution
Webolution
 
Social Software in Education
Social Software in EducationSocial Software in Education
Social Software in Education
 
Internet Mashups
Internet MashupsInternet Mashups
Internet Mashups
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
 
Walking Our Way to the Web
Walking Our Way to the WebWalking Our Way to the Web
Walking Our Way to the Web
 
Exploiting Semantic Web Techniques For Representing And Utilising
Exploiting Semantic Web Techniques For Representing And UtilisingExploiting Semantic Web Techniques For Representing And Utilising
Exploiting Semantic Web Techniques For Representing And Utilising
 

More from Marta Severo

Using Web Archives for Studying Cultural Heritage Collaborative Platforms
Using Web Archives for Studying Cultural Heritage Collaborative PlatformsUsing Web Archives for Studying Cultural Heritage Collaborative Platforms
Using Web Archives for Studying Cultural Heritage Collaborative PlatformsMarta Severo
 
Rappresentazioni digitali della Via Francigena
Rappresentazioni digitali della Via FrancigenaRappresentazioni digitali della Via Francigena
Rappresentazioni digitali della Via FrancigenaMarta Severo
 
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...Marta Severo
 
Archiving news on the Web through RSS flows. A new tool for studying interna...
Archiving news on the Web through RSS flows. A new tool for studying interna...Archiving news on the Web through RSS flows. A new tool for studying interna...
Archiving news on the Web through RSS flows. A new tool for studying interna...Marta Severo
 
I dati del social web: e-reputation e identità digitale
I dati del social web: e-reputation e identità digitaleI dati del social web: e-reputation e identità digitale
I dati del social web: e-reputation e identità digitaleMarta Severo
 
I dati del social web : Social media monitoring
I dati del social web : Social media monitoringI dati del social web : Social media monitoring
I dati del social web : Social media monitoringMarta Severo
 
ANR GEOMEDIA project
ANR GEOMEDIA projectANR GEOMEDIA project
ANR GEOMEDIA projectMarta Severo
 
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...Marta Severo
 
Digital methods for Social Sciences: origin and definitions
Digital methods for Social Sciences: origin and definitionsDigital methods for Social Sciences: origin and definitions
Digital methods for Social Sciences: origin and definitionsMarta Severo
 

More from Marta Severo (9)

Using Web Archives for Studying Cultural Heritage Collaborative Platforms
Using Web Archives for Studying Cultural Heritage Collaborative PlatformsUsing Web Archives for Studying Cultural Heritage Collaborative Platforms
Using Web Archives for Studying Cultural Heritage Collaborative Platforms
 
Rappresentazioni digitali della Via Francigena
Rappresentazioni digitali della Via FrancigenaRappresentazioni digitali della Via Francigena
Rappresentazioni digitali della Via Francigena
 
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...
Itinéraires culturels et médias sociaux : outils pour la gouvernance de la...
 
Archiving news on the Web through RSS flows. A new tool for studying interna...
Archiving news on the Web through RSS flows. A new tool for studying interna...Archiving news on the Web through RSS flows. A new tool for studying interna...
Archiving news on the Web through RSS flows. A new tool for studying interna...
 
I dati del social web: e-reputation e identità digitale
I dati del social web: e-reputation e identità digitaleI dati del social web: e-reputation e identità digitale
I dati del social web: e-reputation e identità digitale
 
I dati del social web : Social media monitoring
I dati del social web : Social media monitoringI dati del social web : Social media monitoring
I dati del social web : Social media monitoring
 
ANR GEOMEDIA project
ANR GEOMEDIA projectANR GEOMEDIA project
ANR GEOMEDIA project
 
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...
Net-activisme et territoire : réseaux locaux et réseaux mondiaux pendant le p...
 
Digital methods for Social Sciences: origin and definitions
Digital methods for Social Sciences: origin and definitionsDigital methods for Social Sciences: origin and definitions
Digital methods for Social Sciences: origin and definitions
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesShubhangi Sonawane
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxNikitaBankoti2
 

Recently uploaded (20)

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 

Mapping the web

  • 1. MAPPING THE WEB Marta Severo – Université de Lille 3, Laboratoire Gériico marta.severo@univ-lille3.fr 13 August 2013, University of Sao Paulo, Escola de comunicaçoes et artes (ECA/USP)
  • 2. MAPPING THE WEB THE PRINCIPLE The web mapping is based on the idea that hyperlinks created on the web can be used as a proxy of social ties
  • 3. WEB MAPPING THE PRACTICE We generate a graph that traces the network created by hyperlinks on a set of web pages
  • 4. MAPPING THE U.S. BLOGOSPHERE (2004)  Méthodes numériques Divided they Blog Adamic & Glance, 2005
  • 7. CAN WE MAP THE WEB?
  • 8. WHAT IS IT THE WORLD WIDE WEB? The World Wide Web (abbreviated as WWW or W3, commonly known as the web), is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images, videos, and other multimedia, and navigate between them via hyperlinks. (Wikipedia)
  • 10. HOW TO GET AN EFFECTIVE AND READABLE MAP OF THE WEB?
  • 11. TO BECOME A WEB MAPPER YOU HAVE TO... Understand the morphology of the Web Know how to build a corpus of web sites Know how to represent a corpus of web sites
  • 12. THE MORPHOLOGY OF THE WEB: THE POWER LAW Barabási, Albert-László (2002) Linked: The New Science of Networks
  • 13. NETWORKS OF SCIENTIFIC PAPERS   D. De Solla Prince, 1965 Science, 149(3683) : 510-515
  • 14. WHAT DOES THE WEB LOOK LIKE?
  • 15. WHAT DOES THE WEB LOOK LIKE?
  • 16. the higher layer (Fishusa.com) the top layer (Wikipedia.org) the lower layer (Thediaryofalakenerd.blogspot.com) the middle layer (Icefishingtoday.com) THE LAYERS OF THE WEB
  • 17. First page of Google (interested users find them) First 2/3 results of Google (everyone see them) Not showing/indexed (nowhere to be found) First 10 pages of Google (experts find them) higher layer top layer lower layer middle layer THE LAYERS OF THE WEB
  • 18. higher layer top layer lower layer middle layer THE LAYERS OF THE WEB
  • 19. higher layer top layer lower layer middle layer THE LAYERS OF THE WEB
  • 20. TO BECOME A WEB MAPPER YOU HAVE TO... Understand the morphology of the Web Know how to build a corpus of web sites Know how to represent a corpus of web sites
  • 21. SOFTWARE: WEB CRAWLER  Automatic crawler (Issuecrawler)  Manual crawler (Navicrawler)
  • 30. THE CORPUS ON THE MAP
  • 31. entering the domain 1 excluding top layer including the nebula including the core exploring the filaments 2 3 4 5 WEB MAPPING FROM THE PRATICAL VIEWPOINT
  • 34. information on the page information on the site information on the corpus lists sort and find WINDOW « NAV »
  • 35. What is a URL? What is a domain name? WHAT IS A WEBSITE ?
  • 36. IF WEBSITES ARE SMALLER OF A DOMAIN NAME..
  • 37. IF WEBSITES ARE BIGGER THAN A DOMAIN NAME…
  • 38. SOCIAL NETWORKS  Facebook is just on node https://www.facebook.com/pages/Atopos/ 150996404962854  Twitter accounts are separate nodes but links in tweets can not be identified https://twitter.com/atopos_usp
  • 39. CONSTITUTION OF THE CORPUS www.webatlas.fr
  • 40. ATTENTION TO THE DEEPNESS
  • 41. ATTENTION TO THE DISTANCE
  • 42. TO BECOME A WEB MAPPER YOU HAVE TO... Understand the morphology of the Web Know how to build a corpus of web sites Know how to represent a corpus of web sites
  • 44. THE ANALYSIS OF THE GRAPH GIVES US THREE TYPES OF INFORMATION 1. LAYOUT : Applying an algorithm force- vector 2. RANKING : Applying a degree classification 3. PARTITION : Applying partition by color
  • 45. 1. LAYOUT > PROXIMITY  Two nodes are close if the sites they represent are directly or indirectly linked.  Questions :   1.1 Which are the debates or communities? (identification of clusters of nodes)   1.2. What are the sites that connect debates / communities? (identification of bridge between clusters)
  • 47. 1.1. WHICH ARE THE DEBATES OR COMMUNITIES? (IDENTIFICATION OF CLUSTERS)
  • 48. 1.1. WHICH ARE THE DEBATES OR COMMUNITIES? (IDENTIFICATION OF CLUSTERS)
  • 49. 1.2. WHAT ARE THE SITES THAT CONNECT DEBATES / COMMUNITIES? (IDENTIFICATION OF BRIDGE BETWEEN CLUSTERS)
  • 50. 2. RANKING > AUTORITHIES AND HUBS  The size of the nodes may be proportional to the authority of the site (in-degree) or to its role of information relay (out-degree).  Questions:   2.1. What sites are opinion leaders of online debate? (identification of graph autorities)   2.2. What are the sites that bring together the online debate? (identification of graph hubs)
  • 51. 2.1. WHAT SITES ARE OPINION LEADERS OF ONLINE DEBATE? (IDENTIFICATION OF GRAPH AUTORITIES)
  • 52. 2.2. WHAT ARE THE SITES THAT BRING TOGETHER THE ONLINE DEBATE? (IDENTIFICATION OF GRAPH HUBS)
  • 53. 3. PARTITION > CATEGORIZATION  The color of the nodes can be changed to show different categories.  Question :   How are distributed the different types of sites? (evaluation of the topology)
  • 54. 3.1. HOW ARE DISTRIBUTED THE DIFFERENT TYPES OF SITES? (EVALUATION OF THE TOPOLOGY)
  • 55. 3.1. HOW ARE DISTRIBUTED THE DIFFERENT TYPES OF SITES? (EVALUATION OF THE TOPOLOGY)
  • 57. CONTROVERSY MAPPING http://controverses.sciences-po.fr/archive/decroissance/ Auteur : étudiants de Sciences Po – Paris (cours de cartographie des controverses)
  • 58. CONTROVERSY MAPPING http://controverses.sciences-po.fr/archive/decroissance/ Auteur : étudiants de Sciences Po – Paris (cours de cartographie des controverses)
  • 59. PRACTICES IN BUSINESS  Monitoring a sector or a product  Studying a community and identifying leaders  Studying e-reputation on the social web  Studying of spontaneous conversations around a brand  Studying the viral spread of content…..
  • 60. MAPPING OF A COMMUNITY: GITHUB, SOCIAL NETWORKS, OPEN SOURCE DEVELOPERS Auteur : linkfluence.net
  • 61. MAPPING A SECTOR: ACTORS OF URANIUM MINING IN AFRICA Auteur : Susana Nuneshttp://www.hellodata.eu/uranium/
  • 63. MAPPING A SECTOR: INVESTORS IN NEW TECHNOLOGIES (2010) Auteur : linkfluence.net
  • 64. TEACHING WEB MAPPING Exercice : « How to promote a web portal for illustrators to develop this activity? »
  • 65. Exercice : « Discursive communities related to hipsters in France» TEACHING WEB MAPPING
  • 66. EXEMPLES Severo M. (2012), « Le patrimoine culturel immatériel sur la Toile. Comparaison entre réseaux nationaux », in Culture et recherche, n. 127, p. 58-57 http://www.culturecommunication.gouv.fr/content/download/53634/415776/file/ Culture%20et%20recherche%20127_automne%202012.pdf
  • 67. BAGUALA PROJECT  http://baguala.hypotheses.org  Project coordinated by Pierre Gautreau, Université de Paris 1 See article :  P. Gautreau, H. Hasenack, L. Lerch, G. Merlinksy, M. Noucher, M. Severo, « Comparison of the open environmental data diffusion in Argentina, Bolivia and Brazil », http://hal.archives-ouvertes.fr/hal-00744805
  • 68. THE GOALS  Studying the uses of open environmental data in Latin America and France  Understanding how Internet changes the ways the Society represents and manages its environment, through the supply of information and data online  Building an inventory of websites that provide information or data about the environment in Argentina, Bolivia and Brazil
  • 69. 1. SELECTION OF WEBSITES  275 requests to the Google search engine, for each of the three countries. For each country, the list of requests was established combining the name of an administrative unit, an environmental keyword (environment, nature, environmental education, biodiversity, pollution, water, climatic change, environnemental risk, waste, soil, forest)  For each request, the first 50 answers were examined, and the pertinent websites were included to our corpus
  • 70. 2. CATEGORIZATION   Each website has benne visited and described through 30 categories, aiming to characterize its author, the author of its content (frequently not the same one as the site’s author), the objectives of the site, its main theme, and the kind of data it supplied
  • 71. 3. MAPPING THE WEB  we searched for the hyperlinks between these sites through a “webcrawling”, and represented the graph they formed  ATTENTION: For a correct evaluation of the results that will be discussed, it is important to remember that this inventory is only a picture of the web of the period when it was performed (the mid-2012 year), that will be quickly outdated due to the rapid changes of the environmental sites.
  • 72.
  • 73.
  • 74.
  • 76. 2 TYPES OF ACTIVIST WEBSITES  information hubs, their protest action is aimed at spreading knowledge. In this category we find primarily alternative media (indymedia.argentina.org, rebellion.org, www.adital.com.br) and social movements  sites of movements that focus on the organization of activities in the physical space. Their protest is geographically and thematically focused