SlideShare une entreprise Scribd logo
1  sur  9
Tagging structure in a protein-protein interaction network, a co-authorship network and the (English) Wikipedia Proteins Math co-authors Wikipedia articles Network Nodes  + Links Node tags Directed Acyclic Graph Gergely Palla ,  Illés  J.  Farka s , P é ter Pollner , Imre Derényi, Tamás Vicsek Category tree FIFA World Cup FIFA World Cup Players Classification of co-authored papers Combinatorics Graph theory Biochemical functions Growth Cell growth E ötvös University and Hungarian Academy of Sciences (Budapest, Hungary) Interactions (“MIPS”) Co-authorships (“MathSciNet”) Hyperlinks in Wikipedia CFinder.org
G. Palla, I. J. Farkas, P. Pollner, I. Derényi, T. Vicsek  Fundamental statistical features and self-similar properties of tagged networks   New Journal of Physics   10 , 123026 (2008)  http://CFinder.org  --> Publications Full version:
There is no clearly separated group of “most popular tags” Transition between popular and less popular tags is  continuous Portion ( x ) of all nodes in the network Probability that a given tag and its descendants label a portion  x  of all nodes CFinder.org
There is no clear group of “most heavily tagged nodes” Transition between strongly and less strongly tagged nodes is  continuous Number of tags ( n ) on a node (protein, author, wiki article) of the network Probability that a node has  n  tags CFinder.org
On the large scale there is no “link saturation” within a topic In fact, link density within a large topic is  almost the same  as outside CFinder.org Number of links among these nodes Number of Wikipedia articles selected from those labeled with “Japan” and descendant terms Maximum possible Wikipedia average
CFinder.org A B C D E F G H J Meaningful removal of loops from the category hierarchy How to achieve tree structure (DAG) with the lowest number of removals Example (Oct.2007): ,[object Object],[object Object],[object Object],[object Object],[object Object],subcategory subcategory Category:Urdu  Category:Hindustani
CFinder.org Meaningful removal of loops from the category hierarchy How to achieve tree structure (DAG) with the lowest number of removals Example (Oct.2007): ,[object Object],[object Object],[object Object],[object Object],[object Object],A B C D E F G H J subcategory subcategory ,[object Object],[object Object],[object Object],[object Object],[object Object],Category:Urdu  Category:Hindustani
Gergely Palla  Illés Farka s  P é ter Pollner  Imre Derényi   Tamás Vicsek
http://CFinder.org -- with network data and free analysis software We thank for support from:

Contenu connexe

En vedette

Searles Lake and the Trona Pinnacles
Searles Lake and the Trona PinnaclesSearles Lake and the Trona Pinnacles
Searles Lake and the Trona Pinnacles
Chris Austin
 
Ethiopia historic highlights july 21, 2013
Ethiopia historic highlights   july 21, 2013Ethiopia historic highlights   july 21, 2013
Ethiopia historic highlights july 21, 2013
Nebiyu Asfaw
 

En vedette (19)

Giving Firefox Users Control of Their Data
Giving Firefox Users Control of Their DataGiving Firefox Users Control of Their Data
Giving Firefox Users Control of Their Data
 
Issues with SignWriting in Unicode 8
Issues with SignWriting in Unicode 8Issues with SignWriting in Unicode 8
Issues with SignWriting in Unicode 8
 
Poetic APIs
Poetic APIsPoetic APIs
Poetic APIs
 
OpenGLAM-2013-Vassia-Atanassova
OpenGLAM-2013-Vassia-AtanassovaOpenGLAM-2013-Vassia-Atanassova
OpenGLAM-2013-Vassia-Atanassova
 
Wikipedia presentation 20130514
Wikipedia presentation 20130514Wikipedia presentation 20130514
Wikipedia presentation 20130514
 
KT ucloud storage, by Jaesuk Ahn
KT ucloud storage, by Jaesuk AhnKT ucloud storage, by Jaesuk Ahn
KT ucloud storage, by Jaesuk Ahn
 
Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia Improving the troubled relationship between Scientists and Wikipedia
Improving the troubled relationship between Scientists and Wikipedia
 
Design Stories Are The New User Stories
Design Stories Are The New User StoriesDesign Stories Are The New User Stories
Design Stories Are The New User Stories
 
TOR... ALL THE THINGS
TOR... ALL THE THINGSTOR... ALL THE THINGS
TOR... ALL THE THINGS
 
Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literatureAutomatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature
 
Designing speed with progressive enhancement
Designing speed with progressive enhancementDesigning speed with progressive enhancement
Designing speed with progressive enhancement
 
Black Ops of TCP/IP 2011 (Black Hat USA 2011)
Black Ops of TCP/IP 2011 (Black Hat USA 2011)Black Ops of TCP/IP 2011 (Black Hat USA 2011)
Black Ops of TCP/IP 2011 (Black Hat USA 2011)
 
Finding News Citations For Wikipedia
Finding News Citations For WikipediaFinding News Citations For Wikipedia
Finding News Citations For Wikipedia
 
Searles Lake and the Trona Pinnacles
Searles Lake and the Trona PinnaclesSearles Lake and the Trona Pinnacles
Searles Lake and the Trona Pinnacles
 
Entropic web
Entropic webEntropic web
Entropic web
 
Red team upgrades using sccm for malware deployment
Red team upgrades   using sccm for malware deploymentRed team upgrades   using sccm for malware deployment
Red team upgrades using sccm for malware deployment
 
Web performance tools @ WebPerf.camp 2016
Web performance tools @ WebPerf.camp 2016Web performance tools @ WebPerf.camp 2016
Web performance tools @ WebPerf.camp 2016
 
Creative Commons Enforcement
Creative Commons EnforcementCreative Commons Enforcement
Creative Commons Enforcement
 
Ethiopia historic highlights july 21, 2013
Ethiopia historic highlights   july 21, 2013Ethiopia historic highlights   july 21, 2013
Ethiopia historic highlights july 21, 2013
 

Similaire à Structure of tagging on Wikipedia pages and other networks -- Wikimania 2010 -- Gdansk

Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
Marcia Zeng
 
Capturing emerging relations between schema ontologies on the Web of Data
Capturing emerging relations between schema ontologies on the Web of DataCapturing emerging relations between schema ontologies on the Web of Data
Capturing emerging relations between schema ontologies on the Web of Data
Andriy Nikolov
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
Angelo Salatino
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
Angelo Salatino
 
20120622 web sci12-won-marc smith-semantic and social network analysis of …
20120622 web sci12-won-marc smith-semantic and social network analysis of …20120622 web sci12-won-marc smith-semantic and social network analysis of …
20120622 web sci12-won-marc smith-semantic and social network analysis of …
Marc Smith
 

Similaire à Structure of tagging on Wikipedia pages and other networks -- Wikimania 2010 -- Gdansk (20)

Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
Wikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsWikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing Documents
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011Contextual Ontology Alignment - ESWC 2011
Contextual Ontology Alignment - ESWC 2011
 
New Metrics for New Media Bay Area CIO IT Executives Meetup
New Metrics for New Media Bay Area CIO IT Executives MeetupNew Metrics for New Media Bay Area CIO IT Executives Meetup
New Metrics for New Media Bay Area CIO IT Executives Meetup
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
 
#FOJ2013 Follow-up communication in the blogosphere
#FOJ2013 Follow-up communication in the blogosphere#FOJ2013 Follow-up communication in the blogosphere
#FOJ2013 Follow-up communication in the blogosphere
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
 
Capturing emerging relations between schema ontologies on the Web of Data
Capturing emerging relations between schema ontologies on the Web of DataCapturing emerging relations between schema ontologies on the Web of Data
Capturing emerging relations between schema ontologies on the Web of Data
 
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
 
Do wikipedia science articles reflect on state-of-the-art research?
Do wikipedia science articles reflect on state-of-the-art research?Do wikipedia science articles reflect on state-of-the-art research?
Do wikipedia science articles reflect on state-of-the-art research?
 
A scalable gibbs sampler for probabilistic entity linking
A scalable gibbs sampler for probabilistic entity linkingA scalable gibbs sampler for probabilistic entity linking
A scalable gibbs sampler for probabilistic entity linking
 
mx & dbs
mx & dbsmx & dbs
mx & dbs
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology:  A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology:  A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
Exploring Content with Wikipedia
Exploring Content with WikipediaExploring Content with Wikipedia
Exploring Content with Wikipedia
 
Wikis as Social Networks: Evolution and Dynamics
Wikis as Social Networks:Evolution and Dynamics Wikis as Social Networks:Evolution and Dynamics
Wikis as Social Networks: Evolution and Dynamics
 
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasThe Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
 
On the Navigability of Social Tagging Systems
On the Navigability of Social Tagging SystemsOn the Navigability of Social Tagging Systems
On the Navigability of Social Tagging Systems
 
20120622 web sci12-won-marc smith-semantic and social network analysis of …
20120622 web sci12-won-marc smith-semantic and social network analysis of …20120622 web sci12-won-marc smith-semantic and social network analysis of …
20120622 web sci12-won-marc smith-semantic and social network analysis of …
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Dernier (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Structure of tagging on Wikipedia pages and other networks -- Wikimania 2010 -- Gdansk

  • 1. Tagging structure in a protein-protein interaction network, a co-authorship network and the (English) Wikipedia Proteins Math co-authors Wikipedia articles Network Nodes + Links Node tags Directed Acyclic Graph Gergely Palla , Illés J. Farka s , P é ter Pollner , Imre Derényi, Tamás Vicsek Category tree FIFA World Cup FIFA World Cup Players Classification of co-authored papers Combinatorics Graph theory Biochemical functions Growth Cell growth E ötvös University and Hungarian Academy of Sciences (Budapest, Hungary) Interactions (“MIPS”) Co-authorships (“MathSciNet”) Hyperlinks in Wikipedia CFinder.org
  • 2. G. Palla, I. J. Farkas, P. Pollner, I. Derényi, T. Vicsek  Fundamental statistical features and self-similar properties of tagged networks   New Journal of Physics   10 , 123026 (2008)  http://CFinder.org --> Publications Full version:
  • 3. There is no clearly separated group of “most popular tags” Transition between popular and less popular tags is continuous Portion ( x ) of all nodes in the network Probability that a given tag and its descendants label a portion x of all nodes CFinder.org
  • 4. There is no clear group of “most heavily tagged nodes” Transition between strongly and less strongly tagged nodes is continuous Number of tags ( n ) on a node (protein, author, wiki article) of the network Probability that a node has n tags CFinder.org
  • 5. On the large scale there is no “link saturation” within a topic In fact, link density within a large topic is almost the same as outside CFinder.org Number of links among these nodes Number of Wikipedia articles selected from those labeled with “Japan” and descendant terms Maximum possible Wikipedia average
  • 6.
  • 7.
  • 8. Gergely Palla Illés Farka s P é ter Pollner Imre Derényi Tamás Vicsek
  • 9. http://CFinder.org -- with network data and free analysis software We thank for support from: