A Content Analysis: How Wikipedia Talk Pages Are Used (WebSci2010 poster)
Jodi Schneider, Alexandre Passant & John G. Breslin
Motivation
Wikipedia’s coordination costs (the number of Talk page edits for each article edit) have increased dramatically [1].

We are analyzing Talk pages to suggest how Semantic Web technologies (like structured annotations) could improve coordination.

[Figure: A typical discussion in a Wikipedia Talk page]

Content Analysis
We used 15 comment types; a comment could have multiple types.
We started with Viégas’ 11 types [2]:
1. Requests for editing coordination
2. Requests for information
3. References to vandalism
4. References to guidelines/policies
5. References to internal resources
6. Off-topic remarks
7. Polls
8. Requests for peer review
9. Information boxes
10. Images
11. Other
We added 4 new types:
1. References to external sources
2. Discussing reverts/removed material/controversial edits
3. Reference to edits made oneself
4. Recruiting help for another article/portal

Semantic Web Opportunities
We propose structured, meaningful annotations: the type of comment. Comment types could enable new ways to browse Talk pages, using Semantic Web technologies. We could instantaneously gather and show all comments of a certain type.

We have created a lightweight ontology, based on SIOC, where classes in the ontology correspond to common comment types we identified in the content analysis [4]:
http://rdfs.org/sioc/wikitalk

Users would tick checkboxes to indicate a comment’s type(s).

A JavaScript plugin could then highlight only certain comment types, for instance all “References to external sources”. With SPARQL, we could show all “help requests” from a group of pages.
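The browsing idea in the Semantic Web section (comments carrying one or more types, then gathering every comment of a given type across a group of pages) can be illustrated with a minimal in-memory sketch. This is not the wikitalk ontology, the proposed JavaScript plugin, or a SPARQL query: the `Comment` class, the type label strings, the page titles, and the mapping of “help requests” to the “Recruiting help” type are all assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field

# Hypothetical labels for three of the 15 comment types; the actual class
# names in the wikitalk ontology may differ.
COORDINATION = "Request for editing coordination"
EXTERNAL_SOURCE = "Reference to external sources"
HELP_REQUEST = "Recruiting help for another article/portal"


@dataclass(frozen=True)
class Comment:
    """One Talk page posting; `types` holds every type ticked for it."""
    page: str
    author: str
    text: str
    types: frozenset = field(default_factory=frozenset)


def comments_of_type(comments, comment_type, pages=None):
    """Return comments carrying `comment_type`, optionally limited to `pages`."""
    return [
        c for c in comments
        if comment_type in c.types and (pages is None or c.page in pages)
    ]


def postings_by_type(comments):
    """Count postings per type; a multi-typed comment counts once per type."""
    counts = Counter()
    for c in comments:
        counts.update(c.types)
    return counts


# Invented sample data, not taken from the study.
comments = [
    Comment("Talk:Example A", "UserOne",
            "Can someone merge these two sections?",
            frozenset({COORDINATION})),
    Comment("Talk:Example A", "UserTwo",
            "This journal article supports the claim; shall we split the work?",
            frozenset({EXTERNAL_SOURCE, COORDINATION})),
    Comment("Talk:Example B", "UserThree",
            "The Example C portal needs more editors.",
            frozenset({HELP_REQUEST})),
]

# Analogue of "show all help requests from a group of pages".
help_requests = comments_of_type(
    comments, HELP_REQUEST, pages={"Talk:Example A", "Talk:Example B"}
)
counts = postings_by_type(comments)
```

Storing the types as a set mirrors the statement that a comment could have multiple types; in the proposal itself this filtering would run over RDF annotations rather than Python objects.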




Method
We are examining 100 Talk pages, 20 from each of these categories:
1. Articles with the most contributors
2. Most-viewed articles
3. Controversial articles
4. Featured Articles
5. Random sample
This will help us to identify the types of conversations and the variance between pages. Existing studies focus on 1 or 2 article types and use small sample sizes of 6 to 60 articles.

[Figure: Talk page postings by type]
‘Coordination’ is the most common type of comment. Comment types depend on the page type. Discussions of ‘reverts/removed material/controversial edits’ are three times as likely on Talk pages of controversial articles. ‘Guidelines’ and ‘sources’ are commonly discussed. Info boxes are common in “most views” and “controversial” samples.

References
[1] B. Stvilia, M.B. Twidale, L.C. Smith, and L. Gasser, “Information Quality Work Organization in Wikipedia,” JASIST, vol. 59, 2008, pp. 983-1001.
[2] F.B. Viégas, M. Wattenberg, J. Kriss, and F. van Ham, “Talk Before You Type: Coordination in Wikipedia,” HICSS 2007, pp. 78-87.
[3] J. Schneider, A. Passant, and J.G. Breslin, “A Content Analysis: How Wikipedia Talk Pages Are Used,” WebScience 2010, Raleigh, North Carolina.
[4] J. Schneider, A. Passant, and J.G. Breslin, “Enhancing MediaWiki Talk pages with Semantics for Better Coordination - A Proposal,” The Fifth Workshop on Semantic Wikis: Linking Data and People at the 7th Extended Semantic Web Conference (ESWC), Crete, Greece: 2010.

Acknowledgements
The work presented in this paper has been funded by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Líon-2).
