Measuring the Quality of Web Content using Factual Information
Elisabeth Lex
1.
Measuring the Quality of Web Content using Factual Information
WebQuality 2012 workshop at WWW 2012, 16 April 2012
Elisabeth Lex, Michael Voelske, Marcelo Errecalde, Edgardo Ferretti, Leticia Cagnina, Christopher Horn, Benno Stein and Michael Granitzer
www.know-center.at
© Know-Center 2012, funded by the Competence Centers program (Kompetenzzentrenprogramm)
2.
Agenda
- Motivation
- Approach
- Results
- Summary and Outlook
3.
Motivation
- People's decisions are often based on Web content
- Web content lacks quality control and verification
  - Inaccurate, incorrect information
  - No fact checking
- Measures are needed to capture credibility and quality aspects
  - With respect to facts!
4.
Approach
Measure information quality based on factual information
Three approaches:
- Use simple statistics about the facts obtained from text
- Exploit relational information contained in facts
- Use semantic relationships such as meronymy and hypernymy
First approach: use simple statistical features about the facts in a document
- Indicates how informative a document is
- Facts are derived from Web content using Open Information Extraction
5.
Definition of Factual Density
- Fact Count
- Factual Density
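The formulas on this slide did not survive in the transcript. The following is a minimal sketch of the two measures, assuming that the fact count is the number of facts extracted from a document and that factual density is the fact count normalized by document length. Measuring length in whitespace-separated words, the function names, and the toy facts are all assumptions made for illustration.

```python
from typing import List, Tuple

# A fact as produced by an Open Information Extraction tool:
# (argument 1, relation phrase, argument 2). The layout is an assumption.
Fact = Tuple[str, str, str]


def fact_count(facts: List[Fact]) -> int:
    """Number of facts extracted from one document."""
    return len(facts)


def factual_density(facts: List[Fact], text: str) -> float:
    """Fact count divided by document length in words (assumed unit)."""
    n_words = len(text.split())
    if n_words == 0:
        return 0.0  # avoid division by zero for empty documents
    return fact_count(facts) / n_words


# Toy document with two hand-written facts (not real extractor output).
text = "The city is the capital of the region. The city has two universities."
facts = [
    ("The city", "is the capital of", "the region"),
    ("The city", "has", "two universities"),
]

density = factual_density(facts, text)
```

Normalizing by length is what lets the measure compare a short and a long document directly, which matters because longer documents tend to contain more facts.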
6.
Experiments
- Wikipedia: 1000 Featured and Good articles versus 1000 Non-Featured articles (randomly selected)
- Featured: a comprehensive coverage of the major facts in the context of the article’s subject
- Baseline: Word Count [Blumenstock 2008]
  - Featured articles are longer than non-featured ones
  - Bias: longer documents contain more facts
- Evaluation on 2 datasets
  - Unbalanced: articles differ in length
  - Balanced: articles are similar in length
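The slides do not say how the balanced dataset was built or which threshold the word count baseline uses. The sketch below shows one plausible way to do both: a threshold rule on word count, and a greedy pairing of featured and non-featured articles whose lengths differ by at most a relative tolerance. The function names, the `tolerance` parameter, and the threshold are hypothetical.

```python
from typing import List, Tuple

Article = Tuple[str, int]  # (title, word count)


def word_count_baseline(word_count: int, threshold: int) -> bool:
    """Predict 'featured' when the article is at least `threshold` words long.
    The threshold is a free parameter here, not a value from the slides."""
    return word_count >= threshold


def balance_by_length(
    featured: List[Article],
    non_featured: List[Article],
    tolerance: float = 0.1,
) -> List[Tuple[Article, Article]]:
    """Greedily pair each featured article with the unused non-featured
    article of closest length, keeping the pair only if the lengths differ
    by at most `tolerance` (relative to the featured article's length)."""
    remaining = list(non_featured)
    pairs = []
    for art in sorted(featured, key=lambda a: a[1]):
        if not remaining:
            break
        best = min(remaining, key=lambda a: abs(a[1] - art[1]))
        if abs(best[1] - art[1]) <= tolerance * art[1]:
            pairs.append((art, best))
            remaining.remove(best)  # each article is used at most once
    return pairs


featured = [("F1", 1000), ("F2", 5000)]
non_featured = [("N1", 950), ("N2", 300), ("N3", 5200)]
pairs = balance_by_length(featured, non_featured)
```

On a set balanced this way, word count carries little signal by construction, so any remaining separation between the classes has to come from another feature such as factual density.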
7.
Distributions of documents in both datasets with respect to word count
8.
Precision/Recall curves of Factual Density
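The curves themselves are not preserved in the transcript. As an illustration of how such a curve can be obtained, the sketch below ranks documents by a score (for example factual density) and records precision and recall after each rank position. The function name and the toy scores are assumptions, and tied scores are not treated specially.

```python
from typing import List, Tuple


def precision_recall_points(
    scores: List[float], labels: List[bool]
) -> List[Tuple[float, float]]:
    """Return (precision, recall) after each position of the ranking
    obtained by sorting documents by descending score.
    labels[i] is True when document i belongs to the positive class."""
    total_pos = sum(1 for y in labels if y)
    if total_pos == 0:
        raise ValueError("at least one positive example is required")
    ranked = sorted(zip(scores, labels), key=lambda p: p[0], reverse=True)
    points = []
    true_pos = 0
    for k, (_, is_pos) in enumerate(ranked, start=1):
        if is_pos:
            true_pos += 1
        points.append((true_pos / k, true_pos / total_pos))
    return points


# Toy factual density scores; True marks a featured/good article.
scores = [0.9, 0.8, 0.4, 0.2]
labels = [True, False, True, False]
points = precision_recall_points(scores, labels)
```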
9.
Results
Factual Density on balanced corpus
10.
Experiments – Relational Features
Approach 2: exploiting relational information contained in facts
- Extract relational features from articles
- Use relations from ReVerb: binary relations (e1, relation, e2)
- Use them to train a classifier to discriminate between featured/good and non-featured articles
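The slides do not state which features or which classifier were used. As one possible reading, the sketch below turns the (e1, relation, e2) triples of an article into a bag of relation phrases and trains a small multinomial Naive Bayes model with add-one smoothing. The feature design, the choice of Naive Bayes, all names, and the toy triples are assumptions.

```python
import math
from collections import Counter
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (e1, relation, e2)


def relation_features(triples: List[Triple]) -> Counter:
    """Bag of lowercased relation phrases for one article."""
    return Counter(rel.lower() for _, rel, _ in triples)


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs: List[Counter], labels: List[str]) -> "NaiveBayes":
        self.classes = sorted(set(labels))
        self.vocab = set().union(*docs)  # all relation phrases seen in training
        self.prior, self.counts, self.totals = {}, {}, {}
        for c in self.classes:
            class_docs = [d for d, y in zip(docs, labels) if y == c]
            self.prior[c] = math.log(len(class_docs) / len(docs))
            agg = Counter()
            for d in class_docs:
                agg.update(d)
            self.counts[c] = agg
            self.totals[c] = sum(agg.values())
        return self

    def predict(self, doc: Counter) -> str:
        v = len(self.vocab)

        def score(c: str) -> float:
            s = self.prior[c]
            for feat, n in doc.items():
                if feat in self.vocab:  # ignore relations unseen in training
                    s += n * math.log((self.counts[c][feat] + 1) / (self.totals[c] + v))
            return s

        return max(self.classes, key=score)


# Toy training data: hand-written triples, not real ReVerb output.
train = [
    ([("the city", "is the capital of", "the region"),
      ("the city", "was founded in", "the 12th century")], "featured"),
    ([("the band", "is", "great"), ("it", "is", "cool")], "non-featured"),
]
model = NaiveBayes().fit(
    [relation_features(t) for t, _ in train], [y for _, y in train]
)
prediction = model.predict(
    relation_features([("the town", "is the capital of", "the province")])
)
```

Using only the relation phrase, and not the entities, keeps the feature space small and topic-independent. Whether that matches the original experiments is not stated in the slides.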
12.
Summary
- Simple fact-related measure: Factual Density
- Based on Factual Density, featured/good articles can be separated from non-featured ones if the articles are similar in length
- If articles differ in length, use word count!
- For future work: a combination of both
- Plan to incorporate edit history: more editors, higher factual density
- Preliminary experiments with relational features
  - Promising results, more work in this direction
  - The goal here is to bring semantics into the field of Information Quality
  - We expect this to unlock several IQ dimensions, e.g. generality vs. specificity
13.
Thank you for your attention!
Elisabeth Lex
elex@know-center.at