SlideShare une entreprise Scribd logo
1  sur  22
Evaluating Entity Summarization
      Using a Game-Based Ground Truth
           Andreas Thalhammer¹, Magnus Knuth²,
                     and Harald Sack²




                             ¹ University of Innsbruck, Austria
13 Nov. 2012
ISWC 2012 Boston             ² Hasso Plattner Institute Potsdam, Germany
Google: “Get the best summary” [1]
  • Inglourious Basterds (Movie)
  • Freebase: 1279 triples
  • DBpedia: 217 triples


   • Google Knowledge Graph
     summary: 14 triples

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   2
Entity Summarization
• First attempt towards a definition:
“... not just represent the main themes of the
original data, but rather, can best identify the
underlying entity” [2]

                       Is this a good definition?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   3
Entity Summarization (cont.)
   “A summary can be loosely defined as a text that is
   produced from one or more texts, that conveys
   important information in the original text(s), and
   that is no longer than half of the original text(s) and
   usually significantly less than that.” [3]

               A summary is
                 • short
                 • and conveys important information.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   4
Entity Summarization (cont.)
• Our (loose) definition:
    “Entity summarization is the task of producing a
    summary that conveys important facts about the
    entity while reducing the number of shown facts
    significantly.”




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   5
The Problem: Evaluation
  • How do we make different summarization
    systems comparable?

           Sub-question:
         • How do we grasp the idea of “important
            facts”?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   6
Related Work
• RELIN: Relatedness and Informativeness-based
  Centrality for Entity Summarization [2]

      – Intrinsic: 24 users compiled summaries of 149
        entities (forming a gold standard)
        (Intersection-based similarity)

      – Extrinsic: 47 pairs of FB and DBpedia entities were
        selected (24 correct ones, 23 incorrect ones).
                      Users judge whether pairs are correct or not.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   7
Related Work (cont.)
• Towards exploratory video search using linked
  data [4]

      – Quantitative evaluation of heuristics
         Ground truth, containing 115 entities
        summarized by 72 users.
      – Precision/Recall similarity measure



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   8
Related Work (cont.)
• It is hard to find participants.
• Generating summaries is a cumbersome
  process.
• Only a subset of property-value pairs are
  ranked by the users.
• Up to this point, none of the two evaluation
  datasets is publically available.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   9
Our Idea
• Important facts are commonly known
• Unimportant facts are rarely known

• How to find out?




                      Linked Data quiz game!
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   10
Hypothesis
“A game-based ground truth is suitable for
evaluating the performance of summarization
approaches in the movie domain”

Assumption: implemented approaches correlate
with the game-based ground truth while random
summaries do not.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   11
Dataset
• 60 arbitrary selected movies from IMDb Top250
• RDF descriptions from Freebase
• Usage of a property white list
• Triple store: Ontotext’s OWLIM with OWL2-RL
  reasoning enabled.
• Property chains:
               <http://some-name.space/hasActor>
               <http://www.w3.org/2002/07/owl#propertyChainAxiom> (
               <http://rdf.freebase.com/ns/film.film.starring>
               <http://rdf.freebase.com/ns/film.performance.actor> ).
                 All data is available at: http://yovisto.com/labs/iswc2012
13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   12
WhoKnows?Movies!
            S                        P                        O
:The_Princess_Bride         prop:actor            :Billy_Crystal, ...
:Braveheart                 prop:actor            :Mel_Gibson, ...
:Pulp_Fiction               prop:actor            :John_Travolta .


  •      Question types:
                 - One-to-One
                 - One-to-N
  •      Questions are composed
         upside down:
                  ‘Object is the property of subject1,
                  subject2, subject3’
                     Play the game at: http://bit.ly/WhoKnowsMovies
  13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   13
Frequency == Importance ???
                                                                      word       upper          lower

• Information retrieval:                                         frequency       cut-off        cut-off



      – Luhn (1958):
        “resolving power of words” [5]
                                                                                                          ranking by
                                                                                                          word frequency

• Game supports half-knowledge in general
      – e.g. which movie was released 1994?
        Monsters, Inc. – Pulp Fiction – Casablanca
      – ... but the human brain performs better with
            pictures (actors), sounds (film music), ...
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston             14
Evaluated Systems
• UBES (Usage-based Entity Summarization) [5]
      – Combine Freebase with HetRec2011 MovieLens2k [6]
      – Use item-based collaborative filtering to form
        neighborhoods for each movie
      – Find out which property-value pairs a movie shares
        with its neighbors
      – Use a TF-IDF related weighting scheme
                                    Bob          Alice         Marc          Elena         John     Mary
                Pulp Fiction            1            0             1              0            1      1
                Heat                    0            0             1              1            0      0
                Kill Bill               1            0             1              0            1      0
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston          15
Evaluated Systems (cont.)
  • GKG (Google Knowledge Graph) [1]
        – Enables semi-automatic transformation to Freebase
/search?hl=en&q=quentin+tarantino&
stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_
sQyxwwAAAA&
sa=X&ei=FnjTT7rXN8jftAaAhPWIDw&
ved=0CKwBEJsTKAA

        – base64 + gzip
         /m/0693l
                        http://www.freebase.com/view/m/0693l
                        redirects to:
                        http://www.freebase.com/view/en/quentin_tarantino
  13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   16
Results
• 690 sessions, 8308 questions
• 217 players (135 players played only once)
• 2314 of 2829 triples were played more than 3
  times




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   17
Result: Kendall’s τ
• Property ranking:



• Feature (property-value) ranking:




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   18
Conclusion
• The results indicate that a game-based ground
  truth is suitable for evaluating entity
  summarization.

• The current dataset is too sparse to make valid
  assumptions about the importance of single
  facts.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   19
Future Work
•    Increase the number of players
•    Score the exclusion principle
•    Increase the number of movies
•    Application to additional domains
•    Publish new versions of the evaluation dataset
     on a regular basis



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   20
Questions?

                              Help collecting data:
                  http://bit.ly/WhoKnowsMovies


                                              Andreas Thalhammer (andreas.thalhammer@sti2.at)
                                              Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de)
                                                   Harald Sack (harald.sack@hpi.uni-potsdam.de)

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   21
References
     [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012),
          http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html
     [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity
          Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N.,
          Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg
          (2011)
     [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special
          issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408.
          DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927
     [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools
          and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1
     [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data
          movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of
          Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012)
     [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in
          recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems,
          RecSys 2011. ACM, New York (2011)



13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   22

Contenu connexe

Dernier

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 

Dernier (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 

En vedette

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

En vedette (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Evaluating Entity Summarization Using a Game-Based Ground Truth

  • 1. Evaluating Entity Summarization Using a Game-Based Ground Truth Andreas Thalhammer¹, Magnus Knuth², and Harald Sack² ¹ University of Innsbruck, Austria 13 Nov. 2012 ISWC 2012 Boston ² Hasso Plattner Institute Potsdam, Germany
  • 2. Google: “Get the best summary” [1] • Inglourious Basterds (Movie) • Freebase: 1279 triples • DBpedia: 217 triples • Google Knowledge Graph summary: 14 triples 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 2
  • 3. Entity Summarization • First attempt towards a definition: “... not just represent the main themes of the original data, but rather, can best identify the underlying entity” [2] Is this a good definition? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 3
  • 4. Entity Summarization (cont.) “A summary can be loosely defined as a text that is produced from one or more texts, that conveys important information in the original text(s), and that is no longer than half of the original text(s) and usually significantly less than that.” [3] A summary is • short • and conveys important information. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 4
  • 5. Entity Summarization (cont.) • Our (loose) definition: “Entity summarization is the task of producing a summary that conveys important facts about the entity while reducing the number of shown facts significantly.” 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 5
  • 6. The Problem: Evaluation • How do we make different summarization systems comparable? Sub-question: • How do we grasp the idea of “important facts”? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 6
  • 7. Related Work • RELIN: Relatedness and Informativeness-based Centrality for Entity Summarization [2] – Intrinsic: 24 users compiled summaries of 149 entities (forming a gold standard) (Intersection-based similarity) – Extrinsic: 47 pairs of FB and DBpedia entities were selected (24 correct ones, 23 incorrect ones).  Users judge whether pairs are correct or not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 7
  • 8. Related Work (cont.) • Towards exploratory video search using linked data [4] – Quantitative evaluation of heuristics  Ground truth, containing 115 entities summarized by 72 users. – Precision/Recall similarity measure 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 8
  • 9. Related Work (cont.) • It is hard to find participants. • Generating summaries is a cumbersome process. • Only a subset of property-value pairs are ranked by the users. • Up to this point, none of the two evaluation datasets is publically available. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 9
  • 10. Our Idea • Important facts are commonly known • Unimportant facts are rarely known • How to find out?  Linked Data quiz game! 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 10
  • 11. Hypothesis “A game-based ground truth is suitable for evaluating the performance of summarization approaches in the movie domain” Assumption: implemented approaches correlate with the game-based ground truth while random summaries do not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 11
  • 12. Dataset • 60 arbitrary selected movies from IMDb Top250 • RDF descriptions from Freebase • Usage of a property white list • Triple store: Ontotext’s OWLIM with OWL2-RL reasoning enabled. • Property chains: <http://some-name.space/hasActor> <http://www.w3.org/2002/07/owl#propertyChainAxiom> ( <http://rdf.freebase.com/ns/film.film.starring> <http://rdf.freebase.com/ns/film.performance.actor> ). All data is available at: http://yovisto.com/labs/iswc2012 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 12
  • 13. WhoKnows?Movies! S P O :The_Princess_Bride prop:actor :Billy_Crystal, ... :Braveheart prop:actor :Mel_Gibson, ... :Pulp_Fiction prop:actor :John_Travolta . • Question types: - One-to-One - One-to-N • Questions are composed upside down: ‘Object is the property of subject1, subject2, subject3’ Play the game at: http://bit.ly/WhoKnowsMovies 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 13
  • 14. Frequency == Importance ??? word upper lower • Information retrieval: frequency cut-off cut-off – Luhn (1958): “resolving power of words” [5] ranking by word frequency • Game supports half-knowledge in general – e.g. which movie was released 1994? Monsters, Inc. – Pulp Fiction – Casablanca – ... but the human brain performs better with pictures (actors), sounds (film music), ... 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 14
  • 15. Evaluated Systems • UBES (Usage-based Entity Summarization) [5] – Combine Freebase with HetRec2011 MovieLens2k [6] – Use item-based collaborative filtering to form neighborhoods for each movie – Find out which property-value pairs a movie shares with its neighbors – Use a TF-IDF related weighting scheme Bob Alice Marc Elena John Mary Pulp Fiction 1 0 1 0 1 1 Heat 0 0 1 1 0 0 Kill Bill 1 0 1 0 1 0 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 15
  • 16. Evaluated Systems (cont.) • GKG (Google Knowledge Graph) [1] – Enables semi-automatic transformation to Freebase /search?hl=en&q=quentin+tarantino& stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_ sQyxwwAAAA& sa=X&ei=FnjTT7rXN8jftAaAhPWIDw& ved=0CKwBEJsTKAA – base64 + gzip  /m/0693l http://www.freebase.com/view/m/0693l redirects to: http://www.freebase.com/view/en/quentin_tarantino 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 16
  • 17. Results • 690 sessions, 8308 questions • 217 players (135 players played only once) • 2314 of 2829 triples were played more than 3 times 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 17
  • 18. Result: Kendall’s τ • Property ranking: • Feature (property-value) ranking: 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 18
  • 19. Conclusion • The results indicate that a game-based ground truth is suitable for evaluating entity summarization. • The current dataset is too sparse to make valid assumptions about the importance of single facts. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 19
  • 20. Future Work • Increase the number of players • Score the exclusion principle • Increase the number of movies • Application to additional domains • Publish new versions of the evaluation dataset on a regular basis 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 20
  • 21. Questions? Help collecting data: http://bit.ly/WhoKnowsMovies Andreas Thalhammer (andreas.thalhammer@sti2.at) Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de) Harald Sack (harald.sack@hpi.uni-potsdam.de) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 21
  • 22. References [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012), http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg (2011) [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408. DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927 [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1 [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012) [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems, RecSys 2011. ACM, New York (2011) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 22