SlideShare une entreprise Scribd logo
1  sur  23
IWEST 2012 workshop located at ESWC 2012




                                     Evaluating Semantic Search
                                      Systems to Identify Future
                                          Directions of Research
                                           Khadija Elbedweihy1, Stuart N. Wrigley1,
                                            Fabio Ciravegna1, Dorothee Reinhard2,
                                                              Abraham Bernstein2
                                                      1University of Sheffield, UK
       18.06.2012
                                                2University of Zurich, Switzerland
       1
Outline
•   Introduction
•   Evaluation Design
•   Evaluation Execution
•   Usability Feedback and Analysis
•   Future Directions for Research
•   Conclusions




18.06.2012

2
INTRODUCTION

18.06.2012

3
Semantic Search
• Semantic Search tools have different
    • querying approaches (e.g., forms, graphs, keywords).
    • search strategies during processing and query execution.
    • format and content of the results presented to the user.


• These factors influence the user's perceived performance
  usability of the tool.

• Searching is a user-centric process; usability evaluation is as
  important as – if not more than – assessing the performance.

18.06.2012

4
Previous evaluation efforts
• Kaufmann (2007): compared 4 SW query interfaces (NL and Graph-
  based)
• SemSearch Challenge: ad-hoc object retrieval using keywords
• Question Answering Over Linked Data (QALD): two NL interfaces
• TREC Entity List Completion (ELC) Task: similar to SemSearch

• All previous evaluations based upon the Cranfield methodology
      – test collection; set of tasks; set of relevance judgments.


• Little or no focus on usability

18.06.2012

5
EVALUATION DESIGN

18.06.2012

6
Evaluation Design
Aspect          Details
Tools           • Any query input style
                • Answers extracted from data (e.g., list of URIs or literals but not documents)
Data            Mooney Natural Language Learning Data
                • known within the search community
                • simple and well-known domain for subjects (geography)
                • questions already available
                     • Give me all the state capitals of the USA?
                     • Which rivers in Arkansas are longer than Alleghany river?
Subjects        38 subjects (26 males, 12 females); aged between 20 and 35 years old
Criteria        • Usability:
                     • query input (expressiveness, etc.)
                     • usefulness and suitability of returned answers (data) and presentation
                • Performance: speed of execution (also affects user satisfaction)

   18.06.2012

   7
Data Captured
• Results for each question:
      –   time required to formulate query
      –   number of attempts required to answer question
      –   success rate (user found satisfying answer or not)
      –   query execution time


• Questionnaires capturing user experience
     – System Usability Scale (SUS) questionnaire
     – Extended questionnaire
     – Demographics questionnaire



04.08.2010

8
EVALUATION EXECUTION

18.06.2012

9
Participating tools

Tool         Description
K-Search     Form-based
Ginseng      Natural language with constrained vocabulary and grammar
NLP-Reduce   Natural language for full English questions, sentence fragments,
             and keywords.
PowerAqua    Natural language interface




18.06.2012

10
Running the experiment




18.06.2012

11
ANALYSIS AND FEEDBACK

18.06.2012

12
Results
Criterion                         K-Search    Ginseng       Nlp-     PowerAqua
                          ‘Bad’
                           Bad    Form-      Controlled    Reduce     NL-based
                                  based      NL-based     NL-based
Mean experiment time (s)          4313.84     3612.12     4798.58     2003.9     ‘Awful’
Mean SUS (0 – 100)                44.38         40         25.94       72.25
                                                                                 ‘Good’
Mean Ext.Questionnaire (0-100)    47.29         45         44.63       80.67
Mean number of attempts           2.37         2.03         5.54       2.01    Twice # of
                                                                               attempts
Mean answer found rate            0.41         0.19         0.21       0.55
Mean execution time (s)           0.44         0.51         0.51        11       slowest
Mean input time (s)               69.11        81.63         29        16.03

                                                          slowest

   18.06.2012

   13
Feedback: input style
Input          Positive                                    Negative
Free NL        • fast (16 and 29 sec on average)           mismatch (habitability) problem: “I need to
               • most natural (query in plain natural      know and use the terms expected by the
                 language)                                 system and not my own terms to get results”
Contr. NL      • guidance: suggestions and auto-           very restricted language model:
                 completion                                • frustration (low SUS)
               • avoids habitability problem (only valid   • limit flexibility and expressiveness
                 queries)                                  • slow query formulation (highest input
                                                             time: 81.63 sec)
Form           • allow users to build more complex         • more difficult to use than NL
                 queries than NL                           • time consuming (input time: 69.11 sec on
               • helpful to know the search space            average)
                 (concepts & relations)




  18.06.2012

  14
Feedback: results
Aspect         Comments
Presentation   Results not user-friendly
               • provided full URIs of the concepts
                 (e.g. `http://www.mooney.net/geo#tennesse2’)
               • used ontology labels for providing a NL representation of the answer
                 (e.g. `montgomeryAI’)
Management Users have high expectations; requested advanced means of managing
           the results such as:
           • storing and reusing results of previous queries
           • filtering results according to some suitable criteria
           • checking the provenance of the results
           • basic manipulations such as sorting results




18.06.2012

15
FUTURE DIRECTIONS FOR
     RESEARCH
18.06.2012

16
Input Style
• Visualising the search space shows:
      • what type of information is available (exploration)
      • what queries are supported (query formulation guidance).
• Typing queries in natural language is fast and easy

• Provide ‘dual query formulation’ approach
      • users unfamiliar with domain can correctly formulate their
        intended queries using view-based
      • users familiar with domain can use faster NL queries

18.06.2012

17
Input Style
• Comparatives and Superlatives still a challenge
      e.g., FREyA uses an ‘intervention approach’
             • if a numerical datatype property is found in user query:
               1. generates maximum, minimum and sum functions
               2. user chooses the required function




18.06.2012

18
Query Execution
Delays in response time negatively affect user experience and
satisfaction.

• Provide feedback
      • reduces the effect of delays (more willing to wait if they know the
        status of their search process).

• Provide intermediate (partial) results
      • gradually incremented to provide the complete result set.
      • similar to (arguably better than) basic feedback




18.06.2012

19
Results
 • Presentation
      • Attractive, accessible, understandable and user-friendly.
      • Augment with associated information: `richer’ user experience.


 • Management
      •      Filter, sort
      •      Some complex questions require multiple sub-queries
      •      Ability to store and reuse the result set could be helpful.
      •      Queries can then be constructed by combining saved queries
             with logical operators such as `AND' and `OR’.


18.06.2012

20
CONCLUSIONS

18.06.2012

21
Conclusions & Recommendations
• Query input approaches serve different purposes:
      – View-based: explore and understand
      – NL-based: efficiency and simplicity

• Dual query approach to input
      – natural language and view-based input styles
      – improve search effectiveness and user satisfaction

• More sophisticated results presentation and management
      – customise: sort, filter, provenance and (temporary save)
      – enrich: supplementary information


18.06.2012

22
THANK YOU

18.06.2012

23

Contenu connexe

Tendances

Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLPGVS Chaitanya
 
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw... Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...Christian Posse
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and ChallengesJens Lehmann
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Shalin Hai-Jew
 
Answer Extraction for how and why Questions in Question Answering Systems
Answer Extraction for how and why Questions in Question Answering SystemsAnswer Extraction for how and why Questions in Question Answering Systems
Answer Extraction for how and why Questions in Question Answering Systemsijceronline
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012srosenblatt
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1Traian Rebedea
 
Workshop 2 using nvivo 12 for qualitative data analysis
Workshop 2 using nvivo 12 for qualitative data analysisWorkshop 2 using nvivo 12 for qualitative data analysis
Workshop 2 using nvivo 12 for qualitative data analysisDr. Yaar Muhammad
 
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...pamselle
 
Örüntü tanıma - Pattern Recognition
Örüntü tanıma - Pattern RecognitionÖrüntü tanıma - Pattern Recognition
Örüntü tanıma - Pattern RecognitionHassan-k Abdi
 
Using and learning phrases
Using and learning phrasesUsing and learning phrases
Using and learning phrasesCassandra Jacobs
 
Computer Software in Qualitative Research: An Introduction to NVivo
Computer Software in Qualitative Research: An Introduction to NVivoComputer Software in Qualitative Research: An Introduction to NVivo
Computer Software in Qualitative Research: An Introduction to NVivoAdam Perzynski, PhD
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computersNoonapau
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...Joaquin Delgado PhD.
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisYun Hao
 
Framester and WFD
Framester and WFD Framester and WFD
Framester and WFD Aldo Gangemi
 

Tendances (17)

Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
 
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw... Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
Key Lessons Learned Building Recommender Systems for Large-Scale Social Netw...
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
 
Answer Extraction for how and why Questions in Question Answering Systems
Answer Extraction for how and why Questions in Question Answering SystemsAnswer Extraction for how and why Questions in Question Answering Systems
Answer Extraction for how and why Questions in Question Answering Systems
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012
 
Practical machine learning - Part 1
Practical machine learning - Part 1Practical machine learning - Part 1
Practical machine learning - Part 1
 
Workshop 2 using nvivo 12 for qualitative data analysis
Workshop 2 using nvivo 12 for qualitative data analysisWorkshop 2 using nvivo 12 for qualitative data analysis
Workshop 2 using nvivo 12 for qualitative data analysis
 
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
 
Örüntü tanıma - Pattern Recognition
Örüntü tanıma - Pattern RecognitionÖrüntü tanıma - Pattern Recognition
Örüntü tanıma - Pattern Recognition
 
Question answering
Question answeringQuestion answering
Question answering
 
Using and learning phrases
Using and learning phrasesUsing and learning phrases
Using and learning phrases
 
Computer Software in Qualitative Research: An Introduction to NVivo
Computer Software in Qualitative Research: An Introduction to NVivoComputer Software in Qualitative Research: An Introduction to NVivo
Computer Software in Qualitative Research: An Introduction to NVivo
 
Data analysis – using computers
Data analysis – using computersData analysis – using computers
Data analysis – using computers
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment Analysis
 
Framester and WFD
Framester and WFD Framester and WFD
Framester and WFD
 

Similaire à Evaluating Semantic Search Systems to Identify Future Directions of Research

Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsMohammad Aliannejadi
 
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Jekaterina Novikova, PhD
 
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...Lucidworks
 
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...Jin Young Kim
 
Establishing best practices to improve usefulness and usability of web interf...
Establishing best practices to improve usefulness and usability of web interf...Establishing best practices to improve usefulness and usability of web interf...
Establishing best practices to improve usefulness and usability of web interf...DRIscience
 
Cognitive Ability Effects on Effort in Web Search & Navigation by Gwizdka
Cognitive Ability Effects on Effort in Web Search & Navigation by GwizdkaCognitive Ability Effects on Effort in Web Search & Navigation by Gwizdka
Cognitive Ability Effects on Effort in Web Search & Navigation by Gwizdkajacekg
 
questionnaires types and types of scale.pdf
questionnaires types and types of scale.pdfquestionnaires types and types of scale.pdf
questionnaires types and types of scale.pdfAbhishekSingh158448
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...S. Diana Hu
 
Crowdsourcing Best Practices
Crowdsourcing Best Practices Crowdsourcing Best Practices
Crowdsourcing Best Practices Marta Sabou
 
Usability Testing for Qualitative Researchers - QRCA NYC Chapter event
Usability Testing for Qualitative Researchers - QRCA NYC Chapter eventUsability Testing for Qualitative Researchers - QRCA NYC Chapter event
Usability Testing for Qualitative Researchers - QRCA NYC Chapter eventKay Aubrey
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
Game design 2 (2013): Lecture 11 - User Feedback in Game Design
Game design 2 (2013): Lecture 11 - User Feedback in Game DesignGame design 2 (2013): Lecture 11 - User Feedback in Game Design
Game design 2 (2013): Lecture 11 - User Feedback in Game DesignDavid Farrell
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchCaroline Jarrett
 
Improving DBpedia (one microtask at a time)
Improving DBpedia (one microtask at a time)Improving DBpedia (one microtask at a time)
Improving DBpedia (one microtask at a time)Elena Simperl
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012srosenblatt
 
Advanced topics research
Advanced topics researchAdvanced topics research
Advanced topics researchkieran122
 
Artificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software TestingArtificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software TestingLionel Briand
 

Similaire à Evaluating Semantic Search Systems to Identify Future Directions of Research (20)

Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
 
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
Natural Language Processing: From Human-Robot Interaction to Alzheimer’s Dete...
 
The art of project estimation
The art of project estimationThe art of project estimation
The art of project estimation
 
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
Enriching Solr with Deep Learning for a Question Answering System - Sanket Sh...
 
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
SIGIR Tutorial on IR Evaluation: Designing an End-to-End Offline Evaluation P...
 
Establishing best practices to improve usefulness and usability of web interf...
Establishing best practices to improve usefulness and usability of web interf...Establishing best practices to improve usefulness and usability of web interf...
Establishing best practices to improve usefulness and usability of web interf...
 
Cognitive Ability Effects on Effort in Web Search & Navigation by Gwizdka
Cognitive Ability Effects on Effort in Web Search & Navigation by GwizdkaCognitive Ability Effects on Effort in Web Search & Navigation by Gwizdka
Cognitive Ability Effects on Effort in Web Search & Navigation by Gwizdka
 
questionnaires types and types of scale.pdf
questionnaires types and types of scale.pdfquestionnaires types and types of scale.pdf
questionnaires types and types of scale.pdf
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 
Crowdsourcing Best Practices
Crowdsourcing Best Practices Crowdsourcing Best Practices
Crowdsourcing Best Practices
 
Usability Testing for Qualitative Researchers - QRCA NYC Chapter event
Usability Testing for Qualitative Researchers - QRCA NYC Chapter eventUsability Testing for Qualitative Researchers - QRCA NYC Chapter event
Usability Testing for Qualitative Researchers - QRCA NYC Chapter event
 
Chapter 8 Evaluation Techniques
Chapter 8 Evaluation  TechniquesChapter 8 Evaluation  Techniques
Chapter 8 Evaluation Techniques
 
The Art of Project Estimation
The Art of Project EstimationThe Art of Project Estimation
The Art of Project Estimation
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
Game design 2 (2013): Lecture 11 - User Feedback in Game Design
Game design 2 (2013): Lecture 11 - User Feedback in Game DesignGame design 2 (2013): Lecture 11 - User Feedback in Game Design
Game design 2 (2013): Lecture 11 - User Feedback in Game Design
 
Introduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey ResearchIntroduction to Usability Testing for Survey Research
Introduction to Usability Testing for Survey Research
 
Improving DBpedia (one microtask at a time)
Improving DBpedia (one microtask at a time)Improving DBpedia (one microtask at a time)
Improving DBpedia (one microtask at a time)
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012
 
Advanced topics research
Advanced topics researchAdvanced topics research
Advanced topics research
 
Artificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software TestingArtificial Intelligence for Automated Software Testing
Artificial Intelligence for Automated Software Testing
 

Dernier

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Dernier (20)

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Evaluating Semantic Search Systems to Identify Future Directions of Research

  • 1. IWEST 2012 workshop located at ESWC 2012 Evaluating Semantic Search Systems to Identify Future Directions of Research Khadija Elbedweihy1, Stuart N. Wrigley1, Fabio Ciravegna1, Dorothee Reinhard2, Abraham Bernstein2 1University of Sheffield, UK 18.06.2012 2University of Zurich, Switzerland 1
  • 2. Outline • Introduction • Evaluation Design • Evaluation Execution • Usability Feedback and Analysis • Future Directions for Research • Conclusions 18.06.2012 2
  • 4. Semantic Search • Semantic Search tools have different • querying approaches (e.g., forms, graphs, keywords). • search strategies during processing and query execution. • format and content of the results presented to the user. • These factors influence the user's perceived performance usability of the tool. • Searching is a user-centric process; usability evaluation is as important as – if not more than – assessing the performance. 18.06.2012 4
  • 5. Previous evaluation efforts • Kaufmann (2007): compared 4 SW query interfaces (NL and Graph- based) • SemSearch Challenge: ad-hoc object retrieval using keywords • Question Answering Over Linked Data (QALD): two NL interfaces • TREC Entity List Completion (ELC) Task: similar to SemSearch • All previous evaluations based upon the Cranfield methodology – test collection; set of tasks; set of relevance judgments. • Little or no focus on usability 18.06.2012 5
  • 7. Evaluation Design Aspect Details Tools • Any query input style • Answers extracted from data (e.g., list of URIs or literals but not documents) Data Mooney Natural Language Learning Data • known within the search community • simple and well-known domain for subjects (geography) • questions already available • Give me all the state capitals of the USA? • Which rivers in Arkansas are longer than Alleghany river? Subjects 38 subjects (26 males, 12 females); aged between 20 and 35 years old Criteria • Usability: • query input (expressiveness, etc.) • usefulness and suitability of returned answers (data) and presentation • Performance: speed of execution (also affects user satisfaction) 18.06.2012 7
  • 8. Data Captured • Results for each question: – time required to formulate query – number of attempts required to answer question – success rate (user found satisfying answer or not) – query execution time • Questionnaires capturing user experience – System Usability Scale (SUS) questionnaire – Extended questionnaire – Demographics questionnaire 04.08.2010 8
  • 10. Participating tools Tool Description K-Search Form-based Ginseng Natural language with constrained vocabulary and grammar NLP-Reduce Natural language for full English questions, sentence fragments, and keywords. PowerAqua Natural language interface 18.06.2012 10
  • 13. Results Criterion K-Search Ginseng Nlp- PowerAqua ‘Bad’ Bad Form- Controlled Reduce NL-based based NL-based NL-based Mean experiment time (s) 4313.84 3612.12 4798.58 2003.9 ‘Awful’ Mean SUS (0 – 100) 44.38 40 25.94 72.25 ‘Good’ Mean Ext.Questionnaire (0-100) 47.29 45 44.63 80.67 Mean number of attempts 2.37 2.03 5.54 2.01 Twice # of attempts Mean answer found rate 0.41 0.19 0.21 0.55 Mean execution time (s) 0.44 0.51 0.51 11 slowest Mean input time (s) 69.11 81.63 29 16.03 slowest 18.06.2012 13
  • 14. Feedback: input style Input Positive Negative Free NL • fast (16 and 29 sec on average) mismatch (habitability) problem: “I need to • most natural (query in plain natural know and use the terms expected by the language) system and not my own terms to get results” Contr. NL • guidance: suggestions and auto- very restricted language model: completion • frustration (low SUS) • avoids habitability problem (only valid • limit flexibility and expressiveness queries) • slow query formulation (highest input time: 81.63 sec) Form • allow users to build more complex • more difficult to use than NL queries than NL • time consuming (input time: 69.11 sec on • helpful to know the search space average) (concepts & relations) 18.06.2012 14
  • 15. Feedback: results Aspect Comments Presentation Results not user-friendly • provided full URIs of the concepts (e.g. `http://www.mooney.net/geo#tennesse2’) • used ontology labels for providing a NL representation of the answer (e.g. `montgomeryAI’) Management Users have high expectations; requested advanced means of managing the results such as: • storing and reusing results of previous queries • filtering results according to some suitable criteria • checking the provenance of the results • basic manipulations such as sorting results 18.06.2012 15
  • 16. FUTURE DIRECTIONS FOR RESEARCH 18.06.2012 16
  • 17. Input Style • Visualising the search space shows: • what type of information is available (exploration) • what queries are supported (query formulation guidance). • Typing queries in natural language is fast and easy • Provide ‘dual query formulation’ approach • users unfamiliar with domain can correctly formulate their intended queries using view-based • users familiar with domain can use faster NL queries 18.06.2012 17
  • 18. Input Style • Comparatives and Superlatives still a challenge e.g., FREyA uses an ‘intervention approach’ • if a numerical datatype property is found in user query: 1. generates maximum, minimum and sum functions 2. user chooses the required function 18.06.2012 18
  • 19. Query Execution Delays in response time negatively affect user experience and satisfaction. • Provide feedback • reduces the effect of delays (more willing to wait if they know the status of their search process). • Provide intermediate (partial) results • gradually incremented to provide the complete result set. • similar to (arguably better than) basic feedback 18.06.2012 19
  • 20. Results • Presentation • Attractive, accessible, understandable and user-friendly. • Augment with associated information: `richer’ user experience. • Management • Filter, sort • Some complex questions require multiple sub-queries • Ability to store and reuse the result set could be helpful. • Queries can then be constructed by combining saved queries with logical operators such as `AND' and `OR’. 18.06.2012 20
  • 22. Conclusions & Recommendations • Query input approaches serve different purposes: – View-based: explore and understand – NL-based: efficiency and simplicity • Dual query approach to input – natural language and view-based input styles – improve search effectiveness and user satisfaction • More sophisticated results presentation and management – customise: sort, filter, provenance and (temporary save) – enrich: supplementary information 18.06.2012 22