SlideShare a Scribd company logo
1 of 28
Download to read offline
Semantic Faceted Search 
with SemFacet 
Evgeny Kharlamov 
Information Systems Group 
Department of Computer Science 
University of Oxford
Finding Data w/ Keywords is Hard 
§ Keyword search is the paradigm 
to access data on the Web, 
company websites, etc 
§ Limitations of keyword search 
§ Too many docs contain keywords 
§ Meaning is not built in keywords 
§ Becomes the art of 
“finding the best combination” 
§ Limited control on search
How to Improve Search Experience? 
§ Improve the search paradigm 
§ End-user oriented query formulation interfaces 
§ Faceted search 
§ Improve the data model 
§ Semantic Web models 
§ Our proposal: 
§ do both and combine 
§ Faceted search 
§ Semantic Web model
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values 
§ Facets in action 
§ Choose a value
Enhancing Keyword Search with Facets 
§ A facet = control mechanism 
§ Name 
§ Set of values 
§ Facets in action 
§ Choose a value 
§ Restrict search result 
§ Advantages of facets 
§ Allow to say what you 
really mean 
§ Give control over 
search
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
Find 4-star hotels with French restaurants
Faceted Search in the Nutshell 
stars 
3-stars 
restaurant 
§ Search over 
one set of items 
§ Items annotated with 
§ Strings 
§ Search result: 
subset of items 
Asian 
Italian 
4-stars 5-stars 
French 
output 
Find 4-star hotels with French restaurants
F-Search is the De Facto Standard
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
4-stars French 
stars 
restaurant 
type 
walking 
distance to 
French restaurant is a Restaurant that offers French cuisine. 
FrenchRestaurant ⊑ Restaurant ⊓ ∃ offers.FrenchCuisine
Enhancing Search with SW in Practice
Enhancing Search with SW in Practice
Enhancing Search with SW in Practice 
Hello, my name is John Doe. 
I study at the University if Dreams. 
My daughter is Alice.... 
embedding 
semantic 
annotations 
<section itemscope itemtype = "http://dava-vocabulary.org/Person" 
itemid = "http://myitems/john-doe-1234" > 
Hello, my name is 
<span itemprop="name">John Doe</span>. 
I study at the 
<span itemprop="affiliation">University of Dreams</span> 
My daughter is 
<span itemtype = "http://dava-vocabulary.org/children" 
itemid = "http://myitems/alice-doe-5678" > 
Alice </span> 
....
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
from 2011 to 2012 the fraction of structured data went from 
3.5% to 13%
Semantic Web Models 
§ RDF data model 
§ objects annotated with strings and objects 
§ OWL 2 ontologies 
§ structure vocabularies of annotations 
from 2011 to 2012 the fraction of structured data went from 
3.5% to 13%
How to Improve Search Experience? 
§ Improve the search paradigm 
§ End-user oriented query formulation interfaces 
§ Faceted Search 
§ Improve the data model 
§ Semantic Web models 
§ RDF Data 
§ OWL 2 ontologies 
§ Our proposal: 
§ Semantic Faceted Search that combines 
§ Faceted search 
§ Semantic Web model
Semantic Faceted Search in the Nutshell 
4-stars 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Semantic Faceted Search in the Nutshell 
stars 
3-stars 
§ Search over 
several sets of items 
§ Items annotated with 
§ Strings 
§ Items 
§ Search result: 
§ user-chosen 
subset of items 
4-stars 5-stars Asian Italian French 
restaurant 
Find 4-star hotels with French restaurants 
that are walking distance to Eiffel tower 
type 
walking 
distance to 
output
Research Contributions 
§ Solid foundation for Semantic F-Search 
§ Projection of ontologies on 
graph data structures 
§ Allows to incorporate ontologies 
into faceted search 
§ Gives better faceted interfaces 
politicians Search 
More Focus 
type 
USpres 
Country 
More Focus 
More Focus 
Remove 
More Focus 
Remove 
§ Generate more facets / Prune irrelevant facets 
§ Scalable algorithms to 
§ generate and update facets from 
§ Data and Ontologies 
§ Algorithms to evaluate faceted queries over semantic data 
§ Exploits bottom up query evaluation 
http://en.wikipedia.org/wiki/Bill_Clinton 
William Jefferson "Bill" Clinton (born William 
Jefferson Blythe III; August 19, 1946) is an 
American politician who served as the 42nd 
President of the United States from 1993 to 
2001. Inaugurated at age 46, he was the third-youngest 
president. He took office at the end 
of the Cold War, and was the first president of 
the baby boomer generation... 
has child 
ANY 
Remove 
Remove 
is graduated from 
Stanford Uni. 
is graduated from 
Stanford Uni. 
Harvard Uni. 
Georgetown Uni.
SemFacet System 
§ Integration of 
§ Keyword search and 
§ Semantic faceted search 
§ Main features 
§ Automatic generation of f-search interfaces 
over RDF data and OWL 2 ontologies 
§ In memory 
§ Online and offline reasoning 
§ Efficient on millions of triples 
§ Flexible configuration 
§ Interchangeable triple stores 
§ RDFOX, PAGOdA, Hermit, Sesame 
§ Configurable answers (snippets) 
§ Support of Or and And facets 
Faceted Query 
Interface 
Answers as 
Snippets 
Presentation 
Layer 
Application 
Layer 
Data 
Layer 
Facet 
Generator 
Query 
Converter 
Snippet 
Generator 
Triple Store: 
Ontology 
Data 
Keyword 
Based Search 
KBS 
Engine 
Inverted Index 
e.g. DBpedia 
Abstracts 
RDFOX, PAGOdA, Hermit, Sesame
SemFacet Team 
§ Marcelo Arenas 
§ Bernardo Cuenca Grau 
§ Evgeny Kharlamov 
§ Sarunas Marciuska 
§ Dmitriy Zheleznyakov

More Related Content

Viewers also liked

Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationDBOnto
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn PresentationDBOnto
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDBOnto
 
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterDBOnto
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paperDBOnto
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationDBOnto
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperDBOnto
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...DBOnto
 
PAGOdA poster
PAGOdA posterPAGOdA poster
PAGOdA posterDBOnto
 
PDQ Poster
PDQ PosterPDQ Poster
PDQ PosterDBOnto
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox PosterDBOnto
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA PresentationDBOnto
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paperDBOnto
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DBOnto
 
Parallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationParallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationDBOnto
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperDBOnto
 

Viewers also liked (16)

Overview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentationOverview of Dan Olteanu's Research presentation
Overview of Dan Olteanu's Research presentation
 
ROSeAnn Presentation
ROSeAnn PresentationROSeAnn Presentation
ROSeAnn Presentation
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meeting
 
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - PosterArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
ArtForm - Dynamic analysis of JavaScript validation in web forms - Poster
 
PAGOdA paper
PAGOdA paperPAGOdA paper
PAGOdA paper
 
PDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentationPDQ: Proof-driven Querying presentation
PDQ: Proof-driven Querying presentation
 
Aggregating Semantic Annotators Paper
Aggregating Semantic Annotators PaperAggregating Semantic Annotators Paper
Aggregating Semantic Annotators Paper
 
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF ...
 
PAGOdA poster
PAGOdA posterPAGOdA poster
PAGOdA poster
 
PDQ Poster
PDQ PosterPDQ Poster
PDQ Poster
 
RDFox Poster
RDFox PosterRDFox Poster
RDFox Poster
 
PAGOdA Presentation
PAGOdA PresentationPAGOdA Presentation
PAGOdA Presentation
 
Sem facet paper
Sem facet paperSem facet paper
Sem facet paper
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
 
Parallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox PresentationParallel Datalog Reasoning in RDFox Presentation
Parallel Datalog Reasoning in RDFox Presentation
 
Query Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning PaperQuery Distributed RDF Graphs: The Effects of Partitioning Paper
Query Distributed RDF Graphs: The Effects of Partitioning Paper
 

Similar to Semantic Faceted Search with SemFacet presentation

Web Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a LibrarianWeb Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a Librarianlerichard
 
Best Practices for Enterprise Search
Best Practices for Enterprise SearchBest Practices for Enterprise Search
Best Practices for Enterprise SearchChris Risner
 
Online Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searchesOnline Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searches211 Check
 
Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011cyberspaced educator
 
SEO for Ecommerce - an overview
SEO for Ecommerce - an overviewSEO for Ecommerce - an overview
SEO for Ecommerce - an overviewErudite
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentationadeason
 
Martina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteMartina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteNordicSitecoreConference
 
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.George Freitag
 
Spiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherSpiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherMegan Heuer
 
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...hewie
 
JAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationJAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationChris Davenport
 
Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014notess
 
Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Elizabeth Holmes
 
RESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxRESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxShukurat Bello
 

Similar to Semantic Faceted Search with SemFacet presentation (20)

Web Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a LibrarianWeb Search: Tips and techniques from a Librarian
Web Search: Tips and techniques from a Librarian
 
Best Practices for Enterprise Search
Best Practices for Enterprise SearchBest Practices for Enterprise Search
Best Practices for Enterprise Search
 
Online Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searchesOnline Research_How to get the best out of internet searches
Online Research_How to get the best out of internet searches
 
Carl 2014 slides_gotime
Carl 2014 slides_gotimeCarl 2014 slides_gotime
Carl 2014 slides_gotime
 
Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
 
Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011Google search and beyond sasta 25 11-2011
Google search and beyond sasta 25 11-2011
 
SEO for Ecommerce - an overview
SEO for Ecommerce - an overviewSEO for Ecommerce - an overview
SEO for Ecommerce - an overview
 
Audit
AuditAudit
Audit
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
Google Is a Two Page Site
Google Is a Two Page SiteGoogle Is a Two Page Site
Google Is a Two Page Site
 
Martina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesiteMartina Welander - Google is a two pagesite
Martina Welander - Google is a two pagesite
 
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.Getting Ahead of the Curve: Leveraging Local Search in the U.K.
Getting Ahead of the Curve: Leveraging Local Search in the U.K.
 
Spiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcherSpiders, farms, and bubbles: how to become an expert internet searcher
Spiders, farms, and bubbles: how to become an expert internet searcher
 
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...Northwest Florida Association of  Computer User Groups TECH 17 Better Search ...
Northwest Florida Association of Computer User Groups TECH 17 Better Search ...
 
JAB2012 Smart Search Presentation
JAB2012 Smart Search PresentationJAB2012 Smart Search Presentation
JAB2012 Smart Search Presentation
 
Hotbot ppt
Hotbot pptHotbot ppt
Hotbot ppt
 
Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014Advanced Search: WebSearch University 2014
Advanced Search: WebSearch University 2014
 
Ipe pp slides google talk 2013
Ipe pp slides google talk 2013Ipe pp slides google talk 2013
Ipe pp slides google talk 2013
 
RESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptxRESEARCHING YOUR TOPIC_edit.pptx
RESEARCHING YOUR TOPIC_edit.pptx
 
Identifying Keywords and Searching Techniques
Identifying Keywords and Searching TechniquesIdentifying Keywords and Searching Techniques
Identifying Keywords and Searching Techniques
 

Recently uploaded

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Semantic Faceted Search with SemFacet presentation

  • 1. Semantic Faceted Search with SemFacet Evgeny Kharlamov Information Systems Group Department of Computer Science University of Oxford
  • 2. Finding Data w/ Keywords is Hard § Keyword search is the paradigm to access data on the Web, company websites, etc § Limitations of keyword search § Too many docs contain keywords § Meaning is not built in keywords § Becomes the art of “finding the best combination” § Limited control on search
  • 3. How to Improve Search Experience? § Improve the search paradigm § End-user oriented query formulation interfaces § Faceted search § Improve the data model § Semantic Web models § Our proposal: § do both and combine § Faceted search § Semantic Web model
  • 4. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values
  • 5. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values § Facets in action § Choose a value
  • 6. Enhancing Keyword Search with Facets § A facet = control mechanism § Name § Set of values § Facets in action § Choose a value § Restrict search result § Advantages of facets § Allow to say what you really mean § Give control over search
  • 7. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 8. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 9. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French Find 4-star hotels with French restaurants
  • 10. Faceted Search in the Nutshell stars 3-stars restaurant § Search over one set of items § Items annotated with § Strings § Search result: subset of items Asian Italian 4-stars 5-stars French output Find 4-star hotels with French restaurants
  • 11. F-Search is the De Facto Standard
  • 12. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations 4-stars French stars restaurant type walking distance to French restaurant is a Restaurant that offers French cuisine. FrenchRestaurant ⊑ Restaurant ⊓ ∃ offers.FrenchCuisine
  • 13. Enhancing Search with SW in Practice
  • 14. Enhancing Search with SW in Practice
  • 15. Enhancing Search with SW in Practice Hello, my name is John Doe. I study at the University if Dreams. My daughter is Alice.... embedding semantic annotations <section itemscope itemtype = "http://dava-vocabulary.org/Person" itemid = "http://myitems/john-doe-1234" > Hello, my name is <span itemprop="name">John Doe</span>. I study at the <span itemprop="affiliation">University of Dreams</span> My daughter is <span itemtype = "http://dava-vocabulary.org/children" itemid = "http://myitems/alice-doe-5678" > Alice </span> ....
  • 16. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations from 2011 to 2012 the fraction of structured data went from 3.5% to 13%
  • 17. Semantic Web Models § RDF data model § objects annotated with strings and objects § OWL 2 ontologies § structure vocabularies of annotations from 2011 to 2012 the fraction of structured data went from 3.5% to 13%
  • 18. How to Improve Search Experience? § Improve the search paradigm § End-user oriented query formulation interfaces § Faceted Search § Improve the data model § Semantic Web models § RDF Data § OWL 2 ontologies § Our proposal: § Semantic Faceted Search that combines § Faceted search § Semantic Web model
  • 19. Semantic Faceted Search in the Nutshell 4-stars stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 20. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 21. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 22. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to
  • 23. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 24. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 25. Semantic Faceted Search in the Nutshell stars 3-stars § Search over several sets of items § Items annotated with § Strings § Items § Search result: § user-chosen subset of items 4-stars 5-stars Asian Italian French restaurant Find 4-star hotels with French restaurants that are walking distance to Eiffel tower type walking distance to output
  • 26. Research Contributions § Solid foundation for Semantic F-Search § Projection of ontologies on graph data structures § Allows to incorporate ontologies into faceted search § Gives better faceted interfaces politicians Search More Focus type USpres Country More Focus More Focus Remove More Focus Remove § Generate more facets / Prune irrelevant facets § Scalable algorithms to § generate and update facets from § Data and Ontologies § Algorithms to evaluate faceted queries over semantic data § Exploits bottom up query evaluation http://en.wikipedia.org/wiki/Bill_Clinton William Jefferson "Bill" Clinton (born William Jefferson Blythe III; August 19, 1946) is an American politician who served as the 42nd President of the United States from 1993 to 2001. Inaugurated at age 46, he was the third-youngest president. He took office at the end of the Cold War, and was the first president of the baby boomer generation... has child ANY Remove Remove is graduated from Stanford Uni. is graduated from Stanford Uni. Harvard Uni. Georgetown Uni.
  • 27. SemFacet System § Integration of § Keyword search and § Semantic faceted search § Main features § Automatic generation of f-search interfaces over RDF data and OWL 2 ontologies § In memory § Online and offline reasoning § Efficient on millions of triples § Flexible configuration § Interchangeable triple stores § RDFOX, PAGOdA, Hermit, Sesame § Configurable answers (snippets) § Support of Or and And facets Faceted Query Interface Answers as Snippets Presentation Layer Application Layer Data Layer Facet Generator Query Converter Snippet Generator Triple Store: Ontology Data Keyword Based Search KBS Engine Inverted Index e.g. DBpedia Abstracts RDFOX, PAGOdA, Hermit, Sesame
  • 28. SemFacet Team § Marcelo Arenas § Bernardo Cuenca Grau § Evgeny Kharlamov § Sarunas Marciuska § Dmitriy Zheleznyakov