SlideShare une entreprise Scribd logo
1  sur  57
Télécharger pour lire hors ligne
Search
is the new UI
OpenSource Connections
@DanielBeach
@DanielBeach

OpenSource Connections
‣ Search strategist @ 

OpenSource Connections
‣ Elasticsearch + AngularJS

video course
‣ Spyglass search UI
‣ New dad
“Search results”
google.com
Powered by Search
‣ Search results
‣ Data visualization dashboards
‣ Recommendation engines
‣ Anomaly detection
Not your typical search interface.
parse.ly
Search Misconceptions
‣ Search only powers 

“search pages”
‣ Search is best left to Google
‣ Search is only useful for 

full-text documents
Movie Data
tmdb.com
Typical Search Inputs
‣ Text entered by a user
‣ Selected filters
Passive Search Inputs
‣ Current time & events
‣ The user’s location
‣ How recently was the content
published?
‣ The popularity of an item
Time Data
google.com
Talk Overview
‣ Principles of good search
‣ How search engines work
‣ Front-end search patterns
GENERAL SEARCH PRINCIPLES
Search is an opportunity
to have a conversation.
‣ Users are telling you what
they expect to find on your
site
‣ Search is an opportunity to
learn about your users /
customers
Natural
searches
sound hound
Principles of 

Search Design
‣ Values search
‣ Listens to your users
‣ Shows relevant content
‣ Gets out of their way
Search is about Clarity
‣ The relevancy of the results 

is more important than the
design of the results.
This is what search should be.
amazon.com
Only Show what is
Important
‣ If you have twenty average
results, but one result is
statistically more important,
don’t show the others.
Grouping by
category
canopy.co
Accommodate Multiple
Types of Searching
‣ Informational users are
interested in the breadth of
your data
‣ Navigational users want to 

get somewhere fast
Highlighting the important
lonelyplanet.com
Bad Search
‣ Invalid results
‣ No results
‣ Confusing result hierarchy
‣ Visually messy results
‣ No clear input or submit
Good Search Design
‣ Showing relevant data to
users
‣ Only shows useful /
actionable results
‣ Visually clean results
‣ Differentiated results
‣ Recognizable input and submit
HOW SEARCH ENGINES WORK
How humans see text
–Arthur Clark
How a search engine sees text
This is called text analysis, and it happens at indexing time.
Indexed terms
(simplification)
Query Analysis
A compatible analysis chain must be used for the query string as the indexed content in
order for terms to match.
TERM MEANING
Tokenization
Splitting text into indexable pieces, called tokens.A word is often an
example of a token.
Stemming
Collapsing words to their root (interpretation, interpreting -->
interpret)
Inverted index An index of tokens. Maps tokens to document position
Term frequency The number of times a token occurs in a document
Inverse document
frequency
Tokens that appear in fewer documents are calculated to be more
important (simplified)
Document Scoring
‣ Score based on number of
matches in a document, as
compared to the popularity of
that term across all
documents.
Types of Boosts
‣ Field boosts
‣ Text matching (title^5
description)
‣ Function scores
‣ Boost newer content
‣ Multiply by % of popularity
‣ …
Minimum Match
‣ How many of the query terms
have to match in order for a
document to be returned?
‣ Precision vs recall
FRONT-END SEARCH PATTERNS
Search patterns
Search request
{
"query": {
"multi_match": {
“fields": ["title^5", "description”],
“query": “descender”,
"minimum_should_match": "2<-1 5<70%"
}
}
}
Response
{
"took": 47,
"timed_out": false,
"hits": {
"total": 2,
"max_score": 2.17284,
"hits": [
{
"_index": "catalog",
"_type": "comics",
"_id": "84",
"_score": 2.17284,
"_source": {
"title": "Descender",
"description": "One young robot’s struggle to stay alive
in a universe where all androids have been outlawed and bounty
hunters lurk on every planet."
...
Aggregations
{
query: query,
aggs: {
"comic formats": {
terms: {field: "format"}
}
}
}
Aggregations Request
Aggregations
Response
{
"hits":{
"total":2,
"hits":[ ... ],
"aggregations":{
“comic types":{
"buckets":[{
"key":"Trades",
"doc_count":63},
{
"key":"Graphic Novels",
"doc_count":35
},
{
"key":"Compilations",
"doc_count":9
}]}}}
Filtering
{
filtered: {
query: query,
filter: {
bool: {
must: [
{term: {"illustrator": "Dustin
Nguyen"}};
]
}
}}}
Search categories
etsy.com
Showing relative document counts
assignment.uspto.gov
Autocomplete
Autocomplete
Suggestions
‣ Spelling corrections
‣ More popular phrases
Autocomplete Request
{
"query": {
"simple_query_string": {
"fields": ['title'],
"query": baseTerms + '(' + lastTerm
+ '|' + lastTerm + '*)',
"default_operator": "and"
}
},
"size": 3,
"_source": ["title"]
}
Autocomplete – disambiguation
google.com
Suggestion Request
{
"query": query,
"suggest": {
"text": searchTerms,
"phraseSuggestion": {
"phrase": {
"field": "title",
"direct_generator": [{
"field": "title",
"suggest_mode": "popular",
"min_word_length": 3
}]
}}}}
Highlighting
Highlighting
‣ Highlight search terms
‣ Snippets for large blocks of
text
Highlighting Request
{
query: query,
highlight: {
fields: {
"title": {number_of_fragments:
0},
"detailed description":
{number_of_fragments: 0}
}
}
}
Highlighting Response
{
"took":28,
"hits":{
"total":2,
"max_score":1.44856,
"hits":[{
"_index":"catalog",
"_type":"comics",
"_id":"84",
"_score":1.44856,
"_source": {
"title": "Descender"
},
"highlight":{
"title":["<em>Descender</em>"]}
}
]}}}
Result highlighting
library.oreilly.com
Load more
Loading More Results
{
"query": {"match_all": {}},
"from": resultsPage * 10,
"sort": "fieldName desc",
}
Think outside the
search box
‣ Search gives you extreme
flexibility to return relevant
content quickly, given a wide
range of inputs
‣ Retrieval and ranking engine
@DanielBeach
OpenSource Connections
Search
is the new UI
OpenSource Connections
@DanielBeach

Contenu connexe

Tendances

Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic search
lucenerevolution
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Lucidworks
 

Tendances (17)

Boosting Documents in Solr by Recency, Popularity and Personal Preferences - ...
Boosting Documents in Solr by Recency, Popularity and Personal Preferences - ...Boosting Documents in Solr by Recency, Popularity and Personal Preferences - ...
Boosting Documents in Solr by Recency, Popularity and Personal Preferences - ...
 
Click-through relevance ranking in solr &  lucid works enterprise - By Andrz...
 Click-through relevance ranking in solr &  lucid works enterprise - By Andrz... Click-through relevance ranking in solr &  lucid works enterprise - By Andrz...
Click-through relevance ranking in solr &  lucid works enterprise - By Andrz...
 
Webinar: Modern Techniques for Better Search Relevance with Fusion
Webinar: Modern Techniques for Better Search Relevance with FusionWebinar: Modern Techniques for Better Search Relevance with Fusion
Webinar: Modern Techniques for Better Search Relevance with Fusion
 
Whitepaper- Real World Search
Whitepaper-  Real World SearchWhitepaper-  Real World Search
Whitepaper- Real World Search
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
 
State-of-the-Art Drupal Search with Apache Solr
State-of-the-Art Drupal Search with Apache SolrState-of-the-Art Drupal Search with Apache Solr
State-of-the-Art Drupal Search with Apache Solr
 
Boosting Documents in Solr by Recency, Popularity, and User Preferences
Boosting Documents in Solr by Recency, Popularity, and User PreferencesBoosting Documents in Solr by Recency, Popularity, and User Preferences
Boosting Documents in Solr by Recency, Popularity, and User Preferences
 
Webinar: Event Processing & Data Analytics with Lucidworks Fusion
Webinar: Event Processing & Data Analytics with Lucidworks FusionWebinar: Event Processing & Data Analytics with Lucidworks Fusion
Webinar: Event Processing & Data Analytics with Lucidworks Fusion
 
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesIntroduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and Usecases
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache Solr
 
A Multifaceted Look At Faceting - Ted Sullivan, Lucidworks
A Multifaceted Look At Faceting - Ted Sullivan, LucidworksA Multifaceted Look At Faceting - Ted Sullivan, Lucidworks
A Multifaceted Look At Faceting - Ted Sullivan, Lucidworks
 
Enhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic searchEnhancing relevancy through personalization & semantic search
Enhancing relevancy through personalization & semantic search
 
Solr: 4 big features
Solr: 4 big featuresSolr: 4 big features
Solr: 4 big features
 
Solr 6.0 Graph Query Overview
Solr 6.0 Graph Query OverviewSolr 6.0 Graph Query Overview
Solr 6.0 Graph Query Overview
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
 
E commerce Search using Apache Solr
E commerce Search using Apache SolrE commerce Search using Apache Solr
E commerce Search using Apache Solr
 
Do you need an external search platform for Adobe Experience Manager?
Do you need an external search platform for Adobe Experience Manager?Do you need an external search platform for Adobe Experience Manager?
Do you need an external search platform for Adobe Experience Manager?
 

Similaire à Search is the UI

Charting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data MiningCharting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data Mining
Valeria de Paiva
 
Information Retrieval (for beginners)
Information Retrieval (for beginners)Information Retrieval (for beginners)
Information Retrieval (for beginners)
James Melzer
 
How search engines work Anand Saini
How search engines work Anand SainiHow search engines work Anand Saini
How search engines work Anand Saini
Dr,Saini Anand
 

Similaire à Search is the UI (20)

Search is the new UI
Search is the new UISearch is the new UI
Search is the new UI
 
Searchland: Search quality for Beginners
Searchland: Search quality for BeginnersSearchland: Search quality for Beginners
Searchland: Search quality for Beginners
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and Retrieval
 
Charting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data MiningCharting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data Mining
 
Searchland2
Searchland2Searchland2
Searchland2
 
Information Retrieval (for beginners)
Information Retrieval (for beginners)Information Retrieval (for beginners)
Information Retrieval (for beginners)
 
Search Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your CustomersSearch Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your Customers
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
How search engines work Anand Saini
How search engines work Anand SainiHow search engines work Anand Saini
How search engines work Anand Saini
 
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureUsing Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
 
You Don't Know SEO
You Don't Know SEOYou Don't Know SEO
You Don't Know SEO
 
Search Analytics for Content Strategists
Search Analytics for Content StrategistsSearch Analytics for Content Strategists
Search Analytics for Content Strategists
 
SDSC18 and DSATL Meetup March 2018
SDSC18 and DSATL Meetup March 2018 SDSC18 and DSATL Meetup March 2018
SDSC18 and DSATL Meetup March 2018
 
Search Analytics for Fun and Profit
Search Analytics for Fun and ProfitSearch Analytics for Fun and Profit
Search Analytics for Fun and Profit
 
Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge Management
 
Getting the Most out of Type-Ahead/Autocomplete - LavaCon 2015 propsoal by Br...
Getting the Most out of Type-Ahead/Autocomplete - LavaCon 2015 propsoal by Br...Getting the Most out of Type-Ahead/Autocomplete - LavaCon 2015 propsoal by Br...
Getting the Most out of Type-Ahead/Autocomplete - LavaCon 2015 propsoal by Br...
 
Introducing Featured Search - Talk on the TYPO3 CAMP MALLORCA 2015
Introducing Featured Search - Talk on the TYPO3 CAMP MALLORCA 2015Introducing Featured Search - Talk on the TYPO3 CAMP MALLORCA 2015
Introducing Featured Search - Talk on the TYPO3 CAMP MALLORCA 2015
 
UX: internal search for e-commerce
UX: internal search for e-commerceUX: internal search for e-commerce
UX: internal search for e-commerce
 
Search Enginesv2
Search Enginesv2Search Enginesv2
Search Enginesv2
 

Dernier

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Search is the UI