SlideShare une entreprise Scribd logo
1  sur  7
Developing and Testing Search 
Engine Algorithms – 
Counterintuitive observations due to end 
user behavior. Suggestions. 
(This presentation is not representing views of my current employer) 
CHRISTIAN VON REVENTLOW (VONREVENTLOW) 
VONREVENTLOW@YAHOO.COM, +1 201 259 5973
Use a data driven feedback loop to 
evolve and test search engine algorithms 
Search Engine 
Algorithm 
Queries 
Query 
results Tester 
or 
End user 
Search 
End-User 
Interface 
Queries 
Results 
Rate results 
Data Scientists, 
Algorithm- and 
Software 
Developers 
Software 
Compute 
Search 
Result 
Quality 
Statics 
Search Quality Metrics
Measuring search performance 
• Understanding behavior and needs of satisfied and unsatisfied search users is key for improving the 
users search experience [0] 
• Satisfaction/dissatisfaction data is used to evolve and optimize the search algorithms [1]. 
• Traces from end users or a subset thereof 
• Testers creating sample queries. 
• Metrics like MAP (mean average precision) or NDCG (normalized discounted cumulative gain) had 
been used to measure search quality. [2]. A Click-through was used to judge relevance of results. 
• Nowadays metrics use the entire sequence of events in a search. An example is modelling search 
logs in Markov Models to get estimators of user satisfaction or dissatisfaction [1,3]. 
[0] A Dasdan, K Tsioutsiouliklis, E. Velipasaoglu. “Web Search Engine Metrics”, WWW2010 
[1] A Hassan, Y Song, Li-Wei He. “A Task Level Metric for Improving Web Search Satisfaction and its Application on Improving Relevance Estimation”, ACM CIKM’11 Oct 2011 
[2] K. Jaervelin, J. Kekalainen. “Cumulated gain based evaluation of IR techniques”, ACM TOIS 2002 
[3] A. Hassan, R. White. “Personalized Models of Search Satisfaction”, CIKM Nov 2013
Specialization improves end user 
satisfaction in search 
• Economic theory: specialized search engines deliver an 
advantage – specifically when its not only about attracting 
as many searchers but satisfying as many of them [1]. 
• Real data: shows user satisfaction increases when grouping 
users in cohorts with similar topical interests and optimize 
for each relevant cohort [2]. 
• Best by combining search results that are valid for 
everybody (Global) and the specific cohort only (personal). 
• The Counterintuitive: User satisfaction is better when 
optimizing search for the cohort – vs. optimizing for each 
individual. 
Users profit from the larger feedback dataset to the search 
algorithms provided by a cohort of similar people. 
[1] D. Kempe, B. Lucier. “User Satisfaction in Competitive Sponsored Search”. Cornell University. 
arXiv:1310.4098v1 [cs.GT] 
[2] A. Hassan, R. White. “Personalized Models of Search Satisfaction”. CIKM Nov 2013 
6.00% 
5.00% 
4.00% 
3.00% 
2.00% 
1.00% 
0.00% 
Example percentage gain in accuracy 
vs optimizing for single target audience 
Optimize by Topic Optimize by larger 
Cohort 
Optimize by smaller 
Cohort 
Global & Personal combined Personal only
Its not sufficient that “the right result” is 
part of the results list 
• Studies have shown that users are only interested in 
the first few results – thus high accuracy is desirable 
[1] 
• Temporal and geographic relevance, coverage, 
comprehensiveness, rapid discovery of new content, 
content freshness and diversity are vectors relevant 
Result 1- relevant 
for users [2] 
Result 2 - right results 
Result 3 – wrong/irrelevant 
• Users search environment has a major impact – like 
search on a mobile device vs. search from a tablet – 
Result 4 – relevant 
requiring specialization. 
Result 5 - relevant 
• The counterintuitive: Even if the “right result” is part 
of the first few results - having irrelevant/perceived 
wrong results makes the user disbelief in the 
correctness of ALL results. 
Query results 
[1] R.W. White. D. Morris. “Investigating the querying and browsing behavior of advanced search engine users” Proc. SIGIR 
[2] A Dasdan, K Tsioutsiouliklis, E. Velipasaoglu. “Web Search Engine Metrics”, WWW2010
Don’t get fooled if your user satisfaction 
goes up – it might not be all good.. 
• Behavioral differences have been shown between novice 
and expert searchers [1] 
• Optimizing differently for experts and casual users increases 
user satisfaction. [2] 
• The Counterintuitive: User satisfaction goes up over time 
even if you do not modify the algorithms. 
• Why: 
• Users learn how to query best (i.e. become mature users) 
• Learned what not to ask – i.e. intuitively restrict the usage space 
• Or worse: defect to other search engines. 60% of switches to a 
different engine are caused by dissatisfaction [3] 
• So don’t get fooled – understand why your satisfaction went 
up… 
[1] R.W. White. D. Morris. “Investigating the querying and browsing behavior of advanced search engine 
users” Proc. SIGIR 
[2] A. Hassan, R. White. “Personalized Models of Search Satisfaction”. CIKM Nov 2013 
[3] Q. Guo, R.W. White, Y. Zhang, B. Anderson, S Dumais. “Why searchers switch: understanding and 
predicting engine switching rationales”. Proc. SIGIR 2011 
6.00% 
5.00% 
4.00% 
3.00% 
2.00% 
1.00% 
0.00% 
Example percentage gain in accuracy 
vs optimizing for single target audience 
Optimize by Topic Optimize for expert 
vs. casual user 
Optimize by larger 
Cohort 
Global & Personal combined Personal only
Resulting Strategies 
• Specialization/Focus: Get clarify of what your search engine is targeted for – vs. a general 
purpose web search. Examples are places on a map, images, research papers. 
• Cohorts: Segment your user base in cohorts and optimize for each of them. 
• Start with expert and casual user. 
• Interview users, analyze search traces, … to identify other larger cohorts. 
• Usage: Optimize for usage environment & target device. 
• Smartphone, Tablet, PC, Professional multiscreen office setup. 
• Correctness: Carefully evaluate the dissatisfying query results. And minimize them. 
• Fresh end user participants for testing: Regularly recruit new groups of users to optimize your 
algorithms – specifically people who have never used your search before.

Contenu connexe

Similaire à Developing and testing search engine algorithms –

What Is UX Research & How Is It Done.pptx
What Is UX Research & How Is It Done.pptxWhat Is UX Research & How Is It Done.pptx
What Is UX Research & How Is It Done.pptxTurboAnchor
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web SearchIJMER
 
A Systematic Literature Survey On Recommendation System
A Systematic Literature Survey On Recommendation SystemA Systematic Literature Survey On Recommendation System
A Systematic Literature Survey On Recommendation SystemGina Rizzo
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web SearchIJMER
 
IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...eSAT Publishing House
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAmit Sharma
 
User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...eSAT Publishing House
 
Personalized mobile search engine
Personalized mobile search enginePersonalized mobile search engine
Personalized mobile search engineSaurav Kumar
 
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’s
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’sCustomer Satisfaction about the Product with Special Reference to Rusee’s Food’s
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’sIRJET Journal
 
business analytics unit 1 and 3 notes.pdf
business analytics unit 1 and 3 notes.pdfbusiness analytics unit 1 and 3 notes.pdf
business analytics unit 1 and 3 notes.pdftarunprajapati0t
 
Efficient way of user search location in query processing
Efficient way of user search location in query processingEfficient way of user search location in query processing
Efficient way of user search location in query processingeSAT Publishing House
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET Journal
 
Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...IAESIJAI
 
A New Algorithm for Inferring User Search Goals with Feedback Sessions
A New Algorithm for Inferring User Search Goals with Feedback SessionsA New Algorithm for Inferring User Search Goals with Feedback Sessions
A New Algorithm for Inferring User Search Goals with Feedback SessionsIJERA Editor
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET Journal
 
USER PROFILE BASED PERSONALIZED WEB SEARCH
USER PROFILE BASED PERSONALIZED WEB SEARCHUSER PROFILE BASED PERSONALIZED WEB SEARCH
USER PROFILE BASED PERSONALIZED WEB SEARCHijmpict
 

Similaire à Developing and testing search engine algorithms – (20)

What Is UX Research & How Is It Done.pptx
What Is UX Research & How Is It Done.pptxWhat Is UX Research & How Is It Done.pptx
What Is UX Research & How Is It Done.pptx
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web Search
 
A Systematic Literature Survey On Recommendation System
A Systematic Literature Survey On Recommendation SystemA Systematic Literature Survey On Recommendation System
A Systematic Literature Survey On Recommendation System
 
Personalization of the Web Search
Personalization of the Web SearchPersonalization of the Web Search
Personalization of the Web Search
 
IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographics
 
Ac02411221125
Ac02411221125Ac02411221125
Ac02411221125
 
User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...
 
Web personalization
Web personalizationWeb personalization
Web personalization
 
Personalized mobile search engine
Personalized mobile search enginePersonalized mobile search engine
Personalized mobile search engine
 
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’s
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’sCustomer Satisfaction about the Product with Special Reference to Rusee’s Food’s
Customer Satisfaction about the Product with Special Reference to Rusee’s Food’s
 
business analytics unit 1 and 3 notes.pdf
business analytics unit 1 and 3 notes.pdfbusiness analytics unit 1 and 3 notes.pdf
business analytics unit 1 and 3 notes.pdf
 
Efficient way of user search location in query processing
Efficient way of user search location in query processingEfficient way of user search location in query processing
Efficient way of user search location in query processing
 
50120140506005 2
50120140506005 250120140506005 2
50120140506005 2
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
 
Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...
 
A New Algorithm for Inferring User Search Goals with Feedback Sessions
A New Algorithm for Inferring User Search Goals with Feedback SessionsA New Algorithm for Inferring User Search Goals with Feedback Sessions
A New Algorithm for Inferring User Search Goals with Feedback Sessions
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
 
USER PROFILE BASED PERSONALIZED WEB SEARCH
USER PROFILE BASED PERSONALIZED WEB SEARCHUSER PROFILE BASED PERSONALIZED WEB SEARCH
USER PROFILE BASED PERSONALIZED WEB SEARCH
 

Dernier

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 

Dernier (20)

Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 

Developing and testing search engine algorithms –

  • 1. Developing and Testing Search Engine Algorithms – Counterintuitive observations due to end user behavior. Suggestions. (This presentation is not representing views of my current employer) CHRISTIAN VON REVENTLOW (VONREVENTLOW) VONREVENTLOW@YAHOO.COM, +1 201 259 5973
  • 2. Use a data driven feedback loop to evolve and test search engine algorithms Search Engine Algorithm Queries Query results Tester or End user Search End-User Interface Queries Results Rate results Data Scientists, Algorithm- and Software Developers Software Compute Search Result Quality Statics Search Quality Metrics
  • 3. Measuring search performance • Understanding behavior and needs of satisfied and unsatisfied search users is key for improving the users search experience [0] • Satisfaction/dissatisfaction data is used to evolve and optimize the search algorithms [1]. • Traces from end users or a subset thereof • Testers creating sample queries. • Metrics like MAP (mean average precision) or NDCG (normalized discounted cumulative gain) had been used to measure search quality. [2]. A Click-through was used to judge relevance of results. • Nowadays metrics use the entire sequence of events in a search. An example is modelling search logs in Markov Models to get estimators of user satisfaction or dissatisfaction [1,3]. [0] A Dasdan, K Tsioutsiouliklis, E. Velipasaoglu. “Web Search Engine Metrics”, WWW2010 [1] A Hassan, Y Song, Li-Wei He. “A Task Level Metric for Improving Web Search Satisfaction and its Application on Improving Relevance Estimation”, ACM CIKM’11 Oct 2011 [2] K. Jaervelin, J. Kekalainen. “Cumulated gain based evaluation of IR techniques”, ACM TOIS 2002 [3] A. Hassan, R. White. “Personalized Models of Search Satisfaction”, CIKM Nov 2013
  • 4. Specialization improves end user satisfaction in search • Economic theory: specialized search engines deliver an advantage – specifically when its not only about attracting as many searchers but satisfying as many of them [1]. • Real data: shows user satisfaction increases when grouping users in cohorts with similar topical interests and optimize for each relevant cohort [2]. • Best by combining search results that are valid for everybody (Global) and the specific cohort only (personal). • The Counterintuitive: User satisfaction is better when optimizing search for the cohort – vs. optimizing for each individual. Users profit from the larger feedback dataset to the search algorithms provided by a cohort of similar people. [1] D. Kempe, B. Lucier. “User Satisfaction in Competitive Sponsored Search”. Cornell University. arXiv:1310.4098v1 [cs.GT] [2] A. Hassan, R. White. “Personalized Models of Search Satisfaction”. CIKM Nov 2013 6.00% 5.00% 4.00% 3.00% 2.00% 1.00% 0.00% Example percentage gain in accuracy vs optimizing for single target audience Optimize by Topic Optimize by larger Cohort Optimize by smaller Cohort Global & Personal combined Personal only
  • 5. Its not sufficient that “the right result” is part of the results list • Studies have shown that users are only interested in the first few results – thus high accuracy is desirable [1] • Temporal and geographic relevance, coverage, comprehensiveness, rapid discovery of new content, content freshness and diversity are vectors relevant Result 1- relevant for users [2] Result 2 - right results Result 3 – wrong/irrelevant • Users search environment has a major impact – like search on a mobile device vs. search from a tablet – Result 4 – relevant requiring specialization. Result 5 - relevant • The counterintuitive: Even if the “right result” is part of the first few results - having irrelevant/perceived wrong results makes the user disbelief in the correctness of ALL results. Query results [1] R.W. White. D. Morris. “Investigating the querying and browsing behavior of advanced search engine users” Proc. SIGIR [2] A Dasdan, K Tsioutsiouliklis, E. Velipasaoglu. “Web Search Engine Metrics”, WWW2010
  • 6. Don’t get fooled if your user satisfaction goes up – it might not be all good.. • Behavioral differences have been shown between novice and expert searchers [1] • Optimizing differently for experts and casual users increases user satisfaction. [2] • The Counterintuitive: User satisfaction goes up over time even if you do not modify the algorithms. • Why: • Users learn how to query best (i.e. become mature users) • Learned what not to ask – i.e. intuitively restrict the usage space • Or worse: defect to other search engines. 60% of switches to a different engine are caused by dissatisfaction [3] • So don’t get fooled – understand why your satisfaction went up… [1] R.W. White. D. Morris. “Investigating the querying and browsing behavior of advanced search engine users” Proc. SIGIR [2] A. Hassan, R. White. “Personalized Models of Search Satisfaction”. CIKM Nov 2013 [3] Q. Guo, R.W. White, Y. Zhang, B. Anderson, S Dumais. “Why searchers switch: understanding and predicting engine switching rationales”. Proc. SIGIR 2011 6.00% 5.00% 4.00% 3.00% 2.00% 1.00% 0.00% Example percentage gain in accuracy vs optimizing for single target audience Optimize by Topic Optimize for expert vs. casual user Optimize by larger Cohort Global & Personal combined Personal only
  • 7. Resulting Strategies • Specialization/Focus: Get clarify of what your search engine is targeted for – vs. a general purpose web search. Examples are places on a map, images, research papers. • Cohorts: Segment your user base in cohorts and optimize for each of them. • Start with expert and casual user. • Interview users, analyze search traces, … to identify other larger cohorts. • Usage: Optimize for usage environment & target device. • Smartphone, Tablet, PC, Professional multiscreen office setup. • Correctness: Carefully evaluate the dissatisfying query results. And minimize them. • Fresh end user participants for testing: Regularly recruit new groups of users to optimize your algorithms – specifically people who have never used your search before.