SlideShare a Scribd company logo
1 of 24
Personalized Diversification of
                        Search Results

                                  David Vallet, Pablo Castells
                              Universidad Autónoma de Madrid
                             {david.vallet,pablo.castells}@uam.es


                                            Personalized Diversification of Search Results
IRG
IR Group @ UAM
                   34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                 Portland, Oregon, 15th August 2012
Classic Web Search Model

                 Query: Queen




                   ?




                                                Personalized Diversification of Search Results
IRG
IR Group @ UAM
                       34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                     Portland, Oregon, 15th August 2012                                2
Classic Web Search Model

                 Query: Queen




                   ?




                                                Personalized Diversification of Search Results
IRG
IR Group @ UAM
                       34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                     Portland, Oregon, 15th August 2012                                3
Personalized Web Search Model

                   Query: Queen




                     Personalized
                       ordering



         User
         Profile




                                                Personalized Diversification of Search Results
IRG
IR Group @ UAM
                       34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                     Portland, Oregon, 15th August 2012                                4
Diversified Web Search Model

                 Query: Queen




                     Diversification
                         model


                 ?




                                                Personalized Diversification of Search Results
IRG
IR Group @ UAM
                       34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                     Portland, Oregon, 15th August 2012                                5
Personalized Diversification of Web Search?
                 Query: Queen
                                                                     Personalization
                       User                                          Tailor the results to the specific interests of the user
                       Profile                                       Relies on accurate user profile
                                                                     Risk of being perceived as intrusive by the user




                                                                      Diversification
                                                                      Cover all interpretations of the query in the first
                                                                      results
                                                                      Maximize probability of showing an interpretation
                                                                      relevant to the user
                                    Diversification                   Outliers
                                        model                         Users not finding a relevant result in the top
                      ?                                               positions
                                                   Personalized Diversification of Search Results
IRG
IR Group @ UAM
                          34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                        Portland, Oregon, 15th August 2012                                6
Diversified Web Search Model: SoA diversification approaches




                              document relevance                                    novelty




                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                7
Diversified Web Search Model: SoA diversification approaches




                                 document query                    document topic
                                    relevance                         relevance                            novelty




                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                    8
Diversified Web Search Model: Diversification VS Personalization




 baseline                          Diversification                                                               Personalization
                                    (IA-Select)                                                                     (BM-25)
                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                      9
Diversified Web Search Model: Diversification VS Personalization




 baseline                          Diversification                           Diversified                         Personalization
                                    (IA-Select)                            personalization                          (BM-25)
                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                      10
Diversified Web Search Model: Diversification VS Personalization




                            Small study: what do users prefer?
                            • If profile is perfectly (explicitly) defined
                                personalization over any diversification
                            • If profile has errors
                                personalized diversity over full
                                   personalization or diversification



 baseline                          Diversification                           diversified                         Personalization
                                    (IA-Select)                            personalization                          (BM-25)
                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                      11
Personalized Diversification of Web Search: Personalized IA-Select



          IA-SELECT                   document relevance                                    novelty




          Personalized IA-SELECT
                 – Adding a user component




                                                   Personalized Diversification of Search Results
IRG
IR Group @ UAM
                          34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                        Portland, Oregon, 15th August 2012                                12
Personalized Diversification of Web Search: Personalized xQuAD



                                          document query                    document topic
          xQuAD                             relevance                         relevance                            novelty




          Personalized xQuAD
                 – Adding a user component




                                                   Personalized Diversification of Search Results
IRG
IR Group @ UAM
                          34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                        Portland, Oregon, 15th August 2012                                    13
Personalized Diversification of Web Search: Model
          Personalized IA-SELECT




          Personalized xQuAD



          Estimations




                                              Personalized Diversification of Search Results
IRG
IR Group @ UAM
                     34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                   Portland, Oregon, 15th August 2012                                14
Personalized Diversification of Web Search: Model estimations




                  delicious

                                          Personalized Diversification of Search Results
IRG
IR Group @ UAM
                 34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                               Portland, Oregon, 15th August 2012                                15
Personalized Diversification of Web Search: Model estimations




           1     http://www.textwise.com
                                                        Personalized Diversification of Search Results
IRG
IR Group @ UAM
                               34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                             Portland, Oregon, 15th August 2012                                16
How to evaluate?


                 delicious                         Evaluation topic



                                                                       ?


           Accuracy assessments                                                      Diversity assessments
                 – Document relevance assessments                                            – Document/aspect assessments




  Crowdsourced evaluation
                                                  Personalized Diversification of Search Results
IRG
IR Group @ UAM
                         34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                       Portland, Oregon, 15th August 2012                                17
Personalized Diversification of Web Search: Online Evaluation

         Social User
          Profile u                                  Social tagging                                 t1,d      t2,d              tl,d
 …




                                                                                                                      • User annotations
                                                                                                                      • Search result annotations         4
   t1                       1                                                     d1
                                                                                             2                                   3                      d'1
                                                                                                                                       Depth-5
  t2               Top k                                       Search                                                                  pooling
                 popular tags                                  results            d2                                                                    d'2
                                      Web Search                                                        Result reordering




                                                                               …




                                                                                                                                                   …
 …




                                                                                  dn                                                                d‘P’
  t1                                                                                                                   • User classification
                                                                                                                       • Search result classification
 …




                                                Document classifier                                c1,d      c2,d               cc,d




                    Users get to evaluate known topics
                    Must be done live, cannot make workers wait


                                                         Personalized Diversification of Search Results
IRG
IR Group @ UAM
                                34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                              Portland, Oregon, 15th August 2012                                       18
Crowdsourced Evaluation: UI Desing

          Assessments
                 – Q1: user relevance
                                          Accuracy (F-measure)
                 – Q2: topic relevance
                 – Q3: document-category classification: subjective to users, foster the reuse of
                   categories (avg. 5 categories per topic)




          Include a training task for workers (they appreciate it)

                                                    Personalized Diversification of Search Results
IRG
IR Group @ UAM
                           34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                         Portland, Oregon, 15th August 2012                                19
Personalized Diversification of Web Search: Crowdsourced Evaluation

          Collected data
                 –   Period: 4 weeks
                 –   35 Delicious users: bookmarks, top tags, tag assignments, etc.
                 –   180 search topics
                 –   Over 3800 relevance judgments
          Provided as evaluation and development dataset
                 – http://ir.ii.uam.es/dvallet/persdivers




                                                     Personalized Diversification of Search Results
IRG
IR Group @ UAM
                            34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                          Portland, Oregon, 15th August 2012                                20
Personalized Diversification of Web Search: Results
     Accuracy metrics

                                             Topic relevance                   User relevance                      F(Topic,User)




                                                nDCG@5




                                                                                 nDCG@5




                                                                                                                   nDCG@5
                                                                 P@5




                                                                                                  P@5




                                                                                                                                 P@5
                  Bing                       0.393            0.918           0.330            0.702              0.359      0.796
                  IA-Select                  0.363            0.911           0.294            0.664              0.325      0.768
      Diversity
                  xQuAD                      0.381            0.911           0.320            0.670              0.348      0.772
                  Personalization            0.388            0.933           0.357            0.746              0.372      0.829
                  PIA-Select                 0.331            0.861           0.306            0.652              0.318      0.742
   Personalized   PIA-SelectBM25             0.363            0.892           0.345            0.670              0.354      0.766
     Diversity    PxQuAD                     0.395            0.930           0.337            0.714              0.364      0.807
                  PxQuADBM25                 0.405            0.939           0.361            0.730              0.382      0.821


                    Statistically significant                                                        Statistically significant
                    with respect to Baseline                                                         with respect to xQuAD

                                           Personalized Diversification of Search Results
IRG
IR Group @ UAM
                  34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                Portland, Oregon, 15th August 2012                                          21
Personalized Diversification of Web Search: Results
     Diversity metrics
                                                           Topic relevance                                   User relevance




                                                                 α-nDCG@5




                                                                                                                   α-nDCG@5
                                                ERR-IA@5




                                                                                                  ERR-IA@5
                                                                                 S-recall@5




                                                                                                                                   S-recall@5
                  Bing                       0.274             0.787          0.500            0.254              0.626        0.475
      Diversity   IA-Select                  0.262             0.758          0.463            0.237              0.582        0.426
                  xQuAD                      0.275             0.793          0.510            0.257              0.624        0.478
                  Personalization            0.267             0.778          0.486            0.262              0.646        0.483
                  PIA-Select                 0.251             0.742          0.489            0.239              0.593        0.481
   Personalized   PIA-SelectBM25             0.279             0.803          0.543            0.273              0.656        0.521
     Diversity    PxQuAD                     0.275             0.796          0.503            0.268              0.649        0.489
                  PxQuADBM25                 0.281             0.810          0.519            0.267              0.658        0.496


                    Statistically significant                                                           Statistically significant
                    with respect to Baseline                                                            with respect to xQuAD

                                           Personalized Diversification of Search Results
IRG
IR Group @ UAM
                  34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                Portland, Oregon, 15th August 2012                                            22
Conclusions

          Adapted SoA diversification techniques to include a personalized factor
          Presented an example of personalized diversification of Web search
                 – Personalization: social tagging
                 – Diversification: ODP Web categories
                  Effectively combined both benefits of personalization and diversification
          Crowdsourced evaluation approach for diversity and personalized
           techniques
                 –   Great if you have a lot of restrictions
                 –   Repeatable
                 –   Cheap
                 –   …Although can be difficult to set up




                                                      Personalized Diversification of Search Results
IRG
IR Group @ UAM
                             34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                           Portland, Oregon, 15th August 2012                                23
Thank you!!!



                                                                Thank you!




      References
                 – [Vallet-10]: David Vallet, Iván Cantador, Joemon M. Jose: Personalizing
                   Web Search with Folksonomy-Based User and Document Profiles. ECIR
                   2010: 420-431

                                                    Personalized Diversification of Search Results
IRG
IR Group @ UAM
                           34th International ACM SIGIR Conference on Research and Development in Information Retrieval,
                                                         Portland, Oregon, 15th August 2012

More Related Content

Similar to Personalized Diversification of Search Results

Information Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT CoimbatoreInformation Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT Coimbatoreveningstonk
 
The Conclusion for sigir 2011
The Conclusion for sigir 2011The Conclusion for sigir 2011
The Conclusion for sigir 2011Yueshen Xu
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...Pablo Castells
 
Customisable cross-database Bio2RDF queries
Customisable cross-database Bio2RDF queriesCustomisable cross-database Bio2RDF queries
Customisable cross-database Bio2RDF queriesPeter Ansell
 
CIP Genebank and DOIs
CIP Genebank and DOIsCIP Genebank and DOIs
CIP Genebank and DOIsEdwin Rojas
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadatadgarijo
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Softwaredgarijo
 
Information Retrieval Using an Ontological Web-Trading Model
Information Retrieval Using an Ontological Web-Trading ModelInformation Retrieval Using an Ontological Web-Trading Model
Information Retrieval Using an Ontological Web-Trading ModelApplied Computing Group
 
Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Alexandre Passant
 
Gra implementations perbix_search
Gra implementations perbix_searchGra implementations perbix_search
Gra implementations perbix_searchICJIA Webmaster
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRBMichael Zimmer
 
Malicious Link Detection System
Malicious Link Detection SystemMalicious Link Detection System
Malicious Link Detection SystemIRJET Journal
 
Lessons for an enterprise approach to modeling at scale
Lessons for an enterprise approach to modeling at scaleLessons for an enterprise approach to modeling at scale
Lessons for an enterprise approach to modeling at scaleSigOpt
 
랭킹 최적화를 넘어 인간적인 검색으로 - 서울대 융합기술원 발표
랭킹 최적화를 넘어 인간적인 검색으로  - 서울대 융합기술원 발표랭킹 최적화를 넘어 인간적인 검색으로  - 서울대 융합기술원 발표
랭킹 최적화를 넘어 인간적인 검색으로 - 서울대 융합기술원 발표Jin Young Kim
 
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storage
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud StorageA Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storage
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storageijtsrd
 
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Umair ul Hassan
 
Diversity
DiversityDiversity
DiversityTelnet
 
Introduction on Science Gateway
Introduction on Science GatewayIntroduction on Science Gateway
Introduction on Science Gatewayriround
 
A Model for Encryption of a Text Phrase using Genetic Algorithm
A Model for Encryption of a Text Phrase using Genetic AlgorithmA Model for Encryption of a Text Phrase using Genetic Algorithm
A Model for Encryption of a Text Phrase using Genetic Algorithmijtsrd
 
Rethinking Microblogging: Open Distributed Semantic
Rethinking Microblogging: Open Distributed SemanticRethinking Microblogging: Open Distributed Semantic
Rethinking Microblogging: Open Distributed SemanticAlexandre Passant
 

Similar to Personalized Diversification of Search Results (20)

Information Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT CoimbatoreInformation Retrieval AICTE FDP at GCT Coimbatore
Information Retrieval AICTE FDP at GCT Coimbatore
 
The Conclusion for sigir 2011
The Conclusion for sigir 2011The Conclusion for sigir 2011
The Conclusion for sigir 2011
 
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...SIGIR 2012 - Explicit Relevance Models in Intent-Oriented  Information Retrie...
SIGIR 2012 - Explicit Relevance Models in Intent-Oriented Information Retrie...
 
Customisable cross-database Bio2RDF queries
Customisable cross-database Bio2RDF queriesCustomisable cross-database Bio2RDF queries
Customisable cross-database Bio2RDF queries
 
CIP Genebank and DOIs
CIP Genebank and DOIsCIP Genebank and DOIs
CIP Genebank and DOIs
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadata
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
 
Information Retrieval Using an Ontological Web-Trading Model
Information Retrieval Using an Ontological Web-Trading ModelInformation Retrieval Using an Ontological Web-Trading Model
Information Retrieval Using an Ontological Web-Trading Model
 
Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Hello Open World - Semtech 2009
Hello Open World - Semtech 2009
 
Gra implementations perbix_search
Gra implementations perbix_searchGra implementations perbix_search
Gra implementations perbix_search
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRB
 
Malicious Link Detection System
Malicious Link Detection SystemMalicious Link Detection System
Malicious Link Detection System
 
Lessons for an enterprise approach to modeling at scale
Lessons for an enterprise approach to modeling at scaleLessons for an enterprise approach to modeling at scale
Lessons for an enterprise approach to modeling at scale
 
랭킹 최적화를 넘어 인간적인 검색으로 - 서울대 융합기술원 발표
랭킹 최적화를 넘어 인간적인 검색으로  - 서울대 융합기술원 발표랭킹 최적화를 넘어 인간적인 검색으로  - 서울대 융합기술원 발표
랭킹 최적화를 넘어 인간적인 검색으로 - 서울대 융합기술원 발표
 
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storage
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud StorageA Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storage
A Data Sharing Protocol to Minimize Security and Privacy Risks in Cloud Storage
 
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
 
Diversity
DiversityDiversity
Diversity
 
Introduction on Science Gateway
Introduction on Science GatewayIntroduction on Science Gateway
Introduction on Science Gateway
 
A Model for Encryption of a Text Phrase using Genetic Algorithm
A Model for Encryption of a Text Phrase using Genetic AlgorithmA Model for Encryption of a Text Phrase using Genetic Algorithm
A Model for Encryption of a Text Phrase using Genetic Algorithm
 
Rethinking Microblogging: Open Distributed Semantic
Rethinking Microblogging: Open Distributed SemanticRethinking Microblogging: Open Distributed Semantic
Rethinking Microblogging: Open Distributed Semantic
 

Recently uploaded

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Personalized Diversification of Search Results

  • 1. Personalized Diversification of Search Results David Vallet, Pablo Castells Universidad Autónoma de Madrid {david.vallet,pablo.castells}@uam.es Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012
  • 2. Classic Web Search Model Query: Queen ? Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 2
  • 3. Classic Web Search Model Query: Queen ? Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 3
  • 4. Personalized Web Search Model Query: Queen Personalized ordering User Profile Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 4
  • 5. Diversified Web Search Model Query: Queen Diversification model ? Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 5
  • 6. Personalized Diversification of Web Search? Query: Queen  Personalization User Tailor the results to the specific interests of the user Profile Relies on accurate user profile Risk of being perceived as intrusive by the user  Diversification Cover all interpretations of the query in the first results Maximize probability of showing an interpretation relevant to the user Diversification Outliers model Users not finding a relevant result in the top ? positions Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 6
  • 7. Diversified Web Search Model: SoA diversification approaches document relevance novelty Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 7
  • 8. Diversified Web Search Model: SoA diversification approaches document query document topic relevance relevance novelty Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 8
  • 9. Diversified Web Search Model: Diversification VS Personalization baseline Diversification Personalization (IA-Select) (BM-25) Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 9
  • 10. Diversified Web Search Model: Diversification VS Personalization baseline Diversification Diversified Personalization (IA-Select) personalization (BM-25) Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 10
  • 11. Diversified Web Search Model: Diversification VS Personalization Small study: what do users prefer? • If profile is perfectly (explicitly) defined  personalization over any diversification • If profile has errors  personalized diversity over full personalization or diversification baseline Diversification diversified Personalization (IA-Select) personalization (BM-25) Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 11
  • 12. Personalized Diversification of Web Search: Personalized IA-Select  IA-SELECT document relevance novelty  Personalized IA-SELECT – Adding a user component Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 12
  • 13. Personalized Diversification of Web Search: Personalized xQuAD document query document topic  xQuAD relevance relevance novelty  Personalized xQuAD – Adding a user component Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 13
  • 14. Personalized Diversification of Web Search: Model  Personalized IA-SELECT  Personalized xQuAD  Estimations Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 14
  • 15. Personalized Diversification of Web Search: Model estimations delicious Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 15
  • 16. Personalized Diversification of Web Search: Model estimations 1 http://www.textwise.com Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 16
  • 17. How to evaluate? delicious Evaluation topic ?  Accuracy assessments  Diversity assessments – Document relevance assessments – Document/aspect assessments  Crowdsourced evaluation Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 17
  • 18. Personalized Diversification of Web Search: Online Evaluation Social User Profile u Social tagging t1,d t2,d tl,d … • User annotations • Search result annotations 4 t1 1 d1 2 3 d'1 Depth-5 t2 Top k Search pooling popular tags results d2 d'2 Web Search Result reordering … … … dn d‘P’ t1 • User classification • Search result classification … Document classifier c1,d c2,d cc,d Users get to evaluate known topics Must be done live, cannot make workers wait Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 18
  • 19. Crowdsourced Evaluation: UI Desing  Assessments – Q1: user relevance Accuracy (F-measure) – Q2: topic relevance – Q3: document-category classification: subjective to users, foster the reuse of categories (avg. 5 categories per topic)  Include a training task for workers (they appreciate it) Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 19
  • 20. Personalized Diversification of Web Search: Crowdsourced Evaluation  Collected data – Period: 4 weeks – 35 Delicious users: bookmarks, top tags, tag assignments, etc. – 180 search topics – Over 3800 relevance judgments  Provided as evaluation and development dataset – http://ir.ii.uam.es/dvallet/persdivers Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 20
  • 21. Personalized Diversification of Web Search: Results Accuracy metrics Topic relevance User relevance F(Topic,User) nDCG@5 nDCG@5 nDCG@5 P@5 P@5 P@5 Bing 0.393 0.918 0.330 0.702 0.359 0.796 IA-Select 0.363 0.911 0.294 0.664 0.325 0.768 Diversity xQuAD 0.381 0.911 0.320 0.670 0.348 0.772 Personalization 0.388 0.933 0.357 0.746 0.372 0.829 PIA-Select 0.331 0.861 0.306 0.652 0.318 0.742 Personalized PIA-SelectBM25 0.363 0.892 0.345 0.670 0.354 0.766 Diversity PxQuAD 0.395 0.930 0.337 0.714 0.364 0.807 PxQuADBM25 0.405 0.939 0.361 0.730 0.382 0.821 Statistically significant Statistically significant with respect to Baseline with respect to xQuAD Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 21
  • 22. Personalized Diversification of Web Search: Results Diversity metrics Topic relevance User relevance α-nDCG@5 α-nDCG@5 ERR-IA@5 ERR-IA@5 S-recall@5 S-recall@5 Bing 0.274 0.787 0.500 0.254 0.626 0.475 Diversity IA-Select 0.262 0.758 0.463 0.237 0.582 0.426 xQuAD 0.275 0.793 0.510 0.257 0.624 0.478 Personalization 0.267 0.778 0.486 0.262 0.646 0.483 PIA-Select 0.251 0.742 0.489 0.239 0.593 0.481 Personalized PIA-SelectBM25 0.279 0.803 0.543 0.273 0.656 0.521 Diversity PxQuAD 0.275 0.796 0.503 0.268 0.649 0.489 PxQuADBM25 0.281 0.810 0.519 0.267 0.658 0.496 Statistically significant Statistically significant with respect to Baseline with respect to xQuAD Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 22
  • 23. Conclusions  Adapted SoA diversification techniques to include a personalized factor  Presented an example of personalized diversification of Web search – Personalization: social tagging – Diversification: ODP Web categories  Effectively combined both benefits of personalization and diversification  Crowdsourced evaluation approach for diversity and personalized techniques – Great if you have a lot of restrictions – Repeatable – Cheap – …Although can be difficult to set up Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012 23
  • 24. Thank you!!! Thank you!  References – [Vallet-10]: David Vallet, Iván Cantador, Joemon M. Jose: Personalizing Web Search with Folksonomy-Based User and Document Profiles. ECIR 2010: 420-431 Personalized Diversification of Search Results IRG IR Group @ UAM 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Portland, Oregon, 15th August 2012

Editor's Notes

  1. Personalizedia-selecttopicweightissustitutedbyuserpreferencetotopic
  2. Personalizedia-selecttopicweightissustitutedbyuserpreferencetotopic
  3. Personalizedia-selecttopicweightissustitutedbyuserpreferencetotopic
  4. Onephrasemaliciousworkers
  5. Randlinski and Dumais