SlideShare une entreprise Scribd logo
1  sur  21
2013

A Collaborative Document Ranking
Model for a Multi-Faceted Search

Laure Soulier
Lynda Tamine and Wahiba Bahsoun

09/12/2013 – AIRS 2013
Collaborative Information Retrieval

Overview
1.
2.
3.
4.
5.
6.

From individual to collaborative IR
Related Work
Research Questions
CIR Model for Multi-Faceted Search
Experimental Evaluation
Conclusion and Future Work

2
From Individual to Collaborative
Information Retrieval (CIR)
Information need

Information
retrieval system

Complex or exploratory tasks

Collaboration paradigms

[Shah, 2012]

[Foley et al., 2009; Morris and
Teevans, 2010]

Shared
information need

Collaborative
information
retrieval
system

.
.
.

- Division of labor
- Sharing of knowledge
- Awareness
Synergic Effect [Shah, 2012]

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

3
cost

From Individual to Collaborative
Information Retrieval (CIR)

.
.
.

Hubble
telescope
achievement

gravitational lenses

new cosmological
theories

Shared
information need

Collaborative
information
retrieval
system

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

4
Related Work: multi-faceted search
individual vs. collaborative search
•

Individual-based search
– Identifying query facets for facet-based retrieval
•
•
•

Terminological resources [Dakka et al., 2006]
Navigation-based classification [Dou et al., 2011]
Probabilistic model [Blei et al., 2003; Deerwester et al., 1990]

– Result diversification [Carbonell et al., 1998; Wang et al., 2012]

•

Collaborative-based search
– Relevance feedback-based CIR models [Foley et al., 2009;
Morris et al., 2008]

– Role-based CIR models [Pickens et al., 2008; Shah et al., 2010]

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

5
Facet 8

Research Questions

Facet 6

- How to leverage experts’ domain
knowledge for collaboratively ranking
documents?

Facet 4

Information
need

- How to mine query facets?

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

6
CIR Model for Multi-Faceted Search
Query Facet
Mining

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

7

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

7
CIR Model for Multi-Faceted Search
Query Facet
Mining

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

• Document diversification and facet mining
Shared
information need

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...

[Carbonell and Goldstein, 1998]

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...

Diversification

Latent Dirichlet
Allocation
[Blei et al., 2003]

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...
...

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...

Likelihood optimization

t1 t2 t3 t4 t5 t6 t7 t8 t9 ...
...

8

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

8
CIR Model for Multi-Faceted Search
Query Facets
Mining

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

• Estimating the document relevance towards each expert

9

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

9
CIR Model for Multi-Faceted Search
Query Facet
Mining

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

• Assigning a document to the most suitable expert considering experts' domain
expertise towards the query facets
 Document allocation

 Division of labor:
Intersection of document lists
simultaneously displayed to
experts is empty
10

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

10
Experimental Evaluation
Dataset and Collaboration simulation
•

Dataset
–
–
–

•

TREC Interactive 6-7-8
210 158 articles
277 collaborative search sessions from 20 TREC topics

Collaboration-simulation based framework [Foley et al., 2009]

For each TREC query Q:

All possible combinations of
size m>1 of domain experts
Domain experts

2-means
classification

…
…

Domain novices

…
CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

11
Experimental Evaluation
Collaboration simulation
For each TREC query Q and each expert group: building the timeline of relevance feedback

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

12
Experimental Evaluation
Baselines
Query Facet
Mining

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

Division of
Labor

• W/oEMDoL: the individual version of our model which
integrates only the expert-based document scoring.

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

13
Experimental Evaluation
Baselines
Mining
Query Facets

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

Division of
Labor

• W/oEMDoL: the individual version of our model which
integrates only the expert-based document scoring.
• W/oDoL: our collaborative-based model by excluding the
division of labor principle.

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

14
Experimental Evaluation
Baselines
Mining
Query Facets

Expert-based
Document
Scoring

Expert-Based
Document
Allocation

Division of
Labor

• W/oEMDoL: the individual version of our model which
integrates only the expert-based document scoring.
• W/oDoL: our collaborative-based model by excluding the
division of labor principle.
• W/oEM: our collaborative-based model by excluding the
expert-based document allocation.
CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

15
Experimental Evaluation
Metrics
•

Metrics
–

Relevance judgments: relevance feedback provided by TREC participants
including an agreement level

–

Measures [Shah and Gonzalez-Ibanez, 2011]
Precision: the more document lists include
relevant documents, the higher the precision is.
Coverage: the more document lists are diversified,
the higher the coverage is.
Relevant coverage: the more document lists include
diversified and relevant documents, the higher the
relevant coverage is.

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

16
Experimental Evaluation
Results
• 2-cross validation
– 2 random subsets of collaborative search sessions split on the basis of
TREC queries

• Parameter Tuning
–
–
–
–

Diversification: γ=1
LDA modeling: likelihood maximal at 200 topics
Number of considered facets for topical-based modeling: 5
Expert-based document scoring: λ=0.6

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

17
Experimental Evaluation
Results

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

18
Experimental Evaluation
Results
• Complementary Analysis

• Retrieval effectiveness is generally
stable even with the increasing size
of the group

• Higher the agreement level is,
fewer documents are assessed as
relevant
 favors the search failure

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

19
Conclusion and Future Work
• A 2-step collaborative ranking model for satisfying a multifaceted information need considering a group of experts.
– Expert-based document scoring
– Expert-based document allocation by means of the ExpectationMaximization learning method.

• Evaluation through a collaboration simulation-based
framework showing effective results.
• Future work:
– Design of other formal methods to emphasize division of labor
– Modeling of user profile through his behavior in addition to his
relevance feedback.
20

CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion

20
Collaborative Information
Retrieval

Many thanks for your attention

http://www.irit.fr/~Laure.Soulier/

http://www.linkedin.com/pub/laure-soulier/48/299/188

Contenu connexe

Tendances

Transformation Functions for Text Classification: A case study with StackOver...
Transformation Functions for Text Classification: A case study with StackOver...Transformation Functions for Text Classification: A case study with StackOver...
Transformation Functions for Text Classification: A case study with StackOver...Sebastian Ruder
 
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...IJERA Editor
 
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftStrong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftSebastian Ruder
 
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...Dr. Haxel Consult
 
Probabilistic retrieval model
Probabilistic retrieval modelProbabilistic retrieval model
Probabilistic retrieval modelbaradhimarch81
 
Analyzing observational data during qualitative research
Analyzing observational data during qualitative researchAnalyzing observational data during qualitative research
Analyzing observational data during qualitative researchWafa Iqbal
 
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...James Mullooly PhD
 
Neural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftNeural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftSebastian Ruder
 
Technological Route between Pioneerism and Improvement
Technological Route between Pioneerism and ImprovementTechnological Route between Pioneerism and Improvement
Technological Route between Pioneerism and ImprovementRoberto Nani
 

Tendances (10)

Chapter8.coding
Chapter8.codingChapter8.coding
Chapter8.coding
 
Transformation Functions for Text Classification: A case study with StackOver...
Transformation Functions for Text Classification: A case study with StackOver...Transformation Functions for Text Classification: A case study with StackOver...
Transformation Functions for Text Classification: A case study with StackOver...
 
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
A Comparative Study of Centroid-Based and Naïve Bayes Classifiers for Documen...
 
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftStrong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
 
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
 
Probabilistic retrieval model
Probabilistic retrieval modelProbabilistic retrieval model
Probabilistic retrieval model
 
Analyzing observational data during qualitative research
Analyzing observational data during qualitative researchAnalyzing observational data during qualitative research
Analyzing observational data during qualitative research
 
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...
Using Qualitative Data Analysis Software By Michelle C. Bligh, Ph.D., Claremo...
 
Neural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftNeural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain Shift
 
Technological Route between Pioneerism and Improvement
Technological Route between Pioneerism and ImprovementTechnological Route between Pioneerism and Improvement
Technological Route between Pioneerism and Improvement
 

En vedette

Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...
Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...
Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...UPMC - Sorbonne Universities
 
Understanding the Impact of the Role Factor in Collaborative Information Retr...
Understanding the Impact of the Role Factor in Collaborative Information Retr...Understanding the Impact of the Role Factor in Collaborative Information Retr...
Understanding the Impact of the Role Factor in Collaborative Information Retr...UPMC - Sorbonne Universities
 
Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...UPMC - Sorbonne Universities
 
Collaborative Information Retrieval: Concepts, Models and Evaluation
Collaborative Information Retrieval: Concepts, Models and EvaluationCollaborative Information Retrieval: Concepts, Models and Evaluation
Collaborative Information Retrieval: Concepts, Models and EvaluationUPMC - Sorbonne Universities
 
Un modèle de recherche d’information collaborative basé sur l’expertise des u...
Un modèle de recherche d’information collaborative basé sur l’expertise des u...Un modèle de recherche d’information collaborative basé sur l’expertise des u...
Un modèle de recherche d’information collaborative basé sur l’expertise des u...UPMC - Sorbonne Universities
 
Answering Twitter Questions: a Model for Recommending Answerers through Socia...
Answering Twitter Questions: a Model for Recommending Answerers through Socia...Answering Twitter Questions: a Model for Recommending Answerers through Socia...
Answering Twitter Questions: a Model for Recommending Answerers through Socia...UPMC - Sorbonne Universities
 
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Lamjed Ben Jabeur
 
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Lamjed Ben Jabeur
 

En vedette (8)

Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...
Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...
Collaborative Information Retrieval: Frameworks, Theoretical Models and Emerg...
 
Understanding the Impact of the Role Factor in Collaborative Information Retr...
Understanding the Impact of the Role Factor in Collaborative Information Retr...Understanding the Impact of the Role Factor in Collaborative Information Retr...
Understanding the Impact of the Role Factor in Collaborative Information Retr...
 
Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...Thesis slides - Definition and evluation of collaborative information retriev...
Thesis slides - Definition and evluation of collaborative information retriev...
 
Collaborative Information Retrieval: Concepts, Models and Evaluation
Collaborative Information Retrieval: Concepts, Models and EvaluationCollaborative Information Retrieval: Concepts, Models and Evaluation
Collaborative Information Retrieval: Concepts, Models and Evaluation
 
Un modèle de recherche d’information collaborative basé sur l’expertise des u...
Un modèle de recherche d’information collaborative basé sur l’expertise des u...Un modèle de recherche d’information collaborative basé sur l’expertise des u...
Un modèle de recherche d’information collaborative basé sur l’expertise des u...
 
Answering Twitter Questions: a Model for Recommending Answerers through Socia...
Answering Twitter Questions: a Model for Recommending Answerers through Socia...Answering Twitter Questions: a Model for Recommending Answerers through Socia...
Answering Twitter Questions: a Model for Recommending Answerers through Socia...
 
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?Quels facteurs de pertinence pour la recherche de produits e-commerce ?
Quels facteurs de pertinence pour la recherche de produits e-commerce ?
 
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
Accès à l’information dans les réseaux sociaux : quelles formes de collaborat...
 

Similaire à A Collaborative Document Ranking Model for a Multi-Faceted Search

Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementNick Lynch
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise SearchFindwise
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesToine Bogers
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineeringalessio_ferrari
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
SC13 BoF: RDA and HPC
SC13 BoF: RDA and HPCSC13 BoF: RDA and HPC
SC13 BoF: RDA and HPCJohn Cobb
 
Evaluating Search Performance
Evaluating Search PerformanceEvaluating Search Performance
Evaluating Search PerformanceIngo Frommholz
 
Uk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Uk Research Infrastructure Workshop E-infrastructure Juan BicarreguiUk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Uk Research Infrastructure Workshop E-infrastructure Juan BicarreguiInnovate UK
 
RDM requirements gathering with DAF
RDM requirements gathering with DAFRDM requirements gathering with DAF
RDM requirements gathering with DAFSarah Jones
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012srosenblatt
 
Business case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceBusiness case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceJisc RDM
 
Evaluating Search Performance
Evaluating Search PerformanceEvaluating Search Performance
Evaluating Search PerformanceIngo Frommholz
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processLouise Corti
 
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline Users
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline UsersWorkflow Reuse in Practice: A Study of Neuroimaging Pipeline Users
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline Usersdgarijo
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012srosenblatt
 

Similaire à A Collaborative Document Ranking Model for a Multi-Faceted Search (20)

Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
 
Introduction to SciVal
Introduction to SciValIntroduction to SciVal
Introduction to SciVal
 
Chap_006.ppt
Chap_006.pptChap_006.ppt
Chap_006.ppt
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Analysis as KM
Analysis as KMAnalysis as KM
Analysis as KM
 
Benchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program CommitteesBenchmarking Domain-specific Expert Search using Workshop Program Committees
Benchmarking Domain-specific Expert Search using Workshop Program Committees
 
Case Study Research in Software Engineering
Case Study Research in Software EngineeringCase Study Research in Software Engineering
Case Study Research in Software Engineering
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
SC13 BoF: RDA and HPC
SC13 BoF: RDA and HPCSC13 BoF: RDA and HPC
SC13 BoF: RDA and HPC
 
Evaluating Search Performance
Evaluating Search PerformanceEvaluating Search Performance
Evaluating Search Performance
 
Uk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Uk Research Infrastructure Workshop E-infrastructure Juan BicarreguiUk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
Uk Research Infrastructure Workshop E-infrastructure Juan Bicarregui
 
RDM requirements gathering with DAF
RDM requirements gathering with DAFRDM requirements gathering with DAF
RDM requirements gathering with DAF
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012
 
Business case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM serviceBusiness case and cost modelling for an end-to-end RDM service
Business case and cost modelling for an end-to-end RDM service
 
Evaluating Search Performance
Evaluating Search PerformanceEvaluating Search Performance
Evaluating Search Performance
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
 
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline Users
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline UsersWorkflow Reuse in Practice: A Study of Neuroimaging Pipeline Users
Workflow Reuse in Practice: A Study of Neuroimaging Pipeline Users
 
Action research for_librarians_carl2012
Action research for_librarians_carl2012Action research for_librarians_carl2012
Action research for_librarians_carl2012
 

Dernier

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Dernier (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

A Collaborative Document Ranking Model for a Multi-Faceted Search

  • 1. 2013 A Collaborative Document Ranking Model for a Multi-Faceted Search Laure Soulier Lynda Tamine and Wahiba Bahsoun 09/12/2013 – AIRS 2013
  • 2. Collaborative Information Retrieval Overview 1. 2. 3. 4. 5. 6. From individual to collaborative IR Related Work Research Questions CIR Model for Multi-Faceted Search Experimental Evaluation Conclusion and Future Work 2
  • 3. From Individual to Collaborative Information Retrieval (CIR) Information need Information retrieval system Complex or exploratory tasks Collaboration paradigms [Shah, 2012] [Foley et al., 2009; Morris and Teevans, 2010] Shared information need Collaborative information retrieval system . . . - Division of labor - Sharing of knowledge - Awareness Synergic Effect [Shah, 2012] CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 3
  • 4. cost From Individual to Collaborative Information Retrieval (CIR) . . . Hubble telescope achievement gravitational lenses new cosmological theories Shared information need Collaborative information retrieval system CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 4
  • 5. Related Work: multi-faceted search individual vs. collaborative search • Individual-based search – Identifying query facets for facet-based retrieval • • • Terminological resources [Dakka et al., 2006] Navigation-based classification [Dou et al., 2011] Probabilistic model [Blei et al., 2003; Deerwester et al., 1990] – Result diversification [Carbonell et al., 1998; Wang et al., 2012] • Collaborative-based search – Relevance feedback-based CIR models [Foley et al., 2009; Morris et al., 2008] – Role-based CIR models [Pickens et al., 2008; Shah et al., 2010] CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 5
  • 6. Facet 8 Research Questions Facet 6 - How to leverage experts’ domain knowledge for collaboratively ranking documents? Facet 4 Information need - How to mine query facets? CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 6
  • 7. CIR Model for Multi-Faceted Search Query Facet Mining Expert-based Document Scoring Expert-Based Document Allocation 7 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 7
  • 8. CIR Model for Multi-Faceted Search Query Facet Mining Expert-based Document Scoring Expert-Based Document Allocation • Document diversification and facet mining Shared information need t1 t2 t3 t4 t5 t6 t7 t8 t9 ... t1 t2 t3 t4 t5 t6 t7 t8 t9 ... [Carbonell and Goldstein, 1998] t1 t2 t3 t4 t5 t6 t7 t8 t9 ... Diversification Latent Dirichlet Allocation [Blei et al., 2003] t1 t2 t3 t4 t5 t6 t7 t8 t9 ... ... t1 t2 t3 t4 t5 t6 t7 t8 t9 ... Likelihood optimization t1 t2 t3 t4 t5 t6 t7 t8 t9 ... ... 8 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 8
  • 9. CIR Model for Multi-Faceted Search Query Facets Mining Expert-based Document Scoring Expert-Based Document Allocation • Estimating the document relevance towards each expert 9 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 9
  • 10. CIR Model for Multi-Faceted Search Query Facet Mining Expert-based Document Scoring Expert-Based Document Allocation • Assigning a document to the most suitable expert considering experts' domain expertise towards the query facets  Document allocation  Division of labor: Intersection of document lists simultaneously displayed to experts is empty 10 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 10
  • 11. Experimental Evaluation Dataset and Collaboration simulation • Dataset – – – • TREC Interactive 6-7-8 210 158 articles 277 collaborative search sessions from 20 TREC topics Collaboration-simulation based framework [Foley et al., 2009] For each TREC query Q: All possible combinations of size m>1 of domain experts Domain experts 2-means classification … … Domain novices … CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 11
  • 12. Experimental Evaluation Collaboration simulation For each TREC query Q and each expert group: building the timeline of relevance feedback CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 12
  • 13. Experimental Evaluation Baselines Query Facet Mining Expert-based Document Scoring Expert-Based Document Allocation Division of Labor • W/oEMDoL: the individual version of our model which integrates only the expert-based document scoring. CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 13
  • 14. Experimental Evaluation Baselines Mining Query Facets Expert-based Document Scoring Expert-Based Document Allocation Division of Labor • W/oEMDoL: the individual version of our model which integrates only the expert-based document scoring. • W/oDoL: our collaborative-based model by excluding the division of labor principle. CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 14
  • 15. Experimental Evaluation Baselines Mining Query Facets Expert-based Document Scoring Expert-Based Document Allocation Division of Labor • W/oEMDoL: the individual version of our model which integrates only the expert-based document scoring. • W/oDoL: our collaborative-based model by excluding the division of labor principle. • W/oEM: our collaborative-based model by excluding the expert-based document allocation. CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 15
  • 16. Experimental Evaluation Metrics • Metrics – Relevance judgments: relevance feedback provided by TREC participants including an agreement level – Measures [Shah and Gonzalez-Ibanez, 2011] Precision: the more document lists include relevant documents, the higher the precision is. Coverage: the more document lists are diversified, the higher the coverage is. Relevant coverage: the more document lists include diversified and relevant documents, the higher the relevant coverage is. CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 16
  • 17. Experimental Evaluation Results • 2-cross validation – 2 random subsets of collaborative search sessions split on the basis of TREC queries • Parameter Tuning – – – – Diversification: γ=1 LDA modeling: likelihood maximal at 200 topics Number of considered facets for topical-based modeling: 5 Expert-based document scoring: λ=0.6 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 17
  • 18. Experimental Evaluation Results CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 18
  • 19. Experimental Evaluation Results • Complementary Analysis • Retrieval effectiveness is generally stable even with the increasing size of the group • Higher the agreement level is, fewer documents are assessed as relevant  favors the search failure CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 19
  • 20. Conclusion and Future Work • A 2-step collaborative ranking model for satisfying a multifaceted information need considering a group of experts. – Expert-based document scoring – Expert-based document allocation by means of the ExpectationMaximization learning method. • Evaluation through a collaboration simulation-based framework showing effective results. • Future work: – Design of other formal methods to emphasize division of labor – Modeling of user profile through his behavior in addition to his relevance feedback. 20 CIR | Related Work | Research questions | CIR for multi-faceted search | Experiments | Conclusion 20
  • 21. Collaborative Information Retrieval Many thanks for your attention http://www.irit.fr/~Laure.Soulier/ http://www.linkedin.com/pub/laure-soulier/48/299/188