SlideShare une entreprise Scribd logo
1  sur  37
Télécharger pour lire hors ligne
Exploiting Distributional Semantic
 Models in Question Answering
          Piero Molino1,2, Pierpaolo Basile1,2,
 Annalina Caputo1, Pasquale Lops1, Giovanni Semeraro1
                          {basilepp@di.uniba.it}
        1Dept. of   Computer Science, University of Bari Aldo Moro
                           2QuestionCube s.r.l.




                        6th IEEE International Conference on Semantic Computing
                        September 19-21, 2012 Palermo, Italy
Background
    A bottle of Tesgüino is on the table.
         Everyone likes Tesgüino.
        Tesgüino makes you drunk.
      We make Tesgüino out of corn.
Background
    A bottle of Tesgüino is on the table.
         Everyone likes Tesgüino.
        Tesgüino makes you drunk.
      We make Tesgüino out of corn.



It is a corn beer
Background
You shall know a word by
the company it keeps!

Meaning of a word is
determined by its usage




                           4
Background
Distributional Semantic Models (DSMs)
   – represent words as points in a geometric space
   – do not require specific text operations
   – corpus/language independent
Widely used in IR and Computational Linguistic
   – semantic text similarity, synonyms detection, query
     expansion, …
Not directly used in Question Answering (QA)



                                                           5
Our idea
Using DSMs into a QA system
  – QuestionCube: a framework for QA
Exploit DSMs
  – compare semantic similarity between user’s question
    and candidate answer
Our idea
Using DSMs into a QA system
  – QuestionCube: a framework for QA
Exploit DSMs
  – compare semantic similarity between user’s question
    and candidate answer

     Q: Which beverages contain alcohol?
        A: Tesgüino makes you drunk.
Outline
• Introduction to the QA problem
• Distributional Semantic Models
  – semantic composition
• DSMs into QuestionCube
• Evaluation and results
• Final remark
QA vs. IR
                          Question Answering          Information Retrieval


Information need          Natural language question   Keywords


                          Factoid answer or short
Result                                                Whole documents
                          passages


                                                      Searching manually inside
Information acquisition   Reading answers
                                                      documents
DISTRIBUTIONAL SEMANTIC
MODELS
Distributional Semantic Models…
Defined as: <T, C, R, W, M, d, S>
  – T: target elements (words)
  – C: context
  – R: relation between T and C
  – W: weighting schema
  – M: geometric space TxC
  – d: space reduction M -> M’
  – S: similarity function defined in M’
…Distributional Semantic Models
Term-term co-occurrence matrix: each cell
contains the co-occurrences between two terms
within a prefixed distance
               dog   cat   computer   animal   mouse
      dog      0     4        0         2        1
      cat      4     0        0         3        5
    computer   0     0        0         0        3
     animal    2     3        0         0        2
     mouse     1     5        3         2        0
…Distributional Semantic Models
TTM: term-term co-occurrence matrix
Latent Semantic Analysis (LSA): relies on the
Singular Value Decomposition (SVD) of the co-
occurrence matrix
Random Indexing (RI): based on the Random
Projection
Latent Semantic Analysis over Random Indexing
(LSARI)
Latent Semantic Analysis
Given M the TTM matrix
  – apply the SVD decomposition M=UΣVT
     • Σ is the matrix of singular values
  – choose k singular values from Σ and approximate
    the matrix M
     • M*=U*Σ*V*T
  – SVD
     • discover high-order relations between terms
     • reduce the sparseness of the original matrix
Random Indexing
1. Generate and assign a context vector to each
   context element (e.g. document, passage,
   term, …)
2. Term vector is the sum of the context vectors
   in which the term occurs
  – sometimes the context vector could be boosted by
    a score (e.g. term frequency, PMI, …)



                                                   15
Context Vector

    0 0 0 0 0 0 0 -1 0 0 0 0 1 0 0 -1 0 1 0 0 0 0 1 0 0 0 0 -1



•   sparse
•   high dimensional
•   ternary {-1, 0, +1}
•   small number of randomly distributed non-
    zero elements
                                                                 16
Random Indexing (formal)


                                    B =A R
                                      n,k       n,m   m,k
                                                             k << m
                                    B nearly preserves the
                                    distance between points
                                     (Johnson-Lindenstrauss lemma)
                                    dr = c× d
RI is a locality-sensitive hashing method which approximate the cosine
distance between vectors
                                                                     17
Random Indexing (example)

                                CVjohn -> (0, 0, 0, 0, 0, 0, 1, 0, -1, 0)
John eats a red apple           CVeat -> (1, 0, 0, 0, -1, 0, 0 ,0 ,0 ,0)
                                CVred -> (0, 0, 0, 1, 0, 0, 0, -1, 0, 0)



TVapple= CVjohn+ CVeat+ CVred=(1, 0, 0, 1, -1, 0, 1, -1, -1, 0)




                                                                     18
Latent Semantic Analysis over Random
Indexing
1. Reduce the dimension of the co-occurrences
   matrix using RI
2. Perform LSA over RI (LSARI)
  – reduction of LSA computation time: RI matrix
    contains less dimensions than co-occurrences
    matrix
QUESTIONCUBE FRAMEWORK
QuestionCube framework…
Natural Language pipeline for Italian and English
  – PoS-tagging, lemmatization, Name Entity Recognition,
    Phrase Chunking, Dependency Parsing, Word Sense
    Disambiguation (WSD)
IR model based on classical VSM or BM25
  – several query expansion strategies
Passage retrieval
…QuestionCube framework
  Question
                             Passage Filter

  Question                    Term filter
  analysis
                                              Ranked list of
                             N-gram filter
                                                passages
  Question
expansion and                Density filter
categorization
                 Candidate     Sequence
                 passages        filter

 Document
  retrieval
                              Score norm

    Index
DSMS INTO QUESTIONCUBE
Integration
   Question
                              Passage Filter

   Question                    Term filter
   analysis
                                               Ranked list of
                              N-gram filter
                                                 passages
   Question
 expansion and                Density filter
 categorization
                  Candidate     Sequence
                  passages        filter
                              Distributional
  Document                         filter
   retrieval
                                                   New filter
                               Score norm        based on DSMs

     Index
Distributional filter…
Compute the similarity between the user’s
question and each candidate answer
Both user’s question and candidate answer are
represented by more than one term
  – a method to compose words is necessary
…Distributional filter…
 candidate     Question                    candidate
  answer1                                   answer3


 candidate                                 candidate
  answer2                                   answer5

             Distributional
 candidate                    re-ranking   candidate
                  filter
  answer3                                   answer1
                 DSM

             compositional
               approach

 candidate                                 candidate
  answern                                   answern
…Distributional filter (compositional
approach)
Addition (+): pointwise sum of components
  – represent complex structures by summing words
    which compose them
Given q=q1 q2 … qn and a=a1 a2 … am
                                          
  – question:    q     q1        q2         qn
                                                   
  – candidate answer:            a     a1   a2       am
                                       
                                      q a
  – similarity   sim ( q , a )         
                                      q a
EVALUATION AND RESULTS
Evaluation…
Dataset: 2010 CLEF QA Competition
  – 10,700 documents
       • from European Union legislation and European Parliament
         transcriptions
  – 200 questions
  – Italian and English
DSMs
  – 1,000 vector dimension (TTM/LSA/RI/RILSA)
  – 50,000 most frequent words
  – co-occurrence distance: 4

                                                                   29
...Evaluation
1. Effectiveness of DSMs in QuestionCube
2. Comparison between the several DSMs
   adopted
Metrics
  – a@n: accuracy taking into account only the first n
    answers
                                    N
                                     1
  – MRR based on the rank
                                  rank
    of the correct answer       i       1       i

                                            N


                                                         30
Evaluation (alone scenario)
Only distributional filter   Passage Filter

is adopted: no other          Term filter
filters in the pipeline
                             N-gram filter


                             Density filter

                               Sequence
                                 filter
                             Distributional
                                  filter

                              Score norm

                                              31
Evaluation (combined)
Distributional filter is       Passage Filter

combined with the other         Term filter
filters
  – using CombSum function     N-gram filter

  – baseline: distributional   Density filter
    filter is removed
                                 Sequence
                                   filter
                               Distributional
                                    filter

                                Score norm

                                                32
Results (English)…
          Run        a@1    a@5    a@10      a@30         MRR
alone     TTM        .060   .145   .215      .345         .107
          RI         .180   .370   .425      .535         .267
          LSA        .205   .415   .490      .600         .300
          LSARI      .190   .405   .490      .620         .295
combined baseline*   .445   .635   .690      .780         .549
          TTM        .535   .715   .775      .810         .6141
          RI         .550   .730   .785      .870         .6371,2
          LSA        .560   .725   .790      .855         .6371
          LSARI      .555   .730   .790      .870         .6341

                                          1significat wrt. baseline
                                          2significat wrt. TTM

                                          *without distributional filter
…Results (Italian)
          Run        a@1    a@5    a@10      a@30         MRR
alone     TTM        .060   .140   .175      .280         .097
          RI         .175   .305   .385      .465         .241
          LSA        .155   .315   .390      .480         .229
          LSARI      .180   .335   .400      .500         .254
combined baseline*   .365   .530   .630      .715         .441
          TTM        .405   .565   .645      .740         .5391
          RI         .465   .645   .720      .785         .5551
          LSA        .470   .645   .690      .785         .5511
          LSARI      .480   .635   .690      .785         .5571,2

                                          1significat wrt. baseline
                                          2significat wrt. TTM

                                          *without distributional filter
Final remarks…
Alone: all the proposed DSMs are better than
  the TTM
  – in particular LSA and LSARI
Combined: all the combinations are able to
overcome the baseline
  – English +16% (RI/LSA), Italian +26% (LSARI)
Slight difference in performance between LSA
and LSARI
…Final remarks
Distributional filter in the alone scenario shows
an higher improvement than in the combination
scenario
  – find more effective way to combine filters
    (learning to rank approach)
Exploit more complex semantic compositional
strategies
Thank you for your attention!

         Questions?
        <basilepp@di.uniba.it>

Contenu connexe

Tendances

Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksLeonardo Di Donato
 
NLP Project: Paragraph Topic Classification
NLP Project: Paragraph Topic ClassificationNLP Project: Paragraph Topic Classification
NLP Project: Paragraph Topic ClassificationEugene Nho
 
Deep Learning for Search
Deep Learning for SearchDeep Learning for Search
Deep Learning for SearchBhaskar Mitra
 
graduate_thesis (1)
graduate_thesis (1)graduate_thesis (1)
graduate_thesis (1)Sihan Chen
 
A survey on parallel corpora alignment
A survey on parallel corpora alignment A survey on parallel corpora alignment
A survey on parallel corpora alignment andrefsantos
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsRoelof Pieters
 
The Geometry of Learning
The Geometry of LearningThe Geometry of Learning
The Geometry of Learningfridolin.wild
 
Finding Co-solvers on Twitter, with the Little Help from Linked Data
Finding Co-solvers on Twitter, with the Little Help from Linked DataFinding Co-solvers on Twitter, with the Little Help from Linked Data
Finding Co-solvers on Twitter, with the Little Help from Linked DataMilan Stankovic
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 
Semantic video classification based on subtitles and domain terminologies
Semantic video classification based on subtitles and domain terminologiesSemantic video classification based on subtitles and domain terminologies
Semantic video classification based on subtitles and domain terminologiesTing Wen Su
 
Search Engines
Search EnginesSearch Engines
Search Enginesbutest
 
Deep Learning for NLP
Deep Learning for NLPDeep Learning for NLP
Deep Learning for NLPAmit Kapoor
 
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...IRJET- Automatic Language Identification using Hybrid Approach and Classifica...
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...IRJET Journal
 
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...Andre Freitas
 
Adapto\ing Rankers Online, Maarten de Rijke
Adapto\ing Rankers Online, Maarten de RijkeAdapto\ing Rankers Online, Maarten de Rijke
Adapto\ing Rankers Online, Maarten de Rijkeyaevents
 
Framester and WFD
Framester and WFD Framester and WFD
Framester and WFD Aldo Gangemi
 

Tendances (20)

Topics Modeling
Topics ModelingTopics Modeling
Topics Modeling
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
 
Topic Modeling
Topic ModelingTopic Modeling
Topic Modeling
 
NLP Project: Paragraph Topic Classification
NLP Project: Paragraph Topic ClassificationNLP Project: Paragraph Topic Classification
NLP Project: Paragraph Topic Classification
 
Deep Learning for Search
Deep Learning for SearchDeep Learning for Search
Deep Learning for Search
 
graduate_thesis (1)
graduate_thesis (1)graduate_thesis (1)
graduate_thesis (1)
 
A survey on parallel corpora alignment
A survey on parallel corpora alignment A survey on parallel corpora alignment
A survey on parallel corpora alignment
 
Parekh dfa
Parekh dfaParekh dfa
Parekh dfa
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word Embeddings
 
The Geometry of Learning
The Geometry of LearningThe Geometry of Learning
The Geometry of Learning
 
Finding Co-solvers on Twitter, with the Little Help from Linked Data
Finding Co-solvers on Twitter, with the Little Help from Linked DataFinding Co-solvers on Twitter, with the Little Help from Linked Data
Finding Co-solvers on Twitter, with the Little Help from Linked Data
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
Semantic video classification based on subtitles and domain terminologies
Semantic video classification based on subtitles and domain terminologiesSemantic video classification based on subtitles and domain terminologies
Semantic video classification based on subtitles and domain terminologies
 
Search Engines
Search EnginesSearch Engines
Search Engines
 
Deep Learning for NLP
Deep Learning for NLPDeep Learning for NLP
Deep Learning for NLP
 
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...IRJET- Automatic Language Identification using Hybrid Approach and Classifica...
IRJET- Automatic Language Identification using Hybrid Approach and Classifica...
 
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
A Distributional Semantics Approach for Selective Reasoning on Commonsense Gr...
 
Adapto\ing Rankers Online, Maarten de Rijke
Adapto\ing Rankers Online, Maarten de RijkeAdapto\ing Rankers Online, Maarten de Rijke
Adapto\ing Rankers Online, Maarten de Rijke
 
Framester and WFD
Framester and WFD Framester and WFD
Framester and WFD
 

Similaire à Exploiting Distributional Semantic Models in Question Answering

"SSC" - Geometria e Semantica del Linguaggio
"SSC" - Geometria e Semantica del Linguaggio"SSC" - Geometria e Semantica del Linguaggio
"SSC" - Geometria e Semantica del LinguaggioAlumni Mathematica
 
Computational Semantics
Computational SemanticsComputational Semantics
Computational SemanticsRossi Setchi
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information RetrievalNik Spirin
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysisAcad
 
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Sean Golliher
 
Introduction to k-Nearest Neighbors and Amazon SageMaker
Introduction to k-Nearest Neighbors and Amazon SageMaker Introduction to k-Nearest Neighbors and Amazon SageMaker
Introduction to k-Nearest Neighbors and Amazon SageMaker Suman Debnath
 
[WISE 2015] Similarity-Based Context-aware Recommendation
[WISE 2015] Similarity-Based Context-aware Recommendation[WISE 2015] Similarity-Based Context-aware Recommendation
[WISE 2015] Similarity-Based Context-aware RecommendationYONG ZHENG
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationMario Sangiorgio
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Charles Martin
 
Download
DownloadDownload
Downloadbutest
 
Download
DownloadDownload
Downloadbutest
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Jonathon Hare
 
ECO_TEXT_CLUSTERING
ECO_TEXT_CLUSTERINGECO_TEXT_CLUSTERING
ECO_TEXT_CLUSTERINGGeorge Simov
 
A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...Gianluca Bontempi
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learningtelss09
 
Supervised Quantization for Similarity Search (camera-ready)
Supervised Quantization for Similarity Search (camera-ready)Supervised Quantization for Similarity Search (camera-ready)
Supervised Quantization for Similarity Search (camera-ready)Xiaojuan (Kathleen) WANG
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015rusbase
 

Similaire à Exploiting Distributional Semantic Models in Question Answering (20)

"SSC" - Geometria e Semantica del Linguaggio
"SSC" - Geometria e Semantica del Linguaggio"SSC" - Geometria e Semantica del Linguaggio
"SSC" - Geometria e Semantica del Linguaggio
 
Computational Semantics
Computational SemanticsComputational Semantics
Computational Semantics
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptx
 
0 introduction
0  introduction0  introduction
0 introduction
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
 
Introduction to k-Nearest Neighbors and Amazon SageMaker
Introduction to k-Nearest Neighbors and Amazon SageMaker Introduction to k-Nearest Neighbors and Amazon SageMaker
Introduction to k-Nearest Neighbors and Amazon SageMaker
 
Clique and sting
Clique and stingClique and sting
Clique and sting
 
[WISE 2015] Similarity-Based Context-aware Recommendation
[WISE 2015] Similarity-Based Context-aware Recommendation[WISE 2015] Similarity-Based Context-aware Recommendation
[WISE 2015] Similarity-Based Context-aware Recommendation
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result Diversification
 
Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3Applied machine learning for search engine relevance 3
Applied machine learning for search engine relevance 3
 
Download
DownloadDownload
Download
 
Download
DownloadDownload
Download
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
 
ECO_TEXT_CLUSTERING
ECO_TEXT_CLUSTERINGECO_TEXT_CLUSTERING
ECO_TEXT_CLUSTERING
 
A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...A model-based relevance estimation approach for feature selection in microarr...
A model-based relevance estimation approach for feature selection in microarr...
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learning
 
Supervised Quantization for Similarity Search (camera-ready)
Supervised Quantization for Similarity Search (camera-ready)Supervised Quantization for Similarity Search (camera-ready)
Supervised Quantization for Similarity Search (camera-ready)
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
 

Plus de Pierpaolo Basile

Diachronic analysis of entities by exploiting wikipedia page revisions
Diachronic analysis of entities by exploiting wikipedia page revisionsDiachronic analysis of entities by exploiting wikipedia page revisions
Diachronic analysis of entities by exploiting wikipedia page revisionsPierpaolo Basile
 
Come l'industria tecnologica ha cancellato le donne dalla storia
Come l'industria tecnologica ha cancellato le donne dalla storiaCome l'industria tecnologica ha cancellato le donne dalla storia
Come l'industria tecnologica ha cancellato le donne dalla storiaPierpaolo Basile
 
EVALITA 2018 NLP4FUN - Solving language games
EVALITA 2018 NLP4FUN - Solving language gamesEVALITA 2018 NLP4FUN - Solving language games
EVALITA 2018 NLP4FUN - Solving language gamesPierpaolo Basile
 
Buon appetito! Analyzing Happiness in Italian Tweets
Buon appetito! Analyzing Happiness in Italian TweetsBuon appetito! Analyzing Happiness in Italian Tweets
Buon appetito! Analyzing Happiness in Italian TweetsPierpaolo Basile
 
Detecting semantic shift in large corpora by exploiting temporal random indexing
Detecting semantic shift in large corpora by exploiting temporal random indexingDetecting semantic shift in large corpora by exploiting temporal random indexing
Detecting semantic shift in large corpora by exploiting temporal random indexingPierpaolo Basile
 
Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling
Bi-directional LSTM-CNNs-CRF for Italian Sequence LabelingBi-directional LSTM-CNNs-CRF for Italian Sequence Labeling
Bi-directional LSTM-CNNs-CRF for Italian Sequence LabelingPierpaolo Basile
 
INSERT COIN - Storia dei videogame: da Spacewar a Street Fighter
INSERT COIN - Storia dei videogame: da Spacewar a Street FighterINSERT COIN - Storia dei videogame: da Spacewar a Street Fighter
INSERT COIN - Storia dei videogame: da Spacewar a Street FighterPierpaolo Basile
 
QuestionCube DigithON 2017
QuestionCube DigithON 2017QuestionCube DigithON 2017
QuestionCube DigithON 2017Pierpaolo Basile
 
Diachronic Analysis of the Italian Language exploiting Google Ngram
Diachronic Analysis of the Italian Language exploiting Google NgramDiachronic Analysis of the Italian Language exploiting Google Ngram
Diachronic Analysis of the Italian Language exploiting Google NgramPierpaolo Basile
 
La macchina più geek dell’universo The Turing Machine
La macchina più geek dell’universo The Turing MachineLa macchina più geek dell’universo The Turing Machine
La macchina più geek dell’universo The Turing MachinePierpaolo Basile
 
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...Pierpaolo Basile
 
Building WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spacesBuilding WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spacesPierpaolo Basile
 
Analysing Word Meaning over Time by Exploiting Temporal Random Indexing
Analysing Word Meaning over Time by Exploiting Temporal Random IndexingAnalysing Word Meaning over Time by Exploiting Temporal Random Indexing
Analysing Word Meaning over Time by Exploiting Temporal Random IndexingPierpaolo Basile
 
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...Pierpaolo Basile
 
A Study on Compositional Semantics of Words in Distributional Spaces
A Study on Compositional Semantics of Words in Distributional SpacesA Study on Compositional Semantics of Words in Distributional Spaces
A Study on Compositional Semantics of Words in Distributional SpacesPierpaolo Basile
 
Sst evalita2011 basile_pierpaolo
Sst evalita2011 basile_pierpaoloSst evalita2011 basile_pierpaolo
Sst evalita2011 basile_pierpaoloPierpaolo Basile
 
AI*IA 2012 PAI Workshop OTTHO
AI*IA 2012 PAI Workshop OTTHOAI*IA 2012 PAI Workshop OTTHO
AI*IA 2012 PAI Workshop OTTHOPierpaolo Basile
 
Word Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessWord Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessPierpaolo Basile
 

Plus de Pierpaolo Basile (20)

Diachronic analysis of entities by exploiting wikipedia page revisions
Diachronic analysis of entities by exploiting wikipedia page revisionsDiachronic analysis of entities by exploiting wikipedia page revisions
Diachronic analysis of entities by exploiting wikipedia page revisions
 
Come l'industria tecnologica ha cancellato le donne dalla storia
Come l'industria tecnologica ha cancellato le donne dalla storiaCome l'industria tecnologica ha cancellato le donne dalla storia
Come l'industria tecnologica ha cancellato le donne dalla storia
 
EVALITA 2018 NLP4FUN - Solving language games
EVALITA 2018 NLP4FUN - Solving language gamesEVALITA 2018 NLP4FUN - Solving language games
EVALITA 2018 NLP4FUN - Solving language games
 
Buon appetito! Analyzing Happiness in Italian Tweets
Buon appetito! Analyzing Happiness in Italian TweetsBuon appetito! Analyzing Happiness in Italian Tweets
Buon appetito! Analyzing Happiness in Italian Tweets
 
Detecting semantic shift in large corpora by exploiting temporal random indexing
Detecting semantic shift in large corpora by exploiting temporal random indexingDetecting semantic shift in large corpora by exploiting temporal random indexing
Detecting semantic shift in large corpora by exploiting temporal random indexing
 
Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling
Bi-directional LSTM-CNNs-CRF for Italian Sequence LabelingBi-directional LSTM-CNNs-CRF for Italian Sequence Labeling
Bi-directional LSTM-CNNs-CRF for Italian Sequence Labeling
 
INSERT COIN - Storia dei videogame: da Spacewar a Street Fighter
INSERT COIN - Storia dei videogame: da Spacewar a Street FighterINSERT COIN - Storia dei videogame: da Spacewar a Street Fighter
INSERT COIN - Storia dei videogame: da Spacewar a Street Fighter
 
QuestionCube DigithON 2017
QuestionCube DigithON 2017QuestionCube DigithON 2017
QuestionCube DigithON 2017
 
Diachronic Analysis of the Italian Language exploiting Google Ngram
Diachronic Analysis of the Italian Language exploiting Google NgramDiachronic Analysis of the Italian Language exploiting Google Ngram
Diachronic Analysis of the Italian Language exploiting Google Ngram
 
Diachronic Analysis
Diachronic AnalysisDiachronic Analysis
Diachronic Analysis
 
(Open) data hacking
(Open) data hacking(Open) data hacking
(Open) data hacking
 
La macchina più geek dell’universo The Turing Machine
La macchina più geek dell’universo The Turing MachineLa macchina più geek dell’universo The Turing Machine
La macchina più geek dell’universo The Turing Machine
 
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...
UNIBA: Exploiting a Distributional Semantic Model for Disambiguating and Link...
 
Building WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spacesBuilding WordSpaces via Random Indexing from simple to complex spaces
Building WordSpaces via Random Indexing from simple to complex spaces
 
Analysing Word Meaning over Time by Exploiting Temporal Random Indexing
Analysing Word Meaning over Time by Exploiting Temporal Random IndexingAnalysing Word Meaning over Time by Exploiting Temporal Random Indexing
Analysing Word Meaning over Time by Exploiting Temporal Random Indexing
 
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
COLING 2014 - An Enhanced Lesk Word Sense Disambiguation Algorithm through a ...
 
A Study on Compositional Semantics of Words in Distributional Spaces
A Study on Compositional Semantics of Words in Distributional SpacesA Study on Compositional Semantics of Words in Distributional Spaces
A Study on Compositional Semantics of Words in Distributional Spaces
 
Sst evalita2011 basile_pierpaolo
Sst evalita2011 basile_pierpaoloSst evalita2011 basile_pierpaolo
Sst evalita2011 basile_pierpaolo
 
AI*IA 2012 PAI Workshop OTTHO
AI*IA 2012 PAI Workshop OTTHOAI*IA 2012 PAI Workshop OTTHO
AI*IA 2012 PAI Workshop OTTHO
 
Word Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information AccessWord Sense Disambiguation and Intelligent Information Access
Word Sense Disambiguation and Intelligent Information Access
 

Dernier

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 

Dernier (20)

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 

Exploiting Distributional Semantic Models in Question Answering

  • 1. Exploiting Distributional Semantic Models in Question Answering Piero Molino1,2, Pierpaolo Basile1,2, Annalina Caputo1, Pasquale Lops1, Giovanni Semeraro1 {basilepp@di.uniba.it} 1Dept. of Computer Science, University of Bari Aldo Moro 2QuestionCube s.r.l. 6th IEEE International Conference on Semantic Computing September 19-21, 2012 Palermo, Italy
  • 2. Background A bottle of Tesgüino is on the table. Everyone likes Tesgüino. Tesgüino makes you drunk. We make Tesgüino out of corn.
  • 3. Background A bottle of Tesgüino is on the table. Everyone likes Tesgüino. Tesgüino makes you drunk. We make Tesgüino out of corn. It is a corn beer
  • 4. Background You shall know a word by the company it keeps! Meaning of a word is determined by its usage 4
  • 5. Background Distributional Semantic Models (DSMs) – represent words as points in a geometric space – do not require specific text operations – corpus/language independent Widely used in IR and Computational Linguistic – semantic text similarity, synonyms detection, query expansion, … Not directly used in Question Answering (QA) 5
  • 6. Our idea Using DSMs into a QA system – QuestionCube: a framework for QA Exploit DSMs – compare semantic similarity between user’s question and candidate answer
  • 7. Our idea Using DSMs into a QA system – QuestionCube: a framework for QA Exploit DSMs – compare semantic similarity between user’s question and candidate answer Q: Which beverages contain alcohol? A: Tesgüino makes you drunk.
  • 8. Outline • Introduction to the QA problem • Distributional Semantic Models – semantic composition • DSMs into QuestionCube • Evaluation and results • Final remark
  • 9. QA vs. IR Question Answering Information Retrieval Information need Natural language question Keywords Factoid answer or short Result Whole documents passages Searching manually inside Information acquisition Reading answers documents
  • 11. Distributional Semantic Models… Defined as: <T, C, R, W, M, d, S> – T: target elements (words) – C: context – R: relation between T and C – W: weighting schema – M: geometric space TxC – d: space reduction M -> M’ – S: similarity function defined in M’
  • 12. …Distributional Semantic Models Term-term co-occurrence matrix: each cell contains the co-occurrences between two terms within a prefixed distance dog cat computer animal mouse dog 0 4 0 2 1 cat 4 0 0 3 5 computer 0 0 0 0 3 animal 2 3 0 0 2 mouse 1 5 3 2 0
  • 13. …Distributional Semantic Models TTM: term-term co-occurrence matrix Latent Semantic Analysis (LSA): relies on the Singular Value Decomposition (SVD) of the co- occurrence matrix Random Indexing (RI): based on the Random Projection Latent Semantic Analysis over Random Indexing (LSARI)
  • 14. Latent Semantic Analysis Given M the TTM matrix – apply the SVD decomposition M=UΣVT • Σ is the matrix of singular values – choose k singular values from Σ and approximate the matrix M • M*=U*Σ*V*T – SVD • discover high-order relations between terms • reduce the sparseness of the original matrix
  • 15. Random Indexing 1. Generate and assign a context vector to each context element (e.g. document, passage, term, …) 2. Term vector is the sum of the context vectors in which the term occurs – sometimes the context vector could be boosted by a score (e.g. term frequency, PMI, …) 15
  • 16. Context Vector 0 0 0 0 0 0 0 -1 0 0 0 0 1 0 0 -1 0 1 0 0 0 0 1 0 0 0 0 -1 • sparse • high dimensional • ternary {-1, 0, +1} • small number of randomly distributed non- zero elements 16
  • 17. Random Indexing (formal) B =A R n,k n,m m,k k << m B nearly preserves the distance between points (Johnson-Lindenstrauss lemma) dr = c× d RI is a locality-sensitive hashing method which approximate the cosine distance between vectors 17
  • 18. Random Indexing (example) CVjohn -> (0, 0, 0, 0, 0, 0, 1, 0, -1, 0) John eats a red apple CVeat -> (1, 0, 0, 0, -1, 0, 0 ,0 ,0 ,0) CVred -> (0, 0, 0, 1, 0, 0, 0, -1, 0, 0) TVapple= CVjohn+ CVeat+ CVred=(1, 0, 0, 1, -1, 0, 1, -1, -1, 0) 18
  • 19. Latent Semantic Analysis over Random Indexing 1. Reduce the dimension of the co-occurrences matrix using RI 2. Perform LSA over RI (LSARI) – reduction of LSA computation time: RI matrix contains less dimensions than co-occurrences matrix
  • 21. QuestionCube framework… Natural Language pipeline for Italian and English – PoS-tagging, lemmatization, Name Entity Recognition, Phrase Chunking, Dependency Parsing, Word Sense Disambiguation (WSD) IR model based on classical VSM or BM25 – several query expansion strategies Passage retrieval
  • 22. …QuestionCube framework Question Passage Filter Question Term filter analysis Ranked list of N-gram filter passages Question expansion and Density filter categorization Candidate Sequence passages filter Document retrieval Score norm Index
  • 24. Integration Question Passage Filter Question Term filter analysis Ranked list of N-gram filter passages Question expansion and Density filter categorization Candidate Sequence passages filter Distributional Document filter retrieval New filter Score norm based on DSMs Index
  • 25. Distributional filter… Compute the similarity between the user’s question and each candidate answer Both user’s question and candidate answer are represented by more than one term – a method to compose words is necessary
  • 26. …Distributional filter… candidate Question candidate answer1 answer3 candidate candidate answer2 answer5 Distributional candidate re-ranking candidate filter answer3 answer1 DSM compositional approach candidate candidate answern answern
  • 27. …Distributional filter (compositional approach) Addition (+): pointwise sum of components – represent complex structures by summing words which compose them Given q=q1 q2 … qn and a=a1 a2 … am     – question: q q1 q2  qn     – candidate answer: a a1 a2  am   q a – similarity sim ( q , a )   q a
  • 29. Evaluation… Dataset: 2010 CLEF QA Competition – 10,700 documents • from European Union legislation and European Parliament transcriptions – 200 questions – Italian and English DSMs – 1,000 vector dimension (TTM/LSA/RI/RILSA) – 50,000 most frequent words – co-occurrence distance: 4 29
  • 30. ...Evaluation 1. Effectiveness of DSMs in QuestionCube 2. Comparison between the several DSMs adopted Metrics – a@n: accuracy taking into account only the first n answers N 1 – MRR based on the rank rank of the correct answer i 1 i N 30
  • 31. Evaluation (alone scenario) Only distributional filter Passage Filter is adopted: no other Term filter filters in the pipeline N-gram filter Density filter Sequence filter Distributional filter Score norm 31
  • 32. Evaluation (combined) Distributional filter is Passage Filter combined with the other Term filter filters – using CombSum function N-gram filter – baseline: distributional Density filter filter is removed Sequence filter Distributional filter Score norm 32
  • 33. Results (English)… Run a@1 a@5 a@10 a@30 MRR alone TTM .060 .145 .215 .345 .107 RI .180 .370 .425 .535 .267 LSA .205 .415 .490 .600 .300 LSARI .190 .405 .490 .620 .295 combined baseline* .445 .635 .690 .780 .549 TTM .535 .715 .775 .810 .6141 RI .550 .730 .785 .870 .6371,2 LSA .560 .725 .790 .855 .6371 LSARI .555 .730 .790 .870 .6341 1significat wrt. baseline 2significat wrt. TTM *without distributional filter
  • 34. …Results (Italian) Run a@1 a@5 a@10 a@30 MRR alone TTM .060 .140 .175 .280 .097 RI .175 .305 .385 .465 .241 LSA .155 .315 .390 .480 .229 LSARI .180 .335 .400 .500 .254 combined baseline* .365 .530 .630 .715 .441 TTM .405 .565 .645 .740 .5391 RI .465 .645 .720 .785 .5551 LSA .470 .645 .690 .785 .5511 LSARI .480 .635 .690 .785 .5571,2 1significat wrt. baseline 2significat wrt. TTM *without distributional filter
  • 35. Final remarks… Alone: all the proposed DSMs are better than the TTM – in particular LSA and LSARI Combined: all the combinations are able to overcome the baseline – English +16% (RI/LSA), Italian +26% (LSARI) Slight difference in performance between LSA and LSARI
  • 36. …Final remarks Distributional filter in the alone scenario shows an higher improvement than in the combination scenario – find more effective way to combine filters (learning to rank approach) Exploit more complex semantic compositional strategies
  • 37. Thank you for your attention! Questions? <basilepp@di.uniba.it>