Experiments with Segmentation Strategies for
Passage Retrieval in Audio-Visual Documents
Petra Galuščáková and Pavel Pecina
galuscakova@ufal.mff.cuni.cz
Institute of Formal and Applied Linguistics
Faculty of Mathematics and Physics
Charles University in Prague
4. 4. 2014
2
Information Retrieval
● Information Retrieval (IR) is a task which involves searching for documents relevant to a given query.
3
Speech Retrieval
● Speech Retrieval focuses on retrieval from audio-visual documents (recordings).
4
Speech Retrieval
● Speech Retrieval is often reduced to traditional Information Retrieval
● An Automatic Speech Recognition (ASR) system is applied to the audio track
5
Speech Retrieval
Problems
● Documents are long (e.g. whole TV programmes)
● Often unstructured
● Navigation in audio-visual recordings is time-consuming
● We need to retrieve relevant segments of full documents
● Possibility to browse the recordings using hyperlinks (links between passages)
→ Passage Retrieval
6
Passage Retrieval
● Splits texts into smaller units which then function as documents in the retrieval process
● Makes the retrieval process more precise
● May improve retrieval of full documents
● The segmentation is crucial for the quality of the retrieval
→ We focus on segmentation strategies
7
Segmentation Strategies
● Regular (Window-based)
  ● Segments of equal length with regular shift
  ● Claimed to be a very effective approach
● Similarity-based
  ● Measures similarity between neighbouring segments
● Lexical-chain-based
  ● Finds sequences of lexicographically related word occurrences
● Feature-based
  ● Employs machine learning methods to detect segment boundaries based on various features
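The similarity-based strategy can be illustrated with a minimal sketch. It is not the method used in the talk: the bag-of-words cosine measure, the fixed threshold, and the function names `cosine` and `similarity_boundaries` are all assumptions made for illustration.

```python
import math
from collections import Counter
from typing import List


def cosine(a: List[str], b: List[str]) -> float:
    """Cosine similarity between two bags of words."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca)
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_boundaries(blocks: List[List[str]], threshold: float) -> List[int]:
    """Return indices of blocks that start a new segment.

    A boundary is placed before block i+1 when it is less similar to
    block i than the (assumed) threshold.
    """
    return [i + 1 for i in range(len(blocks) - 1)
            if cosine(blocks[i], blocks[i + 1]) < threshold]


blocks = [
    ["exam", "course", "exam"],
    ["course", "exam", "grade"],
    ["football", "match", "goal"],
]
boundaries = similarity_boundaries(blocks, threshold=0.2)
print(boundaries)
```

The first two blocks share vocabulary and stay together, while the third has no words in common with the second and opens a new segment.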
8
Feature-based Segmentation
in Passage Retrieval
9
Experiments
Tasks Description
10
● MediaEval is a benchmarking initiative dedicated to the development, comparison, and improvement of strategies for processing and retrieving multimedia content.
  ● E.g., speech recognition, multimedia content analysis, music and audio analysis, social networks, geo-coordinates, …
● 2013 Similar Segments in Social Speech Task
● 2013 Search and Hyperlinking Task
11
Similar Segments in Social Speech (SSSS) Task
● Scenario:
  ● A new member (e.g., a new student) joins a community or organization (e.g., a university) which owns an archive of recorded conversations among its members
  ● The member wants to find information in the archive according to his or her interests
    – The student wants to find more segments similar to the ones he or she is interested in and browses the archive using hyperlinks in videos
● The main goal:
  ● To find segments similar to the given ones
12
Similar Segments in Social Speech Task Data
● Interviews between two speakers, recorded for this purpose (5 hours; university students’ community)
● Divided into training/test data
● Manual and ASR transcripts
● Manually indicated segments (1886 segments), manually grouped into similarity sets
● Query segment: specified by the timestamps of its beginning and end
● Queries: constructed by including all words lying within the boundaries of the query segments
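The query construction described above can be sketched as follows. The word-level timing format `(start, end, token)`, the function name `build_query`, and the sample transcript are assumptions for illustration; the slides do not specify how words that straddle a segment boundary are handled, so this sketch keeps only words lying fully inside.

```python
from typing import List, Tuple

# Assumed transcript layout: (start_sec, end_sec, token) for each word.
Word = Tuple[float, float, str]


def build_query(words: List[Word], seg_start: float, seg_end: float) -> str:
    """Join all tokens whose time span lies within [seg_start, seg_end]."""
    inside = [tok for (start, end, tok) in words
              if start >= seg_start and end <= seg_end]
    return " ".join(inside)


# Hypothetical time-aligned transcript.
transcript = [
    (0.0, 0.4, "so"),
    (0.5, 0.9, "which"),
    (1.0, 1.6, "classes"),
    (1.7, 2.0, "are"),
    (2.1, 2.5, "you"),
    (2.6, 3.1, "taking"),
]
query = build_query(transcript, seg_start=0.5, seg_end=2.5)
print(query)
```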
13
Search and Hyperlinking (SH) Task
● Scenario:
  ● A user wants to find a piece of information relevant to a given query in a collection of TV programmes (Search subtask)
  ● And then navigate through a large archive using hyperlinks to the retrieved segments (Hyperlinking subtask)
● The main goal of the Search subtask:
  ● Find passages relevant to a user’s interest given by a textual query in a large set of audio-visual recordings
14
Search and Hyperlinking Task Data
● TV programme recordings provided by the BBC (1697 hours)
● Subtitles and two ASR transcripts (LIMSI and LIUM)
● 4 training and 50 test queries
  ● Query text: e.g. Boris Johnson
  ● Visual cue: e.g. 2 men sitting opposite each other
● Metadata, synopsis, cast, detected shots, detected faces, visual concepts
15
Passage Retrieval Quality Evaluation
● Full document retrieval → Mean Reciprocal Rank (MRR)
  – RR = 1 / rank of the first correctly retrieved document
● Retrieval of the exact passages → MRRw and MGAP
● MRR-window (MRRw)
  – A retrieved starting point counts as correct only if it lies less than 60 seconds from the relevant starting point
● Mean Generalized Average Precision (MGAP)
  – The quality of the retrieved starting point is assessed according to its distance from the relevant starting point using a penalty function
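The two reciprocal-rank measures can be sketched as below. This is an illustrative reading of the definitions above, not the official evaluation script: the `(document_id, start_sec)` result format and the function names are assumptions, and the 60-second window is taken here as symmetric around the relevant starting point. MGAP is left out because the slides do not give its penalty function.

```python
from typing import List, Tuple

# Assumed result format: (document_id, start_sec) of a retrieved passage.
Result = Tuple[str, float]


def reciprocal_rank(ranked: List[Result], relevant_doc: str) -> float:
    """1 / rank of the first result from the relevant document, else 0."""
    for rank, (doc, _start) in enumerate(ranked, start=1):
        if doc == relevant_doc:
            return 1.0 / rank
    return 0.0


def reciprocal_rank_window(ranked: List[Result], relevant: Result,
                           window: float = 60.0) -> float:
    """1 / rank of the first result starting less than `window` seconds
    from the relevant starting point in the same document, else 0."""
    rel_doc, rel_start = relevant
    for rank, (doc, start) in enumerate(ranked, start=1):
        if doc == rel_doc and abs(start - rel_start) < window:
            return 1.0 / rank
    return 0.0


def mean(values: List[float]) -> float:
    """Average over queries gives MRR or MRRw."""
    return sum(values) / len(values) if values else 0.0


ranked = [("a", 10.0), ("b", 500.0), ("b", 130.0)]
relevant = ("b", 100.0)
rr = reciprocal_rank(ranked, "b")
rr_w = reciprocal_rank_window(ranked, relevant)
print(rr, rr_w)
```

In this example the correct document is found at rank 2, but the first starting point close enough to the relevant passage appears only at rank 3, so the windowed score is lower.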
16
Experiments
System Description
17
Baseline System
● We employ the Terrier IR toolkit
● Hiemstra language model
  ● Parameter set to 0.35 (importance of a query term in a document)
● Stopword removal, stemming
● Post-filtering of the answers
  ● Segments partially overlapping with either the query segment or a higher-ranked segment are removed from the list of results
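The post-filtering step can be sketched as follows. The `(document_id, start_sec, end_sec)` segment format and the function names are assumptions, and this version compares each candidate only against higher-ranked segments that were themselves kept; the slides do not say whether already-removed segments also block later ones.

```python
from typing import List, Optional, Tuple

# Assumed segment format: (document_id, start_sec, end_sec).
Segment = Tuple[str, float, float]


def overlaps(a: Segment, b: Segment) -> bool:
    """True if both segments are in the same document and share time."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def post_filter(ranked: List[Segment],
                query_segment: Optional[Segment] = None) -> List[Segment]:
    """Drop segments overlapping the query segment or a kept higher-ranked one."""
    kept: List[Segment] = []
    for seg in ranked:
        if query_segment is not None and overlaps(seg, query_segment):
            continue
        if any(overlaps(seg, higher) for higher in kept):
            continue
        kept.append(seg)
    return kept


ranked = [("d1", 0.0, 60.0), ("d1", 30.0, 90.0),
          ("d1", 60.0, 120.0), ("d2", 0.0, 60.0)]
filtered = post_filter(ranked, query_segment=("d2", 50.0, 70.0))
print(filtered)
```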
18
Window-based Segmentation
● Equally-long segments with a regular shift
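A minimal sketch of window-based segmentation follows. The slides do not state whether length and shift are measured in words or in seconds; this example assumes words, and the function name `window_segments` is hypothetical.

```python
from typing import List


def window_segments(words: List[str], length: int, shift: int) -> List[List[str]]:
    """Cut a word sequence into windows of `length` words, moving the
    window start by `shift` words each time (shift < length gives overlap)."""
    if length <= 0 or shift <= 0:
        raise ValueError("length and shift must be positive")
    segments = []
    start = 0
    while start < len(words):
        segments.append(words[start:start + length])
        if start + length >= len(words):
            break  # the last window already reaches the end of the text
        start += shift
    return segments


segments = window_segments("a b c d e f g".split(), length=4, shift=2)
print(segments)
```

With a shift smaller than the window length, neighbouring segments overlap, so a relevant passage is less likely to be split across a segment boundary.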
19
Feature-based Segmentation
● We identify possible segment boundaries (beginnings and ends)
● Model: J48 decision trees
● Training data available for the SSSS task
  ● Manually marked segments
● Binary classification problem
  ● For each word in the transcripts, we predict whether a segment boundary occurs after this word or not
  ● Classes: segment boundary and segment continuation
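To make the binary classification setup concrete, the sketch below derives one training label per word from manually marked segments. The word and segment formats, the label strings, and the function name `label_words` are assumptions; only segment ends are labelled here, and the classifier itself (J48 in the talk) is not shown.

```python
from typing import List, Tuple

Word = Tuple[float, float, str]   # assumed: (start_sec, end_sec, token)
Span = Tuple[float, float]        # assumed: (segment_start_sec, segment_end_sec)


def label_words(words: List[Word], segments: List[Span]) -> List[str]:
    """Label the last word inside each manual segment as 'boundary',
    and every other word as 'continuation'."""
    labels = ["continuation"] * len(words)
    for seg_start, seg_end in segments:
        inside = [i for i, (start, end, _tok) in enumerate(words)
                  if start >= seg_start and end <= seg_end]
        if inside:
            labels[inside[-1]] = "boundary"
    return labels


# Hypothetical time-aligned transcript and manual segments.
words = [
    (0.0, 0.4, "so"), (0.5, 0.9, "which"), (1.0, 1.6, "classes"),
    (1.7, 2.0, "are"), (2.1, 2.5, "you"), (2.6, 3.1, "taking"),
]
labels = label_words(words, segments=[(0.0, 1.6), (1.7, 3.1)])
print(labels)
```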
20
Features
● Cue words and tags (n-grams which frequently occur at the boundary, most informative n-grams) for segment beginning and end
  ● Segment beginnings: “I’m”, “the”, “are you”, “you have”, ...
  ● Segment ends: “good”, “interesting”, “lot”, ...
● Letter cases
● Length of the silence before the word
● Division given in transcripts (e.g., speech segments defined in the LIMSI transcripts)
● The output of the TextTiling algorithm
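A few of these features can be sketched as a per-word feature dictionary. This covers only unigram cue words, letter case, and silence length; the feature names, the word format, and the way cues are matched are assumptions, and the transcript-division and TextTiling features are omitted.

```python
from typing import Dict, List, Set, Tuple, Union

Word = Tuple[float, float, str]  # assumed: (start_sec, end_sec, token)

# Unigram cues taken from the examples on the slide; bigram cues are omitted.
BEGIN_CUES: Set[str] = {"i'm", "the"}
END_CUES: Set[str] = {"good", "interesting", "lot"}


def word_features(words: List[Word], i: int) -> Dict[str, Union[bool, float]]:
    """Features for deciding whether a segment boundary follows word i."""
    start, _end, token = words[i]
    prev_end = words[i - 1][1] if i > 0 else start
    next_token = words[i + 1][2] if i + 1 < len(words) else ""
    return {
        "is_end_cue": token.lower() in END_CUES,
        "next_is_begin_cue": next_token.lower() in BEGIN_CUES,
        "is_capitalized": token[:1].isupper(),
        "silence_before": round(start - prev_end, 3),
    }


words = [(0.0, 0.4, "That's"), (0.5, 0.9, "interesting"),
         (2.0, 2.3, "I'm"), (2.4, 2.8, "here")]
features = word_features(words, 1)
print(features)
```

Each dictionary would be one instance for the boundary classifier, paired with the word's boundary or continuation label.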
21
Feature-based Segmentation
Approaches
22
Experiments
Results
23
Similar Segments in Social Speech Task - Evaluation
● Best results are obtained by the feature-based segmentation into overlapping segments
● Manual gold-standard segmentation is outperformed by feature-based segmentation (MRRw score on the manual transcripts)
● Results on the manual transcripts are significantly better than on the ASR transcripts in all scores
24
Segmentation Model in the SH Task
● Training set used in the SH Search subtask is very small
● We apply the SSSS-trained models in the SH task
  ● Allows us to examine the possibility of creating a universal model for feature-based segmentation
● Potential problems:
  ● Different vocabulary (students’ dialogues vs. TV programmes)
  ● Different ASR systems may prefer different vocabulary
  ● Different distribution of silence, document structure
25
SH Task Evaluation
● Results are not as consistent as for the SSSS task
● They depend on the type of the transcript
● Feature-based approaches creating overlapping segments are effective when applied to the subtitles
26
Conclusion
27
Conclusion
● Information Retrieval, focus on speech data (Speech Retrieval)
● Focus on retrieval of exact relevant passages
● Importance of segmentation
● Experiments in the MediaEval benchmark
  ● Similar Segments in Social Speech Task (university student dialogues) and Search and Hyperlinking Task (BBC programmes)
● We applied window-based segmentation and three types of feature-based segmentation
28
Conclusion cont.
● Feature-based segmentation applied in the two tasks outperformed regular segmentation
  ● Regular segmentation is itself claimed to be a very effective approach
● The improvement in the SSSS Task was statistically significant on the manual (MRRw and MGAP measures) and ASR (MGAP measure) transcripts
● The results in the SH task were not so conclusive
  ● Some of the results (on the subtitles) are encouraging
29
Thank you
This research has been supported by the project AMALACH (grant no. DF12P01OVV022 of the program NAKI of the Ministry of Culture of the Czech Republic), the Czech Science Foundation (grant no. P103/12/G084), and the Charles University Grant Agency (grant no. 920913).

Contenu connexe

Similaire à Experiments with Feature-Based Segmentation for Passage Retrieval

Multimodal Features for Search and Hyperlinking of Video Content
Multimodal Features for Search and Hyperlinking of Video ContentMultimodal Features for Search and Hyperlinking of Video Content
Multimodal Features for Search and Hyperlinking of Video ContentPetra Galuscakova
 
CUNI at MediaEval 2013 Similar Segments in Social Speech Task
CUNI at MediaEval 2013 Similar Segments in Social Speech TaskCUNI at MediaEval 2013 Similar Segments in Social Speech Task
CUNI at MediaEval 2013 Similar Segments in Social Speech TaskPetra Galuscakova
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categoriesWarNik Chow
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationChamani Shiranthika
 
Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval
Penalty Functions for Evaluation Measures of Unsegmented Speech RetrievalPenalty Functions for Evaluation Measures of Unsegmented Speech Retrieval
Penalty Functions for Evaluation Measures of Unsegmented Speech RetrievalPetra Galuscakova
 
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRFEnd-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRFJayavardhan Reddy Peddamail
 
Audio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentAudio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentPetra Galuscakova
 
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 ReviewNatural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 Reviewchangedaeoh
 
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...krisztianbalog
 
A template based algorithm for automatic summarization and dialogue managemen...
A template based algorithm for automatic summarization and dialogue managemen...A template based algorithm for automatic summarization and dialogue managemen...
A template based algorithm for automatic summarization and dialogue managemen...eSAT Journals
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemIJTET Journal
 
Architecture of an ontology based domain-specific natural language question a...
Architecture of an ontology based domain-specific natural language question a...Architecture of an ontology based domain-specific natural language question a...
Architecture of an ontology based domain-specific natural language question a...IJwest
 
Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval forWaheeb Ahmed
 
2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT IntroductionRIILP
 
Keyphrase Extraction And Source Code Similarity Detection- A Survey
Keyphrase Extraction And Source Code Similarity Detection- A Survey Keyphrase Extraction And Source Code Similarity Detection- A Survey
Keyphrase Extraction And Source Code Similarity Detection- A Survey Nakul Sharma
 
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGAUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGIRJET Journal
 
Writing a scientific manuscript
Writing a scientific manuscriptWriting a scientific manuscript
Writing a scientific manuscriptMartin McMorrow
 

Similaire à Experiments with Feature-Based Segmentation for Passage Retrieval (20)

Multimodal Features for Search and Hyperlinking of Video Content
Multimodal Features for Search and Hyperlinking of Video ContentMultimodal Features for Search and Hyperlinking of Video Content
Multimodal Features for Search and Hyperlinking of Video Content
 
CUNI at MediaEval 2013 Similar Segments in Social Speech Task
CUNI at MediaEval 2013 Similar Segments in Social Speech TaskCUNI at MediaEval 2013 Similar Segments in Social Speech Task
CUNI at MediaEval 2013 Similar Segments in Social Speech Task
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translation
 
Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval
Penalty Functions for Evaluation Measures of Unsegmented Speech RetrievalPenalty Functions for Evaluation Measures of Unsegmented Speech Retrieval
Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval
 
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRFEnd-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
 
Audio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV ContentAudio Information for Hyperlinking of TV Content
Audio Information for Hyperlinking of TV Content
 
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 ReviewNatural Language Generation / Stanford cs224n 2019w lecture 15 Review
Natural Language Generation / Stanford cs224n 2019w lecture 15 Review
 
team10.ppt.pptx
team10.ppt.pptxteam10.ppt.pptx
team10.ppt.pptx
 
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
 
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
 
A template based algorithm for automatic summarization and dialogue managemen...
A template based algorithm for automatic summarization and dialogue managemen...A template based algorithm for automatic summarization and dialogue managemen...
A template based algorithm for automatic summarization and dialogue managemen...
 
Ontology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval SystemOntology Based Approach for Semantic Information Retrieval System
Ontology Based Approach for Semantic Information Retrieval System
 
Architecture of an ontology based domain-specific natural language question a...
Architecture of an ontology based domain-specific natural language question a...Architecture of an ontology based domain-specific natural language question a...
Architecture of an ontology based domain-specific natural language question a...
 
Filling the gaps
Filling the gapsFilling the gaps
Filling the gaps
 
Answer extraction and passage retrieval for
Answer extraction and passage retrieval forAnswer extraction and passage retrieval for
Answer extraction and passage retrieval for
 
2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction2. Constantin Orasan (UoW) EXPERT Introduction
2. Constantin Orasan (UoW) EXPERT Introduction
 
Keyphrase Extraction And Source Code Similarity Detection- A Survey
Keyphrase Extraction And Source Code Similarity Detection- A Survey Keyphrase Extraction And Source Code Similarity Detection- A Survey
Keyphrase Extraction And Source Code Similarity Detection- A Survey
 
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSINGAUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
AUTOMATIC QUESTION GENERATION USING NATURAL LANGUAGE PROCESSING
 
Writing a scientific manuscript
Writing a scientific manuscriptWriting a scientific manuscript
Writing a scientific manuscript
 

Plus de Petra Galuscakova

Combining Evidence for Cross-language Information Retrieval
Combining Evidence for Cross-language Information RetrievalCombining Evidence for Cross-language Information Retrieval
Combining Evidence for Cross-language Information RetrievalPetra Galuscakova
 
Multimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentMultimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentPetra Galuscakova
 
Czech Malach Cross-lingual Speech Retrieval Test Collection
Czech Malach Cross-lingual Speech Retrieval Test CollectionCzech Malach Cross-lingual Speech Retrieval Test Collection
Czech Malach Cross-lingual Speech Retrieval Test CollectionPetra Galuscakova
 
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkach
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkachEvaluácia tematického vyhľadávania v audiovizuálnych nahrávkach
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkachPetra Galuscakova
 
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmi
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmiČesko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmi
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmiPetra Galuscakova
 
Application of Topic Segmentation in Audiovisual Information Retrieval
Application of Topic Segmentation in Audiovisual Information RetrievalApplication of Topic Segmentation in Audiovisual Information Retrieval
Application of Topic Segmentation in Audiovisual Information RetrievalPetra Galuscakova
 

Plus de Petra Galuscakova (6)

Combining Evidence for Cross-language Information Retrieval
Combining Evidence for Cross-language Information RetrievalCombining Evidence for Cross-language Information Retrieval
Combining Evidence for Cross-language Information Retrieval
 
Multimodal Features for Linking Television Content
Multimodal Features for Linking Television ContentMultimodal Features for Linking Television Content
Multimodal Features for Linking Television Content
 
Czech Malach Cross-lingual Speech Retrieval Test Collection
Czech Malach Cross-lingual Speech Retrieval Test CollectionCzech Malach Cross-lingual Speech Retrieval Test Collection
Czech Malach Cross-lingual Speech Retrieval Test Collection
 
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkach
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkachEvaluácia tematického vyhľadávania v audiovizuálnych nahrávkach
Evaluácia tematického vyhľadávania v audiovizuálnych nahrávkach
 
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmi
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmiČesko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmi
Česko-slovenský paralelný korpus určený pre preklad medzi blízkymi jazykmi
 
Application of Topic Segmentation in Audiovisual Information Retrieval
Application of Topic Segmentation in Audiovisual Information RetrievalApplication of Topic Segmentation in Audiovisual Information Retrieval
Application of Topic Segmentation in Audiovisual Information Retrieval
 

Dernier

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Dernier (20)

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

Experiments with Feature-Based Segmentation for Passage Retrieval

  • 1. Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visual Documents Petra Galuščáková and Pavel Pecina galuscakova@ufal.mff.cuni.cz Institute of Formal and Applied Linguistics Faculty of Mathematics and Physics Charles University in Prague 4. 4. 2014
  • 2. 2 Information Retrieval ● Information Retrieval (IR) is a task which involves searching for documents relevant to a given query.
  • 3. 3 Speech Retrieval ● Speech Retrieval focuses on retrieval from audio-visual documents (recordings).
  • 4. 4 Speech Retrieval ● Speech Retrieval is often converted on traditional Information Retrieval ● Automatic Speech Recognition (ASR) system applied to the audio track
  • 5. 5 Speech Retrieval Problems ● Documents are long (e.g. whole TV programmes) ● Often unstructured ● Navigation in audio-visual recordings is time consuming ● We need to retrieve relevant segments of full documents ● Possibility to browse the recordings using hyperlinks (links between passages) → Passage Retrieval
  • 6. 6 Passage Retrieval ● Splits texts into smaller units which then function as documents in the retrieval process ● Makes the retrieval process more precise ● May improve retrieval of full documents ● The segmentation is crucial for the quality of the retrieval → We focus on segmentation strategies
  • 7. 7 Segmentation Strategies ● Regular (Window-based) ● Segments of equal length with regular shift ● Claimed to be a very effective approach ● Similarity-based ● Measures similarity between neighbouring segments ● Lexical-chain-based ● Finds sequences of lexicographically related word occurrences ● Feature-based ● Employs machine learning methods to detect segment boundaries based on various features
  • 10. 10 ● MediaEval is a benchmarking initiative dedicated to development, comparison, and improvement of strategies for processing and retrieving multimedia content. ● E.g., speech recognition, multimedia content analysis, music and audio analysis, social networks, geo-coordinates, … ● 2013 Similar Segments in Social Speech Task ● 2013 Search and Hyperlinking Task
  • 11. 11 Similar Segments in Social Speech (SSSS) Task ● Scenario: ● A new member (e.g., a new student) joins a community or organization (e.g., a university), which owns an archive of recorded conversations among its members ● A member wants to find information according to his or her interest in the archive – The student wants to find more segments similar to the ones he or she is interested in and browses the archive using hyperlinks in videos ● The main goal: ● To find segments similar to the given ones
  • 12. 12 Similar Segments in Social Speech Task Data ● Purpose-recorded interviews (5 hours) of two speakers (university students’ community) ● Divided into training/test data ● Manual and ASR transcripts ● Manually indicated segments (1886 segments), manually grouped into similarity sets ● Query segment: specified by the timestamps of its beginning and end ● Queries: constructed by including all words lying within the boundaries of the query segments
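The query construction described on this slide can be sketched as follows, assuming the transcript provides word-level timestamps. The `Word` type and `build_query` function are hypothetical names, and treating a word as "within the boundaries" only when it lies entirely inside the segment is also an assumption.

```python
from typing import List, NamedTuple


class Word(NamedTuple):
    text: str
    start: float  # seconds from the beginning of the recording
    end: float


def build_query(transcript: List[Word], seg_start: float, seg_end: float) -> str:
    """Join all words that lie entirely within [seg_start, seg_end]."""
    return " ".join(
        w.text for w in transcript
        if w.start >= seg_start and w.end <= seg_end
    )
```

The resulting string can then be passed to the retrieval system like any textual query.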
  • 13. 13 Search and Hyperlinking (SH) Task ● Scenario: ● A user wants to find a piece of information relevant to a given query in a collection of TV programmes (Search subtask) ● And then navigate through a large archive using hyperlinks to the retrieved segments (Hyperlinking subtask) ● The main goal of the Search Subtask ● Find passages relevant to a user’s interest given by a textual query in a large set of audio-visual recordings
  • 14. 14 Search and Hyperlinking Task Data ● TV programme recordings provided by BBC (1697 hours) ● Subtitles and two ASR transcripts (LIMSI and LIUM) ● 4 training and 50 test queries ● Query text: e.g. Boris Johnson ● Visual cue: e.g. 2 men sitting opposite each other ● Metadata, synopsis, cast, detected shots, detected faces, visual concepts
  • 15. 15 Passage Retrieval Quality Evaluation ● Full document retrieval → Mean Reciprocal Rank (MRR) – RR = 1 / rank of the first correctly retrieved document ● Retrieval of the exact passages → MRRw and MGAP ● MRR-window (MRRw) – A retrieved starting point counts as correct only if it lies less than 60 seconds from a relevant starting point ● Mean Generalized Average Precision (MGAP) – The quality of the retrieved starting point is assessed according to its distance from the relevant starting point using a penalty function
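The reciprocal rank and its windowed variant can be sketched as below. The data layout (a passage as a document id plus a starting point in seconds) and the function names are assumptions. MGAP is left out because the slide does not specify its penalty function.

```python
from typing import Sequence, Tuple

Passage = Tuple[str, float]  # (document id, starting point in seconds)


def reciprocal_rank(hits: Sequence[bool]) -> float:
    """1 / rank of the first correct result, or 0.0 if none is correct."""
    for rank, hit in enumerate(hits, start=1):
        if hit:
            return 1.0 / rank
    return 0.0


def rr_window(ranked: Sequence[Passage],
              relevant: Sequence[Passage],
              window: float = 60.0) -> float:
    """Reciprocal rank where a result is correct only if its starting point
    lies less than `window` seconds from a relevant starting point in the
    same document."""
    hits = [
        any(doc == rel_doc and abs(start - rel_start) < window
            for rel_doc, rel_start in relevant)
        for doc, start in ranked
    ]
    return reciprocal_rank(hits)


def mean(values: Sequence[float]) -> float:
    """Average the per-query scores to obtain MRR or MRRw."""
    return sum(values) / len(values) if values else 0.0
```

For full-document MRR, `reciprocal_rank` would be fed a list of flags saying whether each retrieved document is relevant, and `mean` averages the per-query values.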
  • 17. 17 Baseline System ● We employ the Terrier IR toolkit ● Hiemstra language model ● Parameter set to 0.35 (importance of a query term in a document) ● Stopword removal, stemming ● Post-filtering of the answers ● The segments partially overlapping with either the query segment or a higher ranked segment are removed from the list of results
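The two main ingredients of the baseline can be sketched in plain Python. This is not Terrier's code: `hiemstra_score` uses the standard rank-equivalent form of the Hiemstra language model, under the assumption that the 0.35 on the slide is its interpolation weight, and `post_filter` assumes that only segments kept so far (plus the query segment) can block a lower-ranked one. All names are hypothetical.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

Segment = Tuple[str, float, float]  # (document id, start, end) in seconds


def hiemstra_score(query_terms: Sequence[str],
                   doc_tf: Dict[str, int], doc_len: int,
                   coll_tf: Dict[str, int], coll_len: int,
                   lam: float = 0.35) -> float:
    """Sum over query terms of log(1 + lam*tf*|C| / ((1-lam)*cf*|d|))."""
    score = 0.0
    for term in query_terms:
        tf = doc_tf.get(term, 0)
        cf = coll_tf.get(term, 0)
        if tf == 0 or cf == 0:
            continue  # a term missing from the document contributes log(1) = 0
        score += math.log(1.0 + (lam * tf * coll_len) / ((1.0 - lam) * cf * doc_len))
    return score


def overlaps(a: Segment, b: Segment) -> bool:
    """True if both segments come from the same document and share time."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def post_filter(ranked: List[Segment],
                query_segment: Optional[Segment] = None) -> List[Segment]:
    """Drop every segment overlapping the query segment or a kept,
    higher-ranked segment; `ranked` must be sorted best first."""
    blockers = [query_segment] if query_segment is not None else []
    kept: List[Segment] = []
    for seg in ranked:
        if any(overlaps(seg, other) for other in blockers + kept):
            continue
        kept.append(seg)
    return kept
```

Segments that merely touch at an endpoint are not treated as overlapping in this sketch.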
  • 18. 18 Window-based Segmentation ● Equally-long segments with a regular shift
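Window-based segmentation can be sketched as a sliding window over the timeline of a recording. The 60-second length and 30-second shift are placeholder values, not the settings used in the experiments, and clipping the final windows at the end of the recording is an assumption.

```python
from typing import List, Tuple


def window_segments(duration: float,
                    length: float = 60.0,
                    shift: float = 30.0) -> List[Tuple[float, float]]:
    """Cut [0, duration] into windows of `length` seconds starting every
    `shift` seconds; windows running past the end are clipped."""
    if length <= 0 or shift <= 0:
        raise ValueError("length and shift must be positive")
    segments = []
    start = 0.0
    while start < duration:
        segments.append((start, min(start + length, duration)))
        start += shift
    return segments
```

A shift smaller than the length yields overlapping segments, while a shift equal to the length yields a non-overlapping partition.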
  • 19. 19 Feature-based Segmentation ● We identify possible segment boundaries (beginnings and ends) ● Model: J48 decision trees ● Training data available for the SSSS task ● Manually marked segments ● Binary classification problem ● For each word in the transcripts, we predict whether a segment boundary occurs after this word or not ● Classes: segment boundary and segment continuation
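Training such a classifier needs one label per word. A minimal sketch of how the labels might be derived from manually marked boundary timestamps is given below, assuming word-level timestamps in temporal order; `boundary_labels` is a hypothetical helper, and assigning each boundary to the last word that ends at or before it is an assumption.

```python
import bisect
from typing import List, Sequence, Tuple


def boundary_labels(word_times: Sequence[Tuple[float, float]],
                    boundaries: Sequence[float]) -> List[int]:
    """Return 1 for each word followed by a segment boundary, else 0.

    `word_times` holds (start, end) per word in temporal order;
    `boundaries` holds the timestamps of marked segment beginnings and ends.
    """
    ends = [end for _, end in word_times]
    labels = [0] * len(ends)
    for b in boundaries:
        i = bisect.bisect_right(ends, b) - 1  # last word ending at or before b
        if i >= 0:
            labels[i] = 1
    return labels
```

The label vector, paired with per-word features, would form the training set for a decision tree learner such as J48, Weka's implementation of C4.5.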
  • 20. 20 Features ● Cue words and tags (n-grams which frequently occur at the boundary, most informative n-grams) for segment beginning and end ● Segment beginnings: “I’m”, “the”, “are you”, “you have”, ... ● Segment ends: “good”, “interesting”, “lot”, ... ● Letter cases ● Length of the silence before the word ● Division given in transcripts (e.g., speech segments defined in the LIMSI transcripts) ● The output of the TextTiling algorithm
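Some of the listed features can be computed per word as in the sketch below. It covers only single-word cues, letter case and the preceding silence; multi-word cues, tags, the transcript division and the TextTiling output are omitted. The feature names and the `word_features` function are hypothetical.

```python
from typing import Dict, List, Sequence, Set, Tuple

Word = Tuple[str, float, float]  # (token, start, end) in seconds


def word_features(words: Sequence[Word],
                  begin_cues: Set[str],
                  end_cues: Set[str]) -> List[Dict[str, float]]:
    """Build one feature dictionary per word; cue sets must be lowercased."""
    rows = []
    for i, (token, start, _end) in enumerate(words):
        prev_end = words[i - 1][2] if i > 0 else start
        next_token = words[i + 1][0] if i + 1 < len(words) else ""
        rows.append({
            # the word itself is a typical segment-final cue
            "is_end_cue": float(token.lower() in end_cues),
            # the following word is a typical segment-initial cue
            "next_is_begin_cue": float(next_token.lower() in begin_cues),
            # letter case of the word
            "is_capitalized": float(token[:1].isupper()),
            # length of the silence before the word, in seconds
            "silence_before": max(0.0, start - prev_end),
        })
    return rows
```

Each dictionary corresponds to one classification instance, i.e. one word for which the model predicts whether a boundary follows.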
  • 23. 23 Similar Segments in Social Speech Task - Evaluation ● Best results are obtained by the feature-based segmentation into overlapping segments ● Manual gold-standard segmentation is outperformed by feature-based segmentation (MRRw score on the manual transcripts) ● Results on manual transcripts are significantly better than on ASR transcripts in all scores
  • 24. 24 Segmentation Model in the SH Task ● Training set used in the SH Search Subtask is very small ● We apply the SSSS-trained models in the SH task ● Allows us to examine the possibility of creating a universal model for feature-based segmentation ● Potential problems: ● Different vocabulary (students' dialogues vs. TV programmes) ● Different ASR systems may prefer different vocabulary ● Different distribution of silence, document structure
  • 25. 25 SH Task Evaluation ● Results are not as consistent as for the SSSS task ● They depend on the type of the transcript ● Feature-based approaches creating overlapping segments are effective when applied to the subtitles
  • 27. 27 Conclusion ● Information Retrieval, focus on speech data (Speech Retrieval) ● Focus on retrieval of exact relevant passages ● Importance of segmentation ● Experiments in the MediaEval benchmark ● Similar Segments in Social Speech Task (university student dialogues) and Search and Hyperlinking Task (BBC programmes) ● We applied window-based segmentation and three types of feature-based segmentations
  • 28. 28 Conclusion cont. ● Feature-based segmentation applied in the two tasks outperformed regular segmentation ● Regular segmentation is claimed to be a very effective approach ● The improvement in the SSSS Task was statistically significant on the manual (MRRw and MGAP measures) and ASR (MGAP measure) transcripts ● The results in the SH task were not so conclusive ● Some of the results (on the subtitles) are encouraging
  • 29. 29 Thank you ● This research has been supported by the project AMALACH (grant no. DF12P01OVV022 of the program NAKI of the Ministry of Culture of the Czech Republic), the Czech Science Foundation (grant no. P103/12/G084), and the Charles University Grant Agency (grant no. 920913).