SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
A Composite Kernel Approach for Dialog Topic Tracking with
Structured Domain Knowledge from Wikipedia
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Human Language Technology Department, Institute for Infocomm Research, Singapore
Introduction
Most previous dialog systems have focused only on a single
target task on a single topic and domain.
Some researchers have tried to identify dialog topics with text
categorization or external knowledge-based approaches.
In this work, we propose a composite kernel approach for
dialog topic tracking with structured domain knowledge from
Wikipedia.
Dialog Topic Tracking
A classification problem f(xt) = (yt−1, yt)
xt contains the input features obtained at a turn t
yt ∈ C, where C is a closed set of topic categories.
If a topic transition occurs at t, yt should be different from yt−1.
Otherwise, both yt and yt−1 have the same value.
Examples of dialog topic tracking
t Speaker Utterance Topic Transition
0 Guide How can I help you? NONE→NONE
1
Tourist Can you recommend some good places to visit in Singapore?
NONE→ATTR
Guide Well if you like to visit an icon of Singapore, Merlion park will be a nice place
to visit.
2
Tourist Merlion is a symbol for Singapore, right?
ATTR→ATTR
Guide Yes, we use that to symbolise Singapore.
3
Tourist Okay.
ATTR→ATTR
Guide The lion head symbolised the founding of the island and the fish body just
symbolised the humble fishing village.
4
Tourist How can I get there from Orchard Road?
ATTR→TRSP
Guide You can take the north-south line train from Orchard Road and stop at Raffles
Place station.
5
Tourist Is this walking distance from the station to the destination?
TRSP→TRSP
Guide Yes, it’ll take only ten minutes on foot.
6
Tourist Alright.
TRSP→FOOD
Guide Well, you can also enjoy some seafoods at the riverside near the place.
7
Tourist What food do you have any recommendations to try there?
FOOD→FOOD
Guide If you like spicy foods, you must try chilli crab which is one of our favourite
dishes here in Singapore.
8 Tourist Great! I’ll try that. FOOD→FOOD
f detects three topic transitions at t1, t4, t6
Topic sequence: Attraction→Transportation→Food
Wikipedia-based Dialog Topic Tracking
Supervised machine learning
With the training examples annotated with topic labels
Based on the information obtained solely from an ongoing dialog
Not sufficient to identify not only user-initiative, but also system-initiative
topic transitions
Wikipedia-based dialog topic tracking
To leverage on Wikipedia as an external knowledge source
Obtained without significant effort toward building resources
Wikipedia-based Composite Kernel
Composite kernel for dialog topic tracking
With relevant Wikipedia paragraphs selected using cosine similarity
sim (x, pi) =
φ(x) · φ(pi)
|φ(x)||φ(pi)|
The term vector φ(x) is computed by accumulating the weights in the
previous turns
φ(x) = α1, α2, · · · , α|W| ∈ R|W|
Two kernel structures are constructed using the information from the
highly-ranked paragraphs
History Sequence Kernel
A sequence of the most similar paragraphs
S = (s0, · · · , st), where sj = argmaxi sim xj, pi
Sub-sequence kernel
Ks(S1, S2) =
u∈An i:u=S1[i] j:u=S2[j]
λl(i)+l(j)
Domain Context Tree Kernel
A tree structure incorporates various types of domain
knowledge from Wikipedia
Subset tree kernel
Kt(T1, T2) =
n1∈NT1
n2∈NT2
(n1, n2) ,
Kernel Composition
Linear combination of three kernels
K(x1, x2) = α · Kl(V1, V2) + β · Ks(S1, S2) + γ · Kt(T1, T2)
Experimental Setup
Singapore tour guide dialogs
Human-human mixed initiative dialogs
35 sessions, 21 hours, 19,651 utterances
Manually annotated with nine topic categories
Wikipedia collection
3,155 articles related to Singapore
Collected from Wikipedia database dump as of February 2013
Models
Baselines: Kl only, Kl with P as features
Proposed approaches: Kl + Ks, Kl + Kt, Kl + Ks + Kt.
Measures
5-fold cross validation to the manual annotations
Accuracy of the predicted topic label for every turn
Precision/Recall/F-measure for each event of topic transition occurred
Experimental Results
Comparison of the performances among the models
Turn-level Transition-level
Accuracy P R F
Kl 62.45 42.77 24.77 31.37
Kl + P 62.44 42.76 24.77 31.37
Kl + Ks 67.19 39.94 40.59 40.26
Kl + Kt 68.54 45.55 35.69 40.02
Kl + Ks + Kt 69.98 44.82 39.83 42.18
Error distibutions of topic transitions
0
500
1000
1500
2000
2500
3000
Kl Kl + P Kl + Ks Kl + Kt ALL
NumberofTransitionErrors
FP(SYS)
FN(SYS)
FP(USR)
FN(USR)
1 Fusionopolis Way, #21-01 Connexis (South Tower), Singapore 138632 Email: kims@i2r.a-star.edu.sg WWW: http://hlt.i2r.a-star.edu.sg/

Contenu connexe

Tendances

Tendances (6)

Link Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: EfficiencyLink Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: Efficiency
 
Announcement of the Course "Scientific Computing with Python: An Introduction...
Announcement of the Course "Scientific Computing with Python: An Introduction...Announcement of the Course "Scientific Computing with Python: An Introduction...
Announcement of the Course "Scientific Computing with Python: An Introduction...
 
A new technique for proving non regularity based on the measure of a language
A new technique for proving non regularity based on the measure of a languageA new technique for proving non regularity based on the measure of a language
A new technique for proving non regularity based on the measure of a language
 
形式言語理論への 測度論的アプローチ
形式言語理論への 測度論的アプローチ形式言語理論への 測度論的アプローチ
形式言語理論への 測度論的アプローチ
 
Theory of Computation
Theory of ComputationTheory of Computation
Theory of Computation
 
Subtyping
SubtypingSubtyping
Subtyping
 

En vedette (7)

Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog States
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic Tracking
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 

Similaire à A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia

Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Seokhwan Kim
 

Similaire à A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia (20)

Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataDedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
 
Introduction to Prolog
Introduction to PrologIntroduction to Prolog
Introduction to Prolog
 
Translated learning
Translated learningTranslated learning
Translated learning
 
Reading papers - survey on Non-Convex Optimization
Reading papers - survey on Non-Convex OptimizationReading papers - survey on Non-Convex Optimization
Reading papers - survey on Non-Convex Optimization
 
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and HyperlinkingDCU Search Runs at MediaEval 2014 Search and Hyperlinking
DCU Search Runs at MediaEval 2014 Search and Hyperlinking
 
Fosdem 2013 petra selmer flexible querying of graph data
Fosdem 2013 petra selmer   flexible querying of graph dataFosdem 2013 petra selmer   flexible querying of graph data
Fosdem 2013 petra selmer flexible querying of graph data
 
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
 
All good things
All good thingsAll good things
All good things
 
Jarrar: ORM in Description Logic
Jarrar: ORM in Description Logic  Jarrar: ORM in Description Logic
Jarrar: ORM in Description Logic
 
MOOCs, Two Years Later - Pierre Dillenbourg, Swiss Federal Institute of Techn...
MOOCs, Two Years Later - Pierre Dillenbourg, Swiss Federal Institute of Techn...MOOCs, Two Years Later - Pierre Dillenbourg, Swiss Federal Institute of Techn...
MOOCs, Two Years Later - Pierre Dillenbourg, Swiss Federal Institute of Techn...
 
transfer.pptx
transfer.pptxtransfer.pptx
transfer.pptx
 
Cuckoo Search Algorithm: An Introduction
Cuckoo Search Algorithm: An IntroductionCuckoo Search Algorithm: An Introduction
Cuckoo Search Algorithm: An Introduction
 
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
 
Artificial intelligence for Social Good
Artificial intelligence for Social GoodArtificial intelligence for Social Good
Artificial intelligence for Social Good
 
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
 
Sigir 2011 proceedings
Sigir 2011 proceedingsSigir 2011 proceedings
Sigir 2011 proceedings
 
Some Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBASome Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBA
 
Nlp history-thai-virach20191025
Nlp history-thai-virach20191025Nlp history-thai-virach20191025
Nlp history-thai-virach20191025
 

Plus de Seokhwan Kim

Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
Seokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
Seokhwan Kim
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
Seokhwan Kim
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
Seokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
Seokhwan Kim
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
Seokhwan Kim
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
Seokhwan Kim
 

Plus de Seokhwan Kim (12)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
 

Dernier

Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 

Dernier (20)

Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 

A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia

  • 1. A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia Seokhwan Kim, Rafael E. Banchs, Haizhou Li Human Language Technology Department, Institute for Infocomm Research, Singapore Introduction Most previous dialog systems have focused only on a single target task on a single topic and domain. Some researchers have tried to identify dialog topics with text categorization or external knowledge-based approaches. In this work, we propose a composite kernel approach for dialog topic tracking with structured domain knowledge from Wikipedia. Dialog Topic Tracking A classification problem f(xt) = (yt−1, yt) xt contains the input features obtained at a turn t yt ∈ C, where C is a closed set of topic categories. If a topic transition occurs at t, yt should be different from yt−1. Otherwise, both yt and yt−1 have the same value. Examples of dialog topic tracking t Speaker Utterance Topic Transition 0 Guide How can I help you? NONE→NONE 1 Tourist Can you recommend some good places to visit in Singapore? NONE→ATTR Guide Well if you like to visit an icon of Singapore, Merlion park will be a nice place to visit. 2 Tourist Merlion is a symbol for Singapore, right? ATTR→ATTR Guide Yes, we use that to symbolise Singapore. 3 Tourist Okay. ATTR→ATTR Guide The lion head symbolised the founding of the island and the fish body just symbolised the humble fishing village. 4 Tourist How can I get there from Orchard Road? ATTR→TRSP Guide You can take the north-south line train from Orchard Road and stop at Raffles Place station. 5 Tourist Is this walking distance from the station to the destination? TRSP→TRSP Guide Yes, it’ll take only ten minutes on foot. 6 Tourist Alright. TRSP→FOOD Guide Well, you can also enjoy some seafoods at the riverside near the place. 7 Tourist What food do you have any recommendations to try there? FOOD→FOOD Guide If you like spicy foods, you must try chilli crab which is one of our favourite dishes here in Singapore. 8 Tourist Great! I’ll try that. FOOD→FOOD f detects three topic transitions at t1, t4, t6 Topic sequence: Attraction→Transportation→Food Wikipedia-based Dialog Topic Tracking Supervised machine learning With the training examples annotated with topic labels Based on the information obtained solely from an ongoing dialog Not sufficient to identify not only user-initiative, but also system-initiative topic transitions Wikipedia-based dialog topic tracking To leverage on Wikipedia as an external knowledge source Obtained without significant effort toward building resources Wikipedia-based Composite Kernel Composite kernel for dialog topic tracking With relevant Wikipedia paragraphs selected using cosine similarity sim (x, pi) = φ(x) · φ(pi) |φ(x)||φ(pi)| The term vector φ(x) is computed by accumulating the weights in the previous turns φ(x) = α1, α2, · · · , α|W| ∈ R|W| Two kernel structures are constructed using the information from the highly-ranked paragraphs History Sequence Kernel A sequence of the most similar paragraphs S = (s0, · · · , st), where sj = argmaxi sim xj, pi Sub-sequence kernel Ks(S1, S2) = u∈An i:u=S1[i] j:u=S2[j] λl(i)+l(j) Domain Context Tree Kernel A tree structure incorporates various types of domain knowledge from Wikipedia Subset tree kernel Kt(T1, T2) = n1∈NT1 n2∈NT2 (n1, n2) , Kernel Composition Linear combination of three kernels K(x1, x2) = α · Kl(V1, V2) + β · Ks(S1, S2) + γ · Kt(T1, T2) Experimental Setup Singapore tour guide dialogs Human-human mixed initiative dialogs 35 sessions, 21 hours, 19,651 utterances Manually annotated with nine topic categories Wikipedia collection 3,155 articles related to Singapore Collected from Wikipedia database dump as of February 2013 Models Baselines: Kl only, Kl with P as features Proposed approaches: Kl + Ks, Kl + Kt, Kl + Ks + Kt. Measures 5-fold cross validation to the manual annotations Accuracy of the predicted topic label for every turn Precision/Recall/F-measure for each event of topic transition occurred Experimental Results Comparison of the performances among the models Turn-level Transition-level Accuracy P R F Kl 62.45 42.77 24.77 31.37 Kl + P 62.44 42.76 24.77 31.37 Kl + Ks 67.19 39.94 40.59 40.26 Kl + Kt 68.54 45.55 35.69 40.02 Kl + Ks + Kt 69.98 44.82 39.83 42.18 Error distibutions of topic transitions 0 500 1000 1500 2000 2500 3000 Kl Kl + P Kl + Ks Kl + Kt ALL NumberofTransitionErrors FP(SYS) FN(SYS) FP(USR) FN(USR) 1 Fusionopolis Way, #21-01 Connexis (South Tower), Singapore 138632 Email: kims@i2r.a-star.edu.sg WWW: http://hlt.i2r.a-star.edu.sg/