SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
WIKIPEDIA-BASED KERNELS
FOR DIALOGUE TOPIC TRACKING
Seokhwan Kim, Rafael E. Banchs, Haizhou Li
Human Language Technology Department
Institute for Infocomm Research (I2R)
6th May 2014
ICASSP, Florence, Italy
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 2
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 3
Motivation
• Spoken Dialogue Systems
– Next-generation User Interface
– The most natural way for human-human communication
• Single-task dialogues
– Most previous work focuses on single target task
• Eg. Flight Reservation, Bus Information Guide, Restaurant Booking
– Cause limitations in practical uses
• Multi-task dialogues
– [Lin et al. 1999, Ikeda et al. 2008, Celikyilmaz et al. 2011]
– Selecting the most probable system at each turn
– Each system is independently built and operated from others
Pg 4
Related Work
• Text categorization-based Dialogue Topic Identification
– [Nakata et al., 2002; Lagus&Kuusisto, 2002; Adams&Martell, 2008]
– Differences from written texts
• Determinations of topics
– User’s intentions
– System’s decisions
• Available features
– Unable to see the future turns
• Knowledge-based Dialogue Topic Suggestion
– External Knowledge Sources
• Eg. Domain Models, Heuristics, Agendas
• [Roy&Subramaniam, 2006; Young et al., 2007; Bohus&Rudnicky, 2003; Lee et al. 2008
– Limited flexibility
• To handle user-initiative cases
– High cost
• To build a sufficient amount of resources
Pg 5
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 6
Dialogue Topic Tracking
• Subtasks
– Dialogue Segmentation
• Segmenting a session into topically coherent sub-dialogues
– Topic Transition Identification
• Identifying the next topic category at each time of topic transition
Pg 7
Dialogue Topic Tracking
• Example
Pg 8
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 9
Wikipedia-based Kernel Method
• Vector Space Model
– The simplest approach to represent features for supervised machine
learning methods
– An instance for each turn  A weighted term vector
– Lack of semantic or domain-specific aspects
• Each word is considered as an independent and identical unit
Pg 10
Wikipedia-based Kernel Method
• Wikipedia for Dialogue Topic Tracking
– As an external knowledge source
– Without significant effort for building resources
– Previous work
• [Breuing et al., 2011; Wilcock, 2012]
• Focusing only on a single type of information from Wikipedia
• Wikipedia-based Kernel Method
– Aiming at incorporating various knowledge from Wikipedia
– To map the data into a higher dimensional feature space
• Vector Space Extension
• Vector Transformation
Pg 11
Wikipedia-based Kernel Method
• Vector Extension
Pg 12
Term Vector Concept Vector
…
U: ---------------------------
S: ---------------------------
U: ---------------------------
S: ---------------------------
U: ---------------------------
S: ---------------------------
…
x
β1
β2
Β|D|
d1
d2
d|D|
⁞
Wikipedia-based Kernel Method
• Vector Transformation
– Each extended vector is transformed into a new space
– Transformation Matrix S
– s(di, dj) is the relatedness between di and dj
– Update of Concept Vector Values
Pg 13
Wikipedia-based Kernel Method
Pg 14
Measures of Contextual Relatedness
• How to compute s(di, dj)?
• Category Relatedness
– Based on hierarchical structures of Wikipedia categories
• depth(d): the length of the path from the root node to d
• lcs(di, dj): the least common subsume of the two articles in the hierarchy
Pg 15
Measures of Contextual Relatedness
• Category Overlap Score
– Based on the ratio of common categories of two concepts
– By Jaccard’s coefficient
• Contents Similarity
– Based on the cosine similarity between term vectors from the body texts
Pg 16
Measures of Contextual Relatedness
• Co-occurrence Frequency
– To represent the discourse relatedness obtained from Wikipedia
– Assumption
• The more frequently the mentions about two concepts co-occurred
• The more similar aspects both concepts take in dialogue flows
– By normalized point-wise mutual information
Pg 17
Measures of Contextual Relatedness
• Geographical Closeness
– Domain-specific Measure
– Based on the geographic coordinate information of spatial concepts
• Final Score
Pg 18
Measures of Contextual Relatedness
Pg 19
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 20
Evaluation
• Dataset
– Dialogue Corpus on Singapore Tour Guide
• Real human-human mixed initiative conversations
• Between guides and tourists
• Stats
– 35 dialogue sessions
– 21 hours
– 19,651 utterances
• Topics
– 1,642 topic segments
– 9 topic categories
» Opening, Closing, Itinerary, Accommodation, Attraction, Food,
Transportation, Shopping, Other
– Wikipedia Collection
• 3,115 articles related to Singapore
• Collected from Wikipedia database dump as of Feb 2013
Pg 21
Evaluation
• Models
– Training Instances
• 8,318 instances for user-turn-level segmentation
• 1,607 instances for dialogue-segment-level topic prediction
– Support Vector Machine (SVM) Models
• BOW: Baseline only with term vector space
• WK0: Extended vector without transformation
• WK1: s(di, dj) = s1(di, dj)
• WK2: s(di, dj) = s1(di, dj) + s2(di, dj)
• WK3: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj)
• WK4: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj)
• WK5: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) + s5(di, dj)
• Metrics
– Five-fold Cross-validation
– Segmentation: P/R/F
– Topic Prediction: Accuracy
Pg 22
Evaluation
• Comparison of dialogue topic tracking performances
Pg 23
Evaluation
• Distributions of errors on the cascaded results with WK5
– 71.4% of errors result from segmentation
– 60.0% of errors occurred for system-initiative cases
Pg 24
Contents
• Introduction
• Problem Definition
• Method
• Evaluation
• Conclusions
Pg 25
Conclusions
• Summary
– Wikipedia-based Kernel Method for Dialogue Topic Tracking
– To incorporate various types of information from Wikipedia
– Experimental results show the merits of our proposed approach in
mixed-initiative dialogues
• Ongoing Work
– Using more various types of knowledge from Wikipedia
– To be presented at ACL 2014
• A Composite Kernel Approach for Dialog Topic Tracking with Structured
Domain Knowledge from Wikipedia
Pg 26
Thank You
Pg 27
Contact: kims@i2r.a-star.edu.sg
References
• B. Lin, H. Wang, and L. Lee, “A distributed architecture for cooperative spoken dialogue agents with coherent dialogue state and history,” in Proceedings
of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1999.
• S. Ikeda, K. Komatani, T. Ogata, H. G. Okuno, and H. G. Okuno, “Extensibility verification of robust domain selection against out-of-grammar utterances
in multidomain spoken dialogue system.,” in Proceedings of the 9th Annual Conference of the International Speech Communicatiuon Association
(INTERSPEECH), 2008, pp. 487–490.
• A. Celikyilmaz, D. Hakkani-T¨ur, and G. T¨ur, “Approximate inference for domain detection in spoken language understanding.,” in Proceedings of the
12th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2011, pp. 713–716.
• T. Nakata, S. Ando, and A. Okumura, “Topic detection based on dialogue history,” in Proceedings of the 19th international conference on Computational
linguistics (COLING), 2002, pp. 1–7.
• K. Lagus and J. Kuusisto, “Topic identification in natural language dialogues using neural networks,” in Proceedings of the 3rd SIGdial workshop on
Discourse and dialogue, 2002, pp. 95–102.
• P. H. Adams and C. H. Martell, “Topic detection and extraction in chat,” in Proceedings of the 2008 IEEE International Conference on Semantic
Computing, 2008, pp. 581–588.
• S. Roy and L. V. Subramaniam, “Automatic generation of domain models for call centers from noisy transcriptions,” in Proceedings of the 21st
International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, 2006, pp. 737–744.
• S. Young, J. Schatzmann, K. Weilhammer, and H. Ye, “The hidden information state approach to dialog management,” in Proceedings of the
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2007.
• D. Bohus and A. Rudnicky, “Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda,” in Proceedings of the
European Conference on Speech, Communication and Technology, 2003, pp. 597–600.
• C. Lee, S. Jung, and G. G. Lee, “Robust dialog management with n-best hypotheses using dialog examples and agenda.,” in Proceedings of the 46th
Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008, pp. 630–637.
• G. Salton, A. Wong, and C.S. Yang, “A vector space model for automatic indexing,” Communications of the ACM, vol. 18, no. 11, pp. 613–620, 1975.
• G.Wilcock, “Wikitalk: a spoken wikipedia-based opendomain knowledge access system,” in Proceedings of the Workshop on Question Answering for
Complex Domains, 2012, p. 5770.
• A. Breuing, U. Waltinger, and I. Wachsmuth, “Harvesting wikipedia knowledge to identify topics in ongoing natural language dialogs,” in Proceedings of
the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011, pp. 445–450.
• P. Wang and C. Domeniconi, “Building semantic kernels for text classification using wikipedia,” in Proceedings of the 14th ACM SIGKDD international
conference on Knowledge discovery and data mining, 2008, pp. 713–721.
• Z. Wu and M. Palmer, “Verbs semantics and lexical selection,” in Proceedings of the 32nd annual meeting on Association for Computational Linguistics,
1994, pp.133–138.
• C. C. Chang and C. J. Lin, “Libsvm: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 2, no.
3, pp. 27, 2011.

Contenu connexe

Similaire à Wikipedia-based Kernels for Dialogue Topic Tracking

Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesAshis Kumar Chanda
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesAshis Chanda
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyOCLC
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyLynn Connaway
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findingsalc28
 
Rizal Conference
Rizal ConferenceRizal Conference
Rizal Conferencerajenmunoo
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...tobold
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...Sebastian Dennerlein
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationElena Simperl
 
Experiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked DataExperiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked DataOscar Corcho
 
Social media as a tool for terminological research
Social media as a tool for terminological researchSocial media as a tool for terminological research
Social media as a tool for terminological researchTERMCAT
 
Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...cafuchs
 
Web 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomWeb 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomClaudia Warth
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Web 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingWeb 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingClaudia Warth
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolElectronic Resources & Libraries
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning EnvironmentsTore Hoel
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...TimelessFuture
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 

Similaire à Wikipedia-based Kernels for Dialogue Topic Tracking (20)

Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
 
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library EthnographyCapturing the Behaviors of the Elusive User: Strategies for Library Ethnography
Capturing the Behaviors of the Elusive User: Strategies for Library Ethnography
 
Towards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial FindingsTowards OpenURL Quality Metrics: Initial Findings
Towards OpenURL Quality Metrics: Initial Findings
 
Rizal Conference
Rizal ConferenceRizal Conference
Rizal Conference
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Experiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked DataExperiences in the Development of Geographical Ontologies and Linked Data
Experiences in the Development of Geographical Ontologies and Linked Data
 
Social media as a tool for terminological research
Social media as a tool for terminological researchSocial media as a tool for terminological research
Social media as a tool for terminological research
 
Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...Methodological Implications of Using Google Applications (Google Sites and Go...
Methodological Implications of Using Google Applications (Google Sites and Go...
 
Web 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature ClassroomWeb 2.0 and Social Media in the Language and Literature Classroom
Web 2.0 and Social Media in the Language and Literature Classroom
 
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Wusteman Ticer09
Wusteman Ticer09Wusteman Ticer09
Wusteman Ticer09
 
Web 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language TeachingWeb 2.0 & Social Media in Language Teaching
Web 2.0 & Social Media in Language Teaching
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
 
Standards for Smart Learning Environments
Standards for Smart Learning EnvironmentsStandards for Smart Learning Environments
Standards for Smart Learning Environments
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 

Plus de Seokhwan Kim

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)Seokhwan Kim
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Seokhwan Kim
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingSeokhwan Kim
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionSeokhwan Kim
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionSeokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...Seokhwan Kim
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessSeokhwan Kim
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionSeokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템Seokhwan Kim
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionSeokhwan Kim
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...Seokhwan Kim
 

Plus de Seokhwan Kim (15)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
 

Dernier

A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 

Dernier (20)

Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 

Wikipedia-based Kernels for Dialogue Topic Tracking

  • 1. WIKIPEDIA-BASED KERNELS FOR DIALOGUE TOPIC TRACKING Seokhwan Kim, Rafael E. Banchs, Haizhou Li Human Language Technology Department Institute for Infocomm Research (I2R) 6th May 2014 ICASSP, Florence, Italy
  • 2. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 2
  • 3. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 3
  • 4. Motivation • Spoken Dialogue Systems – Next-generation User Interface – The most natural way for human-human communication • Single-task dialogues – Most previous work focuses on single target task • Eg. Flight Reservation, Bus Information Guide, Restaurant Booking – Cause limitations in practical uses • Multi-task dialogues – [Lin et al. 1999, Ikeda et al. 2008, Celikyilmaz et al. 2011] – Selecting the most probable system at each turn – Each system is independently built and operated from others Pg 4
  • 5. Related Work • Text categorization-based Dialogue Topic Identification – [Nakata et al., 2002; Lagus&Kuusisto, 2002; Adams&Martell, 2008] – Differences from written texts • Determinations of topics – User’s intentions – System’s decisions • Available features – Unable to see the future turns • Knowledge-based Dialogue Topic Suggestion – External Knowledge Sources • Eg. Domain Models, Heuristics, Agendas • [Roy&Subramaniam, 2006; Young et al., 2007; Bohus&Rudnicky, 2003; Lee et al. 2008 – Limited flexibility • To handle user-initiative cases – High cost • To build a sufficient amount of resources Pg 5
  • 6. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 6
  • 7. Dialogue Topic Tracking • Subtasks – Dialogue Segmentation • Segmenting a session into topically coherent sub-dialogues – Topic Transition Identification • Identifying the next topic category at each time of topic transition Pg 7
  • 9. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 9
  • 10. Wikipedia-based Kernel Method • Vector Space Model – The simplest approach to represent features for supervised machine learning methods – An instance for each turn  A weighted term vector – Lack of semantic or domain-specific aspects • Each word is considered as an independent and identical unit Pg 10
  • 11. Wikipedia-based Kernel Method • Wikipedia for Dialogue Topic Tracking – As an external knowledge source – Without significant effort for building resources – Previous work • [Breuing et al., 2011; Wilcock, 2012] • Focusing only on a single type of information from Wikipedia • Wikipedia-based Kernel Method – Aiming at incorporating various knowledge from Wikipedia – To map the data into a higher dimensional feature space • Vector Space Extension • Vector Transformation Pg 11
  • 12. Wikipedia-based Kernel Method • Vector Extension Pg 12 Term Vector Concept Vector … U: --------------------------- S: --------------------------- U: --------------------------- S: --------------------------- U: --------------------------- S: --------------------------- … x β1 β2 Β|D| d1 d2 d|D| ⁞
  • 13. Wikipedia-based Kernel Method • Vector Transformation – Each extended vector is transformed into a new space – Transformation Matrix S – s(di, dj) is the relatedness between di and dj – Update of Concept Vector Values Pg 13
  • 15. Measures of Contextual Relatedness • How to compute s(di, dj)? • Category Relatedness – Based on hierarchical structures of Wikipedia categories • depth(d): the length of the path from the root node to d • lcs(di, dj): the least common subsume of the two articles in the hierarchy Pg 15
  • 16. Measures of Contextual Relatedness • Category Overlap Score – Based on the ratio of common categories of two concepts – By Jaccard’s coefficient • Contents Similarity – Based on the cosine similarity between term vectors from the body texts Pg 16
  • 17. Measures of Contextual Relatedness • Co-occurrence Frequency – To represent the discourse relatedness obtained from Wikipedia – Assumption • The more frequently the mentions about two concepts co-occurred • The more similar aspects both concepts take in dialogue flows – By normalized point-wise mutual information Pg 17
  • 18. Measures of Contextual Relatedness • Geographical Closeness – Domain-specific Measure – Based on the geographic coordinate information of spatial concepts • Final Score Pg 18
  • 19. Measures of Contextual Relatedness Pg 19
  • 20. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 20
  • 21. Evaluation • Dataset – Dialogue Corpus on Singapore Tour Guide • Real human-human mixed initiative conversations • Between guides and tourists • Stats – 35 dialogue sessions – 21 hours – 19,651 utterances • Topics – 1,642 topic segments – 9 topic categories » Opening, Closing, Itinerary, Accommodation, Attraction, Food, Transportation, Shopping, Other – Wikipedia Collection • 3,115 articles related to Singapore • Collected from Wikipedia database dump as of Feb 2013 Pg 21
  • 22. Evaluation • Models – Training Instances • 8,318 instances for user-turn-level segmentation • 1,607 instances for dialogue-segment-level topic prediction – Support Vector Machine (SVM) Models • BOW: Baseline only with term vector space • WK0: Extended vector without transformation • WK1: s(di, dj) = s1(di, dj) • WK2: s(di, dj) = s1(di, dj) + s2(di, dj) • WK3: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) • WK4: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) • WK5: s(di, dj) = s1(di, dj) + s2(di, dj) + s3(di, dj) + s4(di, dj) + s5(di, dj) • Metrics – Five-fold Cross-validation – Segmentation: P/R/F – Topic Prediction: Accuracy Pg 22
  • 23. Evaluation • Comparison of dialogue topic tracking performances Pg 23
  • 24. Evaluation • Distributions of errors on the cascaded results with WK5 – 71.4% of errors result from segmentation – 60.0% of errors occurred for system-initiative cases Pg 24
  • 25. Contents • Introduction • Problem Definition • Method • Evaluation • Conclusions Pg 25
  • 26. Conclusions • Summary – Wikipedia-based Kernel Method for Dialogue Topic Tracking – To incorporate various types of information from Wikipedia – Experimental results show the merits of our proposed approach in mixed-initiative dialogues • Ongoing Work – Using more various types of knowledge from Wikipedia – To be presented at ACL 2014 • A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain Knowledge from Wikipedia Pg 26
  • 27. Thank You Pg 27 Contact: kims@i2r.a-star.edu.sg
  • 28. References • B. Lin, H. Wang, and L. Lee, “A distributed architecture for cooperative spoken dialogue agents with coherent dialogue state and history,” in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1999. • S. Ikeda, K. Komatani, T. Ogata, H. G. Okuno, and H. G. Okuno, “Extensibility verification of robust domain selection against out-of-grammar utterances in multidomain spoken dialogue system.,” in Proceedings of the 9th Annual Conference of the International Speech Communicatiuon Association (INTERSPEECH), 2008, pp. 487–490. • A. Celikyilmaz, D. Hakkani-T¨ur, and G. T¨ur, “Approximate inference for domain detection in spoken language understanding.,” in Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2011, pp. 713–716. • T. Nakata, S. Ando, and A. Okumura, “Topic detection based on dialogue history,” in Proceedings of the 19th international conference on Computational linguistics (COLING), 2002, pp. 1–7. • K. Lagus and J. Kuusisto, “Topic identification in natural language dialogues using neural networks,” in Proceedings of the 3rd SIGdial workshop on Discourse and dialogue, 2002, pp. 95–102. • P. H. Adams and C. H. Martell, “Topic detection and extraction in chat,” in Proceedings of the 2008 IEEE International Conference on Semantic Computing, 2008, pp. 581–588. • S. Roy and L. V. Subramaniam, “Automatic generation of domain models for call centers from noisy transcriptions,” in Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, 2006, pp. 737–744. • S. Young, J. Schatzmann, K. Weilhammer, and H. Ye, “The hidden information state approach to dialog management,” in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2007. • D. Bohus and A. Rudnicky, “Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda,” in Proceedings of the European Conference on Speech, Communication and Technology, 2003, pp. 597–600. • C. Lee, S. Jung, and G. G. Lee, “Robust dialog management with n-best hypotheses using dialog examples and agenda.,” in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008, pp. 630–637. • G. Salton, A. Wong, and C.S. Yang, “A vector space model for automatic indexing,” Communications of the ACM, vol. 18, no. 11, pp. 613–620, 1975. • G.Wilcock, “Wikitalk: a spoken wikipedia-based opendomain knowledge access system,” in Proceedings of the Workshop on Question Answering for Complex Domains, 2012, p. 5770. • A. Breuing, U. Waltinger, and I. Wachsmuth, “Harvesting wikipedia knowledge to identify topics in ongoing natural language dialogs,” in Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011, pp. 445–450. • P. Wang and C. Domeniconi, “Building semantic kernels for text classification using wikipedia,” in Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, 2008, pp. 713–721. • Z. Wu and M. Palmer, “Verbs semantics and lexical selection,” in Proceedings of the 32nd annual meeting on Association for Computational Linguistics, 1994, pp.133–138. • C. C. Chang and C. J. Lin, “Libsvm: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 2, no. 3, pp. 27, 2011.