SlideShare une entreprise Scribd logo
1  sur  44
Detecting Actionable Items in Meetings by
Convolutional Deep Structured Semantic Models
YUN-NUNG (VIVIAN) CHEN
DILEK HAKKANI-TÜR
XIAODONG HE
AUG 7TH, 2015
REDMOND, WA
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 1
My Background
Yun-Nung (Vivian) Chen http://vivianchen.idv.tw
PhD candidate working with Prof. Alexander I Rudnicky
Language Technologies Institute, School of Computer Science, Carnegie Mellon University
Research focus
◦ spoken dialogue system
◦ unsupervised/weakly-supervised spoken language understanding
Dissertation
◦ Title: Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogue Systems
◦ Committee: Prof. Alexander I. Rudnicky, Prof. Anatole Gershman, Prof. Alan W Black, Dr. Dilek Hakkani-Tür
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 2
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 3
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 4
Motivation
Meetings as a rich resource contain important information for participants.
◦ Meeting summarization (Xie et al., 2009; Riedhammer et al., 2010; Chen and Metze, 2013)
◦ Decision detection (Fernandez et al., 2008)
◦ Noteworthy utterance detection (Banerjee and Rudnicky, 2009)
◦ Action item identification (Yang et al., 2008)
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 5
Motivation
Computing devices have been easily accessible and information search has been a common part
of regular conversations.
Meetings include discussions for identifying participants’ next actions.
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 6
Motivation - Example
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 7
Will Vivian come here for the meeting? Find Calendar Entry
Motivation - Example
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 8
The ACL Best Student Paper is great! That is about .. Web Search
Motivation - Example
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 9
Let’s discuss the rebuttal, maybe Monday afternoon. Create Calendar Entry
Motivation
Some actionable items
◦ should be processed in real-time during the meeting, e.g. search, find_calendar_entry
◦ can be done after the meeting, e.g. create_reminder, create_calendar_entry
The advantages include
◦ seamless
◦ users do not need to directly address actions
◦ make meeting continue as planned, not interrupted by system errors
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 10
Motivation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 11
3944.620 3945.990 me018 Have they ever responded to you?
3945.703 3946.325 me011 Nope.
find_email: check emails of me011, search for any emails from them
2049.280 2057.010 mn015 Yeah it's - or - or just - Yeah. It's also all on my - my home page at E_M_L. It's called "An
Anatomy of afind Spatial Description". But I'll send that link.
send_email: email all participants about "link to An Anatomy of Spatial Description"
2867.655 2888.613 mn015 I suggest w- to - for - to proceed with this in - in the sense that maybe, throughout this week,
the three of us will - will talk some more about maybe segmenting off different regions, and
we make up some - some toy a- observable "nodes" - is that what th-
create_calendar_entry: open calendars of participants, marking times free for the three participants and schedule an event
 Action Detection/Identification
stime etime spk_id trans
Motivation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 12
3944.620 3945.990 me018 Have <from_contact_name>they</from_contact_name> ever responded to
<contact_name>you</contact_name>?
3945.703 3946.325 me011 Nope.
find_email: check emails of me011, search for any emails from them
2049.280 2057.010 mn015 Yeah it's - or - or just - Yeah. It's also all on my - my home page at E_M_L. It's called "An
Anatomy of afind Spatial Description". But I'll send <email_content>that link</email_content>.
send_email: email all participants about "link to An Anatomy of Spatial Description"
2867.655 2888.613 mn015 I suggest w- to - for - to proceed with this in - in the sense that maybe, <date>throughout this
week</date>, the <contact_name>three of us</contact_name> will - will talk some more
about maybe segmenting off different regions, and we make up some - some toy a- observable
"nodes" - is that what th-
create_calendar_entry: open calendars of participants, marking times free for the three participants and schedule an event
 Coreference Resolution
stime etime spk_id trans
 Action Detection/Identification  Slot Filling
Pipeline
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 13
Automatic Speech
Recognition (ASR)
Speaker
Diarization
Coreference
Resolution
Sentence
Segmentation
Slot Filling
Action Detection
3944.620 3945.990Have they ever responded to you? me018 find_email
from_contact_name contact_name
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 14
Task Definition
Actionable Item Detection
◦ Goal: provide the easy access to information and perform actions a personal assistant can handle
without interrupting the meetings
◦ Assumption: some actions and associated arguments can be shared across genres
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 15
Human-Machine Genre create_calendar_entry
schedule a meeting with <contact_name>John</contact_name> <start_time>this afternoon</start_time>
Human-Human Genre create_calendar_entry
how about the <contact_name>three of us</contact_name> discuss this later <start_time>this afternoon</start_time>?
 more casual, include conversational terms
Task: multi-class utterance classification
• train on the available human-machine genre
• test on the human-human genre
Adaptation
• model adapation
• embedding vector adaptation
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 16
Proposed Approach
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 17
Idea: Cortana data (human-machine) may help detect actionable items in meetings (human-human)
12 30 on Monday the 2nd of July schedule meeting to discuss taxes with Bill,
Judy and Rick.
12 PM lunch with Jessie
12:00 meeting with cleaning staff to discuss progress
2 hour business presentation with Rick Sandy on March 8 at 7am
create_calendar_entry
A read receipt should be sent in Karen's email.
Activate meeting delay by 15 minutes. Inform the participants.
Add more message saying "Thanks".
send_email
Query Doc
Convolutional deep structured semantic models for IR may be useful for this task.
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 18
Convolutional Deep Structured Semantic Model (CDSSM)
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 19
20K 20K 20K
1000
w1 w2 w3
1000 1000
20K
wd
1000
300
Word Sequence: x
Word Hashing Matrix: Wh
Word Hashing Layer: lh
Convolution Matrix: Wc
Convolutional Layer: lc
Max Pooling Operation
Max Pooling Layer: lm
Semantic Projection Matrix: Ws
Semantic Layer: y
max max max
300 300 300 300
U A1 A2 An
CosSim(U, Ai)
P(A1 | U) P(A2 | U) P(An | U)
…
Semantic Similarity
Posterior Probability
Utterance
Action
𝑃 𝐴 𝑈 =
exp(𝐶𝑜𝑠𝑆𝑖𝑚(𝑈, 𝐴))
𝐴′ exp(𝐶𝑜𝑠𝑆𝑖𝑚(𝑈, 𝐴′))how about the we discuss this later
Shen et al., “A latent semantic model with convolutional-pooling structure for information retrieval,” in CIKM, 2014.
Huang et al., “Learning deep structured semantic models for web search using click through data,” in CIKM, 2013.
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 20
The model objective is to maximize the likelihood of associated actions given training utterances:
Similarly, reversing actions and utterances induces
Convolutional Deep Structured Semantic Model (CDSSM)
300 300 300 300
A U1 U2 Un
P(U1 | A) P(U2 | A) P(Un | A)
…
Action
Utterance
 Predictive Model
 Generative Model
300 300 300 300
U A1 A2 An
P(A1 | U) P(A2 | U) P(An | U)
…
Utterance
Action
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 21
Adaptation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 22
Issue: mismatch between a source genre and a target genre
Solution:
◦ Adapting CDSSM
◦ Continually train the CDSSM using the data from the target genre (usually stop early before fully
converged)
◦ More robust because of data from different genres and specific to the target genre
◦ Adapting Action Embeddings
◦ Moving learned action embeddings close to the observed corresponding utterance embeddings
Adapting Action Embeddings
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 23
The mismatch may result in inaccurate action embeddings
Action1
Action2
Action3
utt utt
utt
utt
utt
utt
utt
Action embedding: trained on the source genre
Utterance embedding: generated on the target genre
Idea: moving action embeddings close to the observed
utterance embeddings from the target genre
Adapting Action Embeddings
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 24
Learning adapted action embeddings by minimizing an objective:
Action1
Action2
Action3
utt utt
utt
utt
utt
utt
utt
The distance between original and new action embeddings
The distance between new action embeddings and corresponding utterance embeddings
Action3
Action1
Action2
Adapting Action Embeddings
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 25
adapted action embeddings
The objective considering action and utterance embeddings together:
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 26
Actionable Item Detection
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 27
Goal: predict possible actions given each utterance
Before adaptation, the estimated semantic score of the k-th action
After adaptation, the estimated semantic score of the k-th action
Two ways of usage:
1. As final prediction scores
2. As features of a classifier: the trained classifier outputs the final prediction scores
Actionable Item Detection
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 28
Unidirectional: prediction score from one of
◦ Predictive model: SP(U, A)
◦ Generative model: SG(U, A)
Bidirectional: prediction score merging both scores
300 300 300 300
A U1 U2 Un
SG(U, A)
…
Action
Utterance
300 300 300 300
U A1 A2 An
SP(U, A)
…
Utterance
Action
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 29
Experiments
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 30
Dataset: 22 meetings from the ICSI meeting corpus
Meeting type: Bed, Bmr, Bro
Identified action: find_calendar_entry, create_calendar_entry, open_agenda, add_agenda_item,
create_single_reminder, send_email, find_email, make_call, search, open_setting
Annotating agreement: Cohen’s Kappa = 0.64~0.67
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Bed
Bmr
Bro
create_single_reminder find_calendar_entry search add_agenda_item create_calendar_entry
open_agenda send_email find_email make_call open_setting
Experiments
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 31
Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others
CDSSM training:
◦ Mismatch-CDSSM: trained on Cortana data (300 iters)
◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters)
◦ Match-CDSSM: trained on meeting data (300 iters)
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
Experiments
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 32
Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others
CDSSM training:
◦ Mismatch-CDSSM: trained on Cortana data (300 iters)
◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters)
◦ Match-CDSSM: trained on meeting data (300 iters)
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
Experiments
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 33
Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others
CDSSM training:
◦ Mismatch-CDSSM: trained on Cortana data (300 iters)
◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters)
◦ Match-CDSSM: trained on meeting data (300 iters)
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
+ Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86
Experiments
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 34
Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others
CDSSM training:
◦ Mismatch-CDSSM: trained on Cortana data (300 iters)
◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters)
◦ Match-CDSSM: trained on meeting data (300 iters)
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
+ Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86
+ AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
Comparing different CDSSM training data
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 35
The embeddings pretrained on the mismatched data are successfully adjusted to the target genre
Matched data produces the best performance
◦ Not robust enough, even worse than the mismatch model
Better robustness from the bidirectional model, the embedding adaptation, and an additional classifier
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
+ Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86
+ AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
Effectiveness of Bidirectional Estimation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 36
Almost all results from the bidirectional estimation significantly outperform unidirectional ones.
Between predictive and generative models, no certain direction is better.
Predictive and generative models can compensate each other and provide more robust estimations.
Approach #dim
Mismatch-CDSSM Adapt-CDSSM Match-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
+ Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86
+ AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
Effectiveness of Adaptation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 37
When the trained model mismatches to the task,
◦ the model adaptation
◦ the action embedding adaptation
are useful to overcome the genre mismatch.
Approach #dim
Mismatch-CDSSM Adapt-CDSSM
P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir
w/o
SVM
Sim 47.45 48.17 49.10 48.67 50.09 50.36
AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08
w/
SVM
Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95
+ Sim 311 52.80 54.95 59.09 60.78 60.29 65.08
+ AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71
model adaptation
action embedding adaptation
Effectiveness of Adaptation
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 38
When the model matches with the target genre, action embedding adaptation still works.
We force action embeddings to be more accurate by decreasing the accuracy of others embeddings
◦ Less training samples for important action embeddings
◦ More training samples for others
Approach
Match-CDSSM
P(A|U) P(U|A) Bidir
w/o
SVM
Sim 56.33 43.39 50.57
AdaptSim 64.19 60.36 62.34 0.0
0.2
0.4
0.6
0.8
1.0
Sim
AdpSim
AUC
Effectiveness of CDSSM
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 39
Baselines
◦ Lexical: ngram
◦ Semantic: doc2vec (paragraph vector)
Classifier: SVM with RBF kernal
Approach AUC
Baseline
ngram 52.84
doc2vec 59.79
Proposed
CDSSM: P(A|U) 64.33
CDSSM: P(U|A) 65.58
CDSSM: Bidirectional 69.27
Semantic embeddings provide useful features for detecting actions.
Bidirectional estimation performs better than unidirectional.
All proposed approaches outperform baselines.
Le and Mikolov, “Distributed representations of sentences and documents,” in ICML, 2014.
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 40
Conclusions
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 41
The latent semantic features generated by CDSSM show the effectiveness of detecting actions in
meetings, outperform lexical features and semantic paragraph vectors.
The adaptation techniques adjust embeddings to fit the target genre when the target genre does not
match well with the source genre.
The experiments highlight a future research direction for language understanding.
◦ Release annotated dataset
◦ Release trained embeddings
Outline
Introduction
◦ Motivation
◦ Task Definition
Proposed Approach
◦ Convolutional Deep Structured Semantic Model (CDSSM)
◦ Adaptation
◦ Actionable Item Detection
Experiments
Conclusions
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 42
Ongoing & Future Work
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 43
Active learning: boosting the performance via few annotated actions
◦ Rank by score without a classifier
◦ Acquire annotations for top-ranked utterances
Robustness test: evaluate unseen action embeddings generated by CDSSM
◦ Unseen: send_email
◦ Observed: send_message, find_email
Q & A 
DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 44
THANKS FOR YOUR ATTENTIONS!

Contenu connexe

Tendances

Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Yun-Nung (Vivian) Chen
 
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue ManagerEnd-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue ManagerYun-Nung (Vivian) Chen
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人Yun-Nung (Vivian) Chen
 
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...Yun-Nung (Vivian) Chen
 
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...Yun-Nung (Vivian) Chen
 
How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesYun-Nung (Vivian) Chen
 
2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue SystemsMLReview
 
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...AI Frontiers
 
End-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue SystemsEnd-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue SystemsYun-Nung (Vivian) Chen
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System ReviewNguyen Quang
 
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?krisztianbalog
 
Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...krisztianbalog
 
Dialogue systems and personal assistants
Dialogue systems and personal assistantsDialogue systems and personal assistants
Dialogue systems and personal assistantsNatalia Konstantinova
 
A model of a social chatbot
A model of a social chatbotA model of a social chatbot
A model of a social chatbotAgnese Augello
 
Intro to Vita Beans
Intro to Vita BeansIntro to Vita Beans
Intro to Vita Beansamruth
 
Managing Dialog Strategy In Multiskill AI Assistant.pdf
Managing Dialog Strategy In Multiskill AI Assistant.pdfManaging Dialog Strategy In Multiskill AI Assistant.pdf
Managing Dialog Strategy In Multiskill AI Assistant.pdfDaniel Kornev
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET Journal
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processingpunedevscom
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 

Tendances (20)

Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
 
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue ManagerEnd-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
 
One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人One Day for Bot 一天搞懂聊天機器人
One Day for Bot 一天搞懂聊天機器人
 
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information ...
 
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Lan...
 
How the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in DialoguesHow the Context Matters Language and Interaction in Dialogues
How the Context Matters Language and Interaction in Dialogues
 
2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems2017 Tutorial - Deep Learning for Dialogue Systems
2017 Tutorial - Deep Learning for Dialogue Systems
 
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
Dilek Hakkani-Tur at AI Frontiers: Conversational machines: Deep Learning for...
 
End-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue SystemsEnd-to-End Task-Completion Neural Dialogue Systems
End-to-End Task-Completion Neural Dialogue Systems
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
 
Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...
 
Dialogue systems and personal assistants
Dialogue systems and personal assistantsDialogue systems and personal assistants
Dialogue systems and personal assistants
 
A model of a social chatbot
A model of a social chatbotA model of a social chatbot
A model of a social chatbot
 
Intro to Vita Beans
Intro to Vita BeansIntro to Vita Beans
Intro to Vita Beans
 
Managing Dialog Strategy In Multiskill AI Assistant.pdf
Managing Dialog Strategy In Multiskill AI Assistant.pdfManaging Dialog Strategy In Multiskill AI Assistant.pdf
Managing Dialog Strategy In Multiskill AI Assistant.pdf
 
Blenderbot
BlenderbotBlenderbot
Blenderbot
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression Identification
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 

En vedette

Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesYun-Nung (Vivian) Chen
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesYun-Nung (Vivian) Chen
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...James Hendler
 
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Yun-Nung (Vivian) Chen
 
An Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingYun-Nung (Vivian) Chen
 
Thinking Outside the Table
Thinking Outside the TableThinking Outside the Table
Thinking Outside the TableOntotext
 
Cognitive Computing and the future of Artificial Intelligence
Cognitive Computing and the future of Artificial IntelligenceCognitive Computing and the future of Artificial Intelligence
Cognitive Computing and the future of Artificial IntelligenceVarun Singh
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...James Hendler
 

En vedette (8)

Automatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course LecturesAutomatic Key Term Extraction from Spoken Course Lectures
Automatic Key Term Extraction from Spoken Course Lectures
 
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course LecturesAutomatic Key Term Extraction and Summarization from Spoken Course Lectures
Automatic Key Term Extraction and Summarization from Spoken Course Lectures
 
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
 
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
Leveraging Behavioral Patterns of Mobile Applications for Personalized Spoken...
 
An Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task UnderstandingAn Intelligent Assistant for High-Level Task Understanding
An Intelligent Assistant for High-Level Task Understanding
 
Thinking Outside the Table
Thinking Outside the TableThinking Outside the Table
Thinking Outside the Table
 
Cognitive Computing and the future of Artificial Intelligence
Cognitive Computing and the future of Artificial IntelligenceCognitive Computing and the future of Artificial Intelligence
Cognitive Computing and the future of Artificial Intelligence
 
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
 

Similaire à Detecting Actionable Items in Meetings with Convolutional Deep Structured Semantic Models

Abstract
AbstractAbstract
Abstractemaye
 
시나리오 베이스 디자인 방법론 (Scenario Based Design)
시나리오 베이스 디자인 방법론 (Scenario Based Design)시나리오 베이스 디자인 방법론 (Scenario Based Design)
시나리오 베이스 디자인 방법론 (Scenario Based Design)Hajin Lim
 
Concept mapping for complexity management
Concept mapping for complexity managementConcept mapping for complexity management
Concept mapping for complexity managementLawrie Hunter
 
Agile leadership practices for PIONEERS
 Agile leadership practices for PIONEERS Agile leadership practices for PIONEERS
Agile leadership practices for PIONEERSStefan Haas
 
Multilayered paper prototyping for user concept modeling
Multilayered paper prototyping for user concept modelingMultilayered paper prototyping for user concept modeling
Multilayered paper prototyping for user concept modelingUKOLN (dev), University of Bath
 
Bridging the gap between AI and UI - DSI Vienna - full version
Bridging the gap between AI and UI - DSI Vienna - full versionBridging the gap between AI and UI - DSI Vienna - full version
Bridging the gap between AI and UI - DSI Vienna - full versionLiad Magen
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.pptbutest
 
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...EshaSharma71
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Miningbutest
 
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016Ricardo Amador
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papersAshish Kulkarni
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence Prasad Kulkarni
 
Meetup 22/2/2018 - Artificiële Intelligentie & Human Resources
Meetup 22/2/2018 - Artificiële Intelligentie & Human ResourcesMeetup 22/2/2018 - Artificiële Intelligentie & Human Resources
Meetup 22/2/2018 - Artificiële Intelligentie & Human ResourcesDigipolis Antwerpen
 
DBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineDBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineYi Zeng
 
Using construction grammar in conversational systems
Using construction grammar in conversational systemsUsing construction grammar in conversational systems
Using construction grammar in conversational systemsCJ Jenkins
 
Tutorial Cognition - Irene
Tutorial Cognition - IreneTutorial Cognition - Irene
Tutorial Cognition - IreneSSSW
 
Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentivesElena Simperl
 
Conversational AI with Rasa - PyData Workshop
Conversational AI with Rasa - PyData WorkshopConversational AI with Rasa - PyData Workshop
Conversational AI with Rasa - PyData WorkshopTom Bocklisch
 

Similaire à Detecting Actionable Items in Meetings with Convolutional Deep Structured Semantic Models (20)

Abstract
AbstractAbstract
Abstract
 
시나리오 베이스 디자인 방법론 (Scenario Based Design)
시나리오 베이스 디자인 방법론 (Scenario Based Design)시나리오 베이스 디자인 방법론 (Scenario Based Design)
시나리오 베이스 디자인 방법론 (Scenario Based Design)
 
Concept mapping for complexity management
Concept mapping for complexity managementConcept mapping for complexity management
Concept mapping for complexity management
 
Agile leadership practices for PIONEERS
 Agile leadership practices for PIONEERS Agile leadership practices for PIONEERS
Agile leadership practices for PIONEERS
 
Multilayered paper prototyping for user concept modeling
Multilayered paper prototyping for user concept modelingMultilayered paper prototyping for user concept modeling
Multilayered paper prototyping for user concept modeling
 
Bridging the gap between AI and UI - DSI Vienna - full version
Bridging the gap between AI and UI - DSI Vienna - full versionBridging the gap between AI and UI - DSI Vienna - full version
Bridging the gap between AI and UI - DSI Vienna - full version
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.ppt
 
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...
accessTufts_Leveraging_DesignThinking_EnterpriseArchitecture_NERCOMP_3-23-201...
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Mining
 
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016
Access tufts leveraging_designthinking_enterprisearchitecture_nercomp_3-23-2016
 
Laird ibm-small
Laird ibm-smallLaird ibm-small
Laird ibm-small
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence
 
human_factors_03.ppt
human_factors_03.ppthuman_factors_03.ppt
human_factors_03.ppt
 
Meetup 22/2/2018 - Artificiële Intelligentie & Human Resources
Meetup 22/2/2018 - Artificiële Intelligentie & Human ResourcesMeetup 22/2/2018 - Artificiële Intelligentie & Human Resources
Meetup 22/2/2018 - Artificiële Intelligentie & Human Resources
 
DBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineDBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support Engine
 
Using construction grammar in conversational systems
Using construction grammar in conversational systemsUsing construction grammar in conversational systems
Using construction grammar in conversational systems
 
Tutorial Cognition - Irene
Tutorial Cognition - IreneTutorial Cognition - Irene
Tutorial Cognition - Irene
 
Social machines: theory design and incentives
Social machines: theory design and incentivesSocial machines: theory design and incentives
Social machines: theory design and incentives
 
Conversational AI with Rasa - PyData Workshop
Conversational AI with Rasa - PyData WorkshopConversational AI with Rasa - PyData Workshop
Conversational AI with Rasa - PyData Workshop
 

Dernier

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Dernier (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Detecting Actionable Items in Meetings with Convolutional Deep Structured Semantic Models

  • 1. Detecting Actionable Items in Meetings by Convolutional Deep Structured Semantic Models YUN-NUNG (VIVIAN) CHEN DILEK HAKKANI-TÜR XIAODONG HE AUG 7TH, 2015 REDMOND, WA DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 1
  • 2. My Background Yun-Nung (Vivian) Chen http://vivianchen.idv.tw PhD candidate working with Prof. Alexander I Rudnicky Language Technologies Institute, School of Computer Science, Carnegie Mellon University Research focus ◦ spoken dialogue system ◦ unsupervised/weakly-supervised spoken language understanding Dissertation ◦ Title: Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogue Systems ◦ Committee: Prof. Alexander I. Rudnicky, Prof. Anatole Gershman, Prof. Alan W Black, Dr. Dilek Hakkani-Tür DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 2
  • 3. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 3
  • 4. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 4
  • 5. Motivation Meetings as a rich resource contain important information for participants. ◦ Meeting summarization (Xie et al., 2009; Riedhammer et al., 2010; Chen and Metze, 2013) ◦ Decision detection (Fernandez et al., 2008) ◦ Noteworthy utterance detection (Banerjee and Rudnicky, 2009) ◦ Action item identification (Yang et al., 2008) DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 5
  • 6. Motivation Computing devices have been easily accessible and information search has been a common part of regular conversations. Meetings include discussions for identifying participants’ next actions. DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 6
  • 7. Motivation - Example DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 7 Will Vivian come here for the meeting? Find Calendar Entry
  • 8. Motivation - Example DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 8 The ACL Best Student Paper is great! That is about .. Web Search
  • 9. Motivation - Example DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 9 Let’s discuss the rebuttal, maybe Monday afternoon. Create Calendar Entry
  • 10. Motivation Some actionable items ◦ should be processed in real-time during the meeting, e.g. search, find_calendar_entry ◦ can be done after the meeting, e.g. create_reminder, create_calendar_entry The advantages include ◦ seamless ◦ users do not need to directly address actions ◦ make meeting continue as planned, not interrupted by system errors DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 10
  • 11. Motivation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 11 3944.620 3945.990 me018 Have they ever responded to you? 3945.703 3946.325 me011 Nope. find_email: check emails of me011, search for any emails from them 2049.280 2057.010 mn015 Yeah it's - or - or just - Yeah. It's also all on my - my home page at E_M_L. It's called "An Anatomy of afind Spatial Description". But I'll send that link. send_email: email all participants about "link to An Anatomy of Spatial Description" 2867.655 2888.613 mn015 I suggest w- to - for - to proceed with this in - in the sense that maybe, throughout this week, the three of us will - will talk some more about maybe segmenting off different regions, and we make up some - some toy a- observable "nodes" - is that what th- create_calendar_entry: open calendars of participants, marking times free for the three participants and schedule an event  Action Detection/Identification stime etime spk_id trans
  • 12. Motivation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 12 3944.620 3945.990 me018 Have <from_contact_name>they</from_contact_name> ever responded to <contact_name>you</contact_name>? 3945.703 3946.325 me011 Nope. find_email: check emails of me011, search for any emails from them 2049.280 2057.010 mn015 Yeah it's - or - or just - Yeah. It's also all on my - my home page at E_M_L. It's called "An Anatomy of afind Spatial Description". But I'll send <email_content>that link</email_content>. send_email: email all participants about "link to An Anatomy of Spatial Description" 2867.655 2888.613 mn015 I suggest w- to - for - to proceed with this in - in the sense that maybe, <date>throughout this week</date>, the <contact_name>three of us</contact_name> will - will talk some more about maybe segmenting off different regions, and we make up some - some toy a- observable "nodes" - is that what th- create_calendar_entry: open calendars of participants, marking times free for the three participants and schedule an event  Coreference Resolution stime etime spk_id trans  Action Detection/Identification  Slot Filling
  • 13. Pipeline DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 13 Automatic Speech Recognition (ASR) Speaker Diarization Coreference Resolution Sentence Segmentation Slot Filling Action Detection 3944.620 3945.990Have they ever responded to you? me018 find_email from_contact_name contact_name
  • 14. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 14
  • 15. Task Definition Actionable Item Detection ◦ Goal: provide the easy access to information and perform actions a personal assistant can handle without interrupting the meetings ◦ Assumption: some actions and associated arguments can be shared across genres DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 15 Human-Machine Genre create_calendar_entry schedule a meeting with <contact_name>John</contact_name> <start_time>this afternoon</start_time> Human-Human Genre create_calendar_entry how about the <contact_name>three of us</contact_name> discuss this later <start_time>this afternoon</start_time>?  more casual, include conversational terms Task: multi-class utterance classification • train on the available human-machine genre • test on the human-human genre Adaptation • model adapation • embedding vector adaptation
  • 16. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 16
  • 17. Proposed Approach DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 17 Idea: Cortana data (human-machine) may help detect actionable items in meetings (human-human) 12 30 on Monday the 2nd of July schedule meeting to discuss taxes with Bill, Judy and Rick. 12 PM lunch with Jessie 12:00 meeting with cleaning staff to discuss progress 2 hour business presentation with Rick Sandy on March 8 at 7am create_calendar_entry A read receipt should be sent in Karen's email. Activate meeting delay by 15 minutes. Inform the participants. Add more message saying "Thanks". send_email Query Doc Convolutional deep structured semantic models for IR may be useful for this task.
  • 18. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 18
  • 19. Convolutional Deep Structured Semantic Model (CDSSM) DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 19 20K 20K 20K 1000 w1 w2 w3 1000 1000 20K wd 1000 300 Word Sequence: x Word Hashing Matrix: Wh Word Hashing Layer: lh Convolution Matrix: Wc Convolutional Layer: lc Max Pooling Operation Max Pooling Layer: lm Semantic Projection Matrix: Ws Semantic Layer: y max max max 300 300 300 300 U A1 A2 An CosSim(U, Ai) P(A1 | U) P(A2 | U) P(An | U) … Semantic Similarity Posterior Probability Utterance Action 𝑃 𝐴 𝑈 = exp(𝐶𝑜𝑠𝑆𝑖𝑚(𝑈, 𝐴)) 𝐴′ exp(𝐶𝑜𝑠𝑆𝑖𝑚(𝑈, 𝐴′))how about the we discuss this later Shen et al., “A latent semantic model with convolutional-pooling structure for information retrieval,” in CIKM, 2014. Huang et al., “Learning deep structured semantic models for web search using click through data,” in CIKM, 2013.
  • 20. DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 20 The model objective is to maximize the likelihood of associated actions given training utterances: Similarly, reversing actions and utterances induces Convolutional Deep Structured Semantic Model (CDSSM) 300 300 300 300 A U1 U2 Un P(U1 | A) P(U2 | A) P(Un | A) … Action Utterance  Predictive Model  Generative Model 300 300 300 300 U A1 A2 An P(A1 | U) P(A2 | U) P(An | U) … Utterance Action
  • 21. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 21
  • 22. Adaptation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 22 Issue: mismatch between a source genre and a target genre Solution: ◦ Adapting CDSSM ◦ Continually train the CDSSM using the data from the target genre (usually stop early before fully converged) ◦ More robust because of data from different genres and specific to the target genre ◦ Adapting Action Embeddings ◦ Moving learned action embeddings close to the observed corresponding utterance embeddings
  • 23. Adapting Action Embeddings DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 23 The mismatch may result in inaccurate action embeddings Action1 Action2 Action3 utt utt utt utt utt utt utt Action embedding: trained on the source genre Utterance embedding: generated on the target genre Idea: moving action embeddings close to the observed utterance embeddings from the target genre
  • 24. Adapting Action Embeddings DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 24 Learning adapted action embeddings by minimizing an objective: Action1 Action2 Action3 utt utt utt utt utt utt utt The distance between original and new action embeddings The distance between new action embeddings and corresponding utterance embeddings Action3 Action1 Action2
  • 25. Adapting Action Embeddings DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 25 adapted action embeddings The objective considering action and utterance embeddings together:
  • 26. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 26
  • 27. Actionable Item Detection DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 27 Goal: predict possible actions given each utterance Before adaptation, the estimated semantic score of the k-th action After adaptation, the estimated semantic score of the k-th action Two ways of usage: 1. As final prediction scores 2. As features of a classifier: the trained classifier outputs the final prediction scores
  • 28. Actionable Item Detection DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 28 Unidirectional: prediction score from one of ◦ Predictive model: SP(U, A) ◦ Generative model: SG(U, A) Bidirectional: prediction score merging both scores 300 300 300 300 A U1 U2 Un SG(U, A) … Action Utterance 300 300 300 300 U A1 A2 An SP(U, A) … Utterance Action
  • 29. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 29
  • 30. Experiments DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 30 Dataset: 22 meetings from the ICSI meeting corpus Meeting type: Bed, Bmr, Bro Identified action: find_calendar_entry, create_calendar_entry, open_agenda, add_agenda_item, create_single_reminder, send_email, find_email, make_call, search, open_setting Annotating agreement: Cohen’s Kappa = 0.64~0.67 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Bed Bmr Bro create_single_reminder find_calendar_entry search add_agenda_item create_calendar_entry open_agenda send_email find_email make_call open_setting
  • 31. Experiments DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 31 Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others CDSSM training: ◦ Mismatch-CDSSM: trained on Cortana data (300 iters) ◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters) ◦ Match-CDSSM: trained on meeting data (300 iters) Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34
  • 32. Experiments DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 32 Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others CDSSM training: ◦ Mismatch-CDSSM: trained on Cortana data (300 iters) ◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters) ◦ Match-CDSSM: trained on meeting data (300 iters) Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27
  • 33. Experiments DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 33 Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others CDSSM training: ◦ Mismatch-CDSSM: trained on Cortana data (300 iters) ◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters) ◦ Match-CDSSM: trained on meeting data (300 iters) Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27 + Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86
  • 34. Experiments DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 34 Evaluation metrics: the average area under the precision-recall curve (AUC) for 10 actions+others CDSSM training: ◦ Mismatch-CDSSM: trained on Cortana data (300 iters) ◦ Adapt-CDSSM: trained on Cortana data and then continually trained on meeting data (each 150 iters) ◦ Match-CDSSM: trained on meeting data (300 iters) Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27 + Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86 + AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
  • 35. Comparing different CDSSM training data DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 35 The embeddings pretrained on the mismatched data are successfully adjusted to the target genre Matched data produces the best performance ◦ Not robust enough, even worse than the mismatch model Better robustness from the bidirectional model, the embedding adaptation, and an additional classifier Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27 + Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86 + AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
  • 36. Effectiveness of Bidirectional Estimation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 36 Almost all results from the bidirectional estimation significantly outperform unidirectional ones. Between predictive and generative models, no certain direction is better. Predictive and generative models can compensate each other and provide more robust estimations. Approach #dim Mismatch-CDSSM Adapt-CDSSM Match-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 56.33 43.39 50.57 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 64.19 60.36 62.34 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 64.33 65.58 69.27 + Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 64.52 64.81 68.86 + AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 64.72 65.39 69.08
  • 37. Effectiveness of Adaptation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 37 When the trained model mismatches to the task, ◦ the model adaptation ◦ the action embedding adaptation are useful to overcome the genre mismatch. Approach #dim Mismatch-CDSSM Adapt-CDSSM P(A|U) P(U|A) Bidir P(A|U) P(U|A) Bidir w/o SVM Sim 47.45 48.17 49.10 48.67 50.09 50.36 AdaptSim 54.00 53.89 55.82 59.46 56.96 60.08 w/ SVM Embeddings 300 53.07 48.07 55.71 60.06 59.03 63.95 + Sim 311 52.80 54.95 59.09 60.78 60.29 65.08 + AdaptSim 311 52.75 55.22 59.23 61.60 61.13 65.71 model adaptation action embedding adaptation
  • 38. Effectiveness of Adaptation DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 38 When the model matches with the target genre, action embedding adaptation still works. We force action embeddings to be more accurate by decreasing the accuracy of others embeddings ◦ Less training samples for important action embeddings ◦ More training samples for others Approach Match-CDSSM P(A|U) P(U|A) Bidir w/o SVM Sim 56.33 43.39 50.57 AdaptSim 64.19 60.36 62.34 0.0 0.2 0.4 0.6 0.8 1.0 Sim AdpSim AUC
  • 39. Effectiveness of CDSSM DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 39 Baselines ◦ Lexical: ngram ◦ Semantic: doc2vec (paragraph vector) Classifier: SVM with RBF kernal Approach AUC Baseline ngram 52.84 doc2vec 59.79 Proposed CDSSM: P(A|U) 64.33 CDSSM: P(U|A) 65.58 CDSSM: Bidirectional 69.27 Semantic embeddings provide useful features for detecting actions. Bidirectional estimation performs better than unidirectional. All proposed approaches outperform baselines. Le and Mikolov, “Distributed representations of sentences and documents,” in ICML, 2014.
  • 40. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 40
  • 41. Conclusions DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 41 The latent semantic features generated by CDSSM show the effectiveness of detecting actions in meetings, outperform lexical features and semantic paragraph vectors. The adaptation techniques adjust embeddings to fit the target genre when the target genre does not match well with the source genre. The experiments highlight a future research direction for language understanding. ◦ Release annotated dataset ◦ Release trained embeddings
  • 42. Outline Introduction ◦ Motivation ◦ Task Definition Proposed Approach ◦ Convolutional Deep Structured Semantic Model (CDSSM) ◦ Adaptation ◦ Actionable Item Detection Experiments Conclusions Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 42
  • 43. Ongoing & Future Work DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 43 Active learning: boosting the performance via few annotated actions ◦ Rank by score without a classifier ◦ Acquire annotations for top-ranked utterances Robustness test: evaluate unseen action embeddings generated by CDSSM ◦ Unseen: send_email ◦ Observed: send_message, find_email
  • 44. Q & A  DETECTING ACTIONABLE ITEMS IN MEETINGS BY CONVOLUTIONAL DEEP STRUCTURED SEMANTIC MODELS 44 THANKS FOR YOUR ATTENTIONS!