3. Overview
• In task-oriented dialog systems, it is hard to incorporate the knowledge base (KB).
• KB content is difficult to encode into RNN hidden states.
• Attention mechanisms are slow on long sequences.
• Mem2Seq is proposed to address these issues.
• Mem2Seq is a model that combines a pointer network with multi-hop attention.
4. Introduction
• Task-oriented dialog systems are built to carry out specific objectives.
• To do so, it is essential to query the KB while generating responses.
• As of 2018, RNN-based models relying on hidden states have achieved good performance.
• However, problems remain:
  • It is hard to combine KB information with RNN hidden states.
  • Processing long sequences with attention takes too long.
5. Introduction
• MemNN (end-to-end memory network)
  • A recurrent attention model over a large external memory
  • Writes embeddings of the dialog records to the external memory
  • Reads the memory repeatedly with query vectors
• This approach makes it possible to…
  • Remember KB information for longer than before
  • Encode long sequential dialogs quickly
• However…
  • MemNN only selects a response from a predefined candidate pool.
  • It does not generate answers word by word.
6. Model Description
• Mem2Seq
  • Addresses the limitations of MemNN
  • Mem2Seq combines the concept of the pointer network with a multi-hop attention mechanism.
  • Mem2Seq can copy words directly from the dialog history and the KB.
  • Mem2Seq learns to generate dynamic queries for accessing the memory.
7. Model Description
• Mem2Seq (architecture)
  • Composed of a MemNN encoder and a memory decoder
  • The MemNN encoder builds vector representations of the dialog records.
  • The memory decoder generates responses by reading and copying from the memory.
8. Model Description
• Terms & Equations (see the notation sketch after this list)
  • X: the sequence of tokens in the dialog records
  • $: a special sentinel token; when the decoder points at it, the word is generated from the vocabulary instead of being copied from the memory content.
  • B: the knowledge base, stored as tuples
  • U: the concatenation of B and X
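One way to write this notation down, following the definitions in the Mem2Seq paper (the symbols shown on the original slide are not preserved in this transcript, so this is a reconstruction):

  X = \{x_1, \dots, x_n, \$\}  (dialog history tokens plus the sentinel)
  B = \{b_1, \dots, b_l\}      (KB tuples)
  U = [B; X]                   (the memory input, the concatenation of B and X)
  Y = \{y_1, \dots, y_m\}      (the expected system response)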
9. Model Description
• Memory Encoder
  • U is the word-level concatenation of the dialog records and the sentinel token.
  • The memory of the MemNN is represented by a set of trainable embedding matrices C = {C^1, …, C^{K+1}}.
  • C^k maps each token to a vector that is read using the query vectors.
  • The read is repeated for K hops.
  • At each hop k, the model computes attention weights over every memory position (see the sketch after this list).
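A sketch of the attention step described above, following the paper's formulation: at hop k, with query q^k and memory embeddings C_i^k = C^k(x_i),

  p_i^k = \mathrm{Softmax}\left( (q^k)^{\top} C_i^k \right)

so p^k assigns one weight to every memory position i.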
10. Model Description
• Memory Encoder
  • p^k acts as a memory selector: it scores the relevance between each memory entry and the query.
  • The model reads the memory as o^k, the weighted sum over the memory embeddings.
  • The encoder's output after the final hop becomes the input of the Mem2Seq decoder (a minimal code sketch follows this list).
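Below is a minimal sketch of the K-hop memory read described above, written in PyTorch. This is not the authors' released code: the class name MemEncoder is made up, and for simplicity each memory slot holds a single token (the paper sums the embeddings of a token triple per slot).

  import torch
  import torch.nn as nn

  class MemEncoder(nn.Module):
      def __init__(self, vocab_size, emb_dim, hops=3):
          super().__init__()
          self.hops = hops
          # one embedding matrix per hop plus one extra: C^1 ... C^{K+1}
          self.C = nn.ModuleList(
              nn.Embedding(vocab_size, emb_dim) for _ in range(hops + 1))

      def forward(self, memory_tokens, query):
          # memory_tokens: (batch, mem_len) token ids of the stored memory
          # query:         (batch, emb_dim) initial query vector q^1
          q = query
          for k in range(self.hops):
              m_k  = self.C[k](memory_tokens)       # C^k(x_i)
              m_k1 = self.C[k + 1](memory_tokens)   # C^{k+1}(x_i)
              # attention weights p_i^k = softmax(q^k . C^k(x_i))
              p = torch.softmax(torch.einsum('bd,bmd->bm', q, m_k), dim=-1)
              # weighted read o^k = sum_i p_i^k C^{k+1}(x_i)
              o = torch.einsum('bm,bmd->bd', p, m_k1)
              # query update q^{k+1} = q^k + o^k
              q = q + o
          # the final query is handed to the decoder as its initial state
          return q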
11. Model Description
• Memory Decoder
  • Uses both the dialog records and the KB.
  • At every time step t, a GRU receives the previously generated word and the previous query and produces a new query.
  • The initial query h0 is the output of the encoder.
  • At every step, the decoder computes a vocabulary distribution and a distribution over the memory contents.
  • The decoder produces tokens from the memory by pointing to input words (a sketch of one decoder step follows this list).
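Below is a minimal sketch of one decoder step, again in PyTorch and not the authors' code; the class name MemDecoderStep and the single-hop memory read (the paper's decoder also reads the memory over multiple hops) are simplifications for illustration. The vocabulary distribution is computed from the decoder query and the memory readout, while the pointer distribution reuses the attention weights over the memory.

  import torch
  import torch.nn as nn

  class MemDecoderStep(nn.Module):
      def __init__(self, vocab_size, emb_dim):
          super().__init__()
          self.emb = nn.Embedding(vocab_size, emb_dim)
          self.gru = nn.GRUCell(emb_dim, emb_dim)
          self.W1  = nn.Linear(2 * emb_dim, vocab_size)

      def forward(self, prev_word, h, memory_keys, memory_values):
          # prev_word: (batch,) id of the previously generated word y_{t-1}
          # h:         (batch, emb_dim) previous decoder query
          # memory_keys / memory_values: (batch, mem_len, emb_dim) memory embeddings
          h = self.gru(self.emb(prev_word), h)                      # new query h_t
          # pointer distribution over memory positions (dialog words + KB + sentinel)
          p_ptr = torch.softmax(
              torch.einsum('bd,bmd->bm', h, memory_keys), dim=-1)
          o = torch.einsum('bm,bmd->bd', p_ptr, memory_values)      # memory readout
          # vocabulary distribution from the query and the readout
          p_vocab = torch.softmax(self.W1(torch.cat([h, o], dim=-1)), dim=-1)
          return h, p_vocab, p_ptr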
12. Model Description
• Sentinel
  • If the memory does not contain the required word, the memory content distribution points to the sentinel token.
• Memory Content (an illustrative example follows this list)
  • The dialog record is saved to the memory word by word.
  • Speaker and turn (time) tags are added to each token.
  • KB entries are saved as (subject, relation, object) triples.
  • Only the KB relevant to the particular conversation is consulted.
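An illustrative, made-up example of how memory entries could look under this scheme (the token and tag names below are hypothetical, not taken from the paper's datasets):

  # dialog words, each stored together with speaker and turn tags
  dialog_memory = [
      ("hello",      "$u", "t1"),   # user word, turn 1
      ("restaurant", "$u", "t1"),
      ("which",      "$s", "t2"),   # system word, turn 2
  ]
  # KB entries stored as (subject, relation, object) triples
  kb_memory = [
      ("the_place", "address", "803_ave"),
      ("the_place", "cuisine", "british"),
  ]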
14. Analysis and Discussion
• Memory Attention
  • As shown in the figure, the distribution of attention weights is sharp and easy to interpret.
15. Conclusion
• Mem2Seq is a memory-to-sequence model for task-oriented dialog systems in an end-to-end framework.
• Mem2Seq combines the multi-hop attention mechanism of end-to-end memory networks with pointer networks.
• The authors validated the performance of Mem2Seq through experiments.