Chatbot
Jin Zhang, Yang Zhou, Fred Qin, Liam Bui
Simon Fraser University
1. MODEL CONFIGURATIONS
Network Architecture (sketched below):
• Model Type: Basic, Tied, and Attention Seq2Seq
• No. of units in LSTM: 128, 256, 512, 1024
• No. of stacked LSTM layers: 1, 2, 3
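The poster does not name an implementation framework; the following is a minimal PyTorch sketch of the basic encoder-decoder configuration, with the benchmarked hyperparameters (units per LSTM, number of stacked layers, optional tied weights) exposed as arguments. The class and variable names are illustrative assumptions, not the authors' code.

import torch
import torch.nn as nn

class Seq2SeqChatbot(nn.Module):
    """Basic Seq2Seq: an LSTM encoder reads the input sentence, an LSTM decoder
    predicts the output sentence word by word (illustrative sketch)."""
    def __init__(self, vocab_size=8000, units=512, layers=1, tied=False):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, units, padding_idx=0)
        self.encoder = nn.LSTM(units, units, num_layers=layers, batch_first=True)
        self.decoder = nn.LSTM(units, units, num_layers=layers, batch_first=True)
        self.out = nn.Linear(units, vocab_size)
        if tied:  # "Tied" variant: share input embedding and output projection weights
            self.out.weight = self.embed.weight

    def forward(self, src_ids, tgt_ids):
        # Encode the padded 20-token input; the final LSTM state summarizes it.
        _, state = self.encoder(self.embed(src_ids))
        # Decode with teacher forcing on the shifted target sentence.
        dec_out, _ = self.decoder(self.embed(tgt_ids), state)
        return self.out(dec_out)  # logits over the 8,000-word vocabulary

The Attention variant additionally lets the decoder attend over all encoder outputs at every step; a sketch of that scoring function appears under Approaches below.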
PROBLEM
Chatbot:
• Capture an input sentence from the user and perform text processing
• Apply and benchmark Sequence-to-Sequence (Seq2Seq) model variations to predict the output sentence (a decoding sketch follows this list)
• Display the best prediction via a chatbot web interface.
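A minimal sketch of the prediction step, assuming greedy word-by-word decoding and a start-of-sentence token "<go>" (neither the decoding strategy nor such a token is stated on the poster); respond() reuses the Seq2SeqChatbot sketch above.

import torch

def respond(model, sentence, word2id, id2word, max_len=20):
    """Text-process one user sentence and predict a reply with the trained model."""
    ids = [word2id.get(w, word2id["unk"]) for w in sentence.lower().split()]
    ids = (ids + [word2id["<pad>"]] * max_len)[:max_len]         # pad/truncate to 20 tokens
    src = torch.tensor([ids])
    reply = []
    with torch.no_grad():
        _, state = model.encoder(model.embed(src))                # encode the input sentence
        token = torch.tensor([[word2id["<go>"]]])                 # assumed start token
        for _ in range(max_len):                                  # greedy decoding
            dec_out, state = model.decoder(model.embed(token), state)
            token = model.out(dec_out[:, -1]).argmax(-1, keepdim=True)
            word = id2word[token.item()]
            if word == "<eos>":                                   # stop at end-of-sentence
                break
            reply.append(word)
    return " ".join(reply)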
Approaches: Sequence-to-Sequence learning
• Neural Machine Translation by Jointly Learning to Align and Translate: https://arxiv.org/abs/1409.0473 (the additive attention it introduces is sketched below)
• Evaluating Language Model: https://courses.engr.illinois.edu/cs498jh/Slides/Lecture04.pdf
• Sequence to Sequence Learning with Neural Networks: https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf
• A Neural Conversational Model: https://arxiv.org/abs/1506.05869
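The Attention variant follows the additive attention introduced in the first reference above; the sketch below shows that scoring function in PyTorch, as one illustrative implementation rather than the authors' exact setup.

import torch
import torch.nn as nn

class AdditiveAttention(nn.Module):
    """Bahdanau-style additive attention over the encoder outputs."""
    def __init__(self, units):
        super().__init__()
        self.W = nn.Linear(units, units, bias=False)   # projects the previous decoder state
        self.U = nn.Linear(units, units, bias=False)   # projects each encoder output
        self.v = nn.Linear(units, 1, bias=False)

    def forward(self, dec_state, enc_outputs):
        # dec_state: (batch, units); enc_outputs: (batch, src_len, units)
        scores = self.v(torch.tanh(self.W(dec_state).unsqueeze(1) + self.U(enc_outputs)))
        weights = torch.softmax(scores, dim=1)          # alignment over source positions
        context = (weights * enc_outputs).sum(dim=1)    # weighted sum fed to the decoder
        return context, weights.squeeze(-1)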
DATASET
Data Sources:
• Subtitle dataset: Cornell’s movie subtitles dataset with about 140,000 input-output sentence pairs.
• Twitter dataset: Twitter conversation dataset with about 130,000 input-output sentence pairs.
Data Processing (sketched below):
• Vocabulary size: 8,000 (rare words replaced with “unk”)
• Input sentence length: 20 tokens (“<pad>” padding is used)
• Output sentence length: 20 tokens (“<pad>” padding is used)
• End-of-sentence token “<eos>” marks the end of each sentence
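A minimal preprocessing sketch following the settings listed above; the tokenization (lower-cased whitespace splitting) and the "<go>" start token are assumptions not stated on the poster.

from collections import Counter

VOCAB_SIZE, MAX_LEN = 8000, 20

def build_vocab(sentences):
    counts = Counter(w for s in sentences for w in s.lower().split())
    # Keep the most frequent words plus the special tokens; everything else maps to "unk".
    words = ["<pad>", "<go>", "<eos>", "unk"] + [w for w, _ in counts.most_common(VOCAB_SIZE - 4)]
    return {w: i for i, w in enumerate(words)}

def encode(sentence, word2id):
    ids = [word2id.get(w, word2id["unk"]) for w in sentence.lower().split()]
    ids = ids[:MAX_LEN - 1] + [word2id["<eos>"]]                # truncate, append end-of-sentence
    return ids + [word2id["<pad>"]] * (MAX_LEN - len(ids))      # pad to a fixed length of 20

For example, encode("happy birthday", vocab) yields 20 ids: the two word ids, "<eos>", and 17 "<pad>" ids.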
DEMO
Chatbot: Hi, how are you?
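The poster does not say how the web interface is built; as one hedged possibility, a minimal Flask endpoint could wrap the respond() sketch from the PROBLEM section.

from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/chat", methods=["POST"])
def chat():
    message = request.json["message"]                  # sentence typed by the user
    reply = respond(model, message, word2id, id2word)  # model/vocab from the earlier sketches
    return jsonify({"reply": reply})

if __name__ == "__main__":
    app.run(port=5000)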
CONCLUSIONS
• Tied weights have little effect on the Seq2Seq chatbot model.
• Experiments with the attention mechanism are inconclusive.
• Adding more units improves the model’s learning capacity.
• Adding more stacked layers does not improve the model, suggesting overfitting.
• The model learns grammar well, but its capacity to learn semantic meaning appears limited on small datasets.
• However, it does show some potential to learn context; experiments on a larger dataset could improve the model.
LIMITATIONS
• Due to computing resource constraints, we could not train the Seq2Seq model on larger datasets or with larger networks.
• The model currently has no mechanism for maintaining conversational history (i.e., past sentences).
OUTPUT EXAMPLES
Good:
• Input: happy birthday
• True Output: thank you
• Predicted Output: thank you
• Input: i love you so much
• True Output: i love you more
• Predicted Output: i love you too
Network Training (sketched below):
• Loss function: cross-entropy loss over the output sentence
• Optimizer: Adam
• Learning rate: 0.001
• Batch size: 64 (1,000+ batches per epoch)
• No. of epochs: 20
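A minimal training-loop sketch matching the settings above, again assuming PyTorch, the Seq2SeqChatbot sketch from MODEL CONFIGURATIONS, and a data loader that yields shifted decoder inputs and targets (an assumption about how teacher forcing is set up).

import torch
import torch.nn as nn

def train(model, loader, epochs=20, lr=0.001, pad_id=0):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    # Cross-entropy over every non-padding token of the output sentence.
    loss_fn = nn.CrossEntropyLoss(ignore_index=pad_id)
    for epoch in range(epochs):
        for src, tgt_in, tgt_out in loader:            # batches of 64 sentence pairs
            logits = model(src, tgt_in)                # (batch, 20, vocab_size)
            loss = loss_fn(logits.reshape(-1, logits.size(-1)), tgt_out.reshape(-1))
            opt.zero_grad()
            loss.backward()
            opt.step()
        print(f"epoch {epoch + 1}: training loss {loss.item():.3f}")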
EXPERIMENT RESULTS
Validation loss by model type (Seq2Seq, 1 layer, 128 units):
• Subtitle: Tied 2.192, Basic 2.19, Attention 2.15
• Twitter: Tied 2.45, Basic 2.46, Attention 2.47
Validation loss by no. of units in LSTM (Basic Seq2Seq, 1 layer):
• Subtitle: 128 units 2.19, 256 units 2.16, 512 units 2.14, 1024 units 2.11
• Twitter: 128 units 2.46, 256 units 2.42, 512 units 2.39, 1024 units 2.37
Validation loss by no. of stacked LSTM layers (Basic Seq2Seq, 512 units):
• Subtitle: 1 layer 2.14, 2 layers 2.15, 3 layers 2.16
• Twitter: 1 layer 2.39, 2 layers 2.41, 3 layers 2.43
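One hedged way to read these numbers, assuming the reported validation loss is the average per-token cross-entropy in nats (the poster does not state how the sentence loss is aggregated): the language-model evaluation notes cited under Approaches relate such a loss to perplexity by perplexity = exp(loss).

import math

print(math.exp(2.14))   # ≈ 8.5, best subtitle configuration above
print(math.exp(2.37))   # ≈ 10.7, best Twitter configuration above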
Average:
• Input: what do u like most about being part of this show?
• True Output: being part of the show
• Predicted Output: the best of the world
• Input: anyone know how to web design or know someone who can?
• True Output: we are experts in services in our portfolio email to leadscom
• Predicted Output: we can help you in design of our website in reasonable price visit our portfolio
Bad:
• Input: let me know if youre down for this
• True Output: very down lets do it
• Predicted Output: i think you are a good one
• Input: am i allowed to write bernie name on the ballot amp vote?
• True Output: in the name of democracy yes
• Predicted Output: you are a racist