3. Sentiment analysis
The sentence is tokenized into words, and the sentiment of each word
is scored to determine the overall sentiment of the sentence.
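As a minimal sketch of this word-level approach (the lexicon and its scores below are made-up illustrations, not a real sentiment resource):

```python
# Illustrative word-sentiment lexicon; scores are assumptions for the demo.
LEXICON = {"good": 1, "great": 2, "bad": -1, "terrible": -2}

def sentence_sentiment(sentence):
    """Tokenize into words and sum per-word sentiment scores."""
    words = sentence.lower().split()
    score = sum(LEXICON.get(w, 0) for w in words)  # unknown words score 0
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

A real system would use a curated lexicon (or a learned model) and handle negation, but the tokenize-score-aggregate flow is the same.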
4. Word2Vec
Words with positive sentiment will lie near each other in the vector space.
Word2vec is a two-layer neural net that processes text. Its input is a text
corpus and its output is a set of vectors: feature vectors for the words in
that corpus. While Word2vec is not a deep neural network, it turns text into
a numerical form that deep nets can understand.
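To illustrate "nearby" vectors, here is a toy example with hand-made 3-dimensional vectors (an assumption for the demo; real Word2vec embeddings are learned from a corpus and typically have hundreds of dimensions):

```python
import math

# Hand-made toy "feature vectors" for illustration only.
vectors = {
    "good":  [0.9, 0.8, 0.1],
    "great": [0.8, 0.9, 0.2],
    "bad":   [-0.7, -0.8, 0.1],
}

def cosine(u, v):
    """Cosine similarity: close to 1 for nearby vectors, negative for opposed ones."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)
```

Here `cosine(vectors["good"], vectors["great"])` is high while `cosine(vectors["good"], vectors["bad"])` is negative, matching the claim that words of similar sentiment sit near each other.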
5. Constructing Word2Vec
• https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/tutorials/word2vec/word2vec_basic.py
1. Identify the dataset used to build the Word2vec relations: http://mattmahoney.net/dc/
2. Read the data into a list of strings.
3. Build the dictionary and replace rare words with a unique UNK token.
1. Dictionary – map of words (strings) to their codes (integers)
2. Count – map of words (strings) to their occurrence counts
3. Reverse Dictionary – map of codes (integers) back to words (strings)
4. Generate a training batch for the skip-gram model.
5. Build and train a skip-gram model.
1. Construct the SGD optimizer using a learning rate of 1.0.
2. Compute the cosine similarity between minibatch examples and all embeddings.
6. Begin training
7. Visualize the embeddings
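Step 3 above (building the dictionary and mapping rare words to UNK) can be sketched as follows, in the style of the word2vec_basic.py tutorial linked above:

```python
import collections

def build_dataset(words, vocab_size):
    """Keep the vocab_size-1 most common words; map everything else to 'UNK'."""
    count = [["UNK", -1]]
    count.extend(collections.Counter(words).most_common(vocab_size - 1))
    # Dictionary: word -> integer code (UNK gets code 0).
    dictionary = {word: idx for idx, (word, _) in enumerate(count)}
    data, unk_count = [], 0
    for word in words:
        index = dictionary.get(word, 0)  # rare words fall back to the UNK code
        if index == 0:
            unk_count += 1
        data.append(index)
    count[0][1] = unk_count
    # Reverse dictionary: integer code -> word.
    reverse_dictionary = {idx: word for word, idx in dictionary.items()}
    return data, count, dictionary, reverse_dictionary
```

This yields the three structures from the list: the dictionary, the counts, and the reverse dictionary.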
6. Skip-gram model
The skip-gram model is a neural network that, given a word, predicts the
probability of other words occurring adjacent to it (i.e., in its context window).
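A minimal sketch of how skip-gram (target, context) training pairs are generated from a token sequence; the function name and `window` parameter are illustrative:

```python
def skipgram_pairs(tokens, window):
    """Emit (target, context) pairs: each word predicts its neighbours."""
    pairs = []
    for i, target in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is not its own context
                pairs.append((target, tokens[j]))
    return pairs
```

The network is then trained so that, given the target word, it assigns high probability to the paired context words.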
8. Why is a feed-forward network (FFN) not fit for NLP?
[Figure: two-layer feed-forward network with inputs x1, x2, weights W(1), W(2), and activations z(2), z(3)]
There is no communication between neurons in the same layer,
so the network cannot relate the context of one word to another.
10. LSTM Cell
An LSTM cell has a built-in memory cell whose state can be passed on to other layers of LSTM cells.
Please note that this memory cell state is passed along without applying any filter such as a softmax.
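A scalar sketch of one LSTM step may help here. The gate pre-activations are passed in directly as assumed inputs; a real LSTM computes them from learned weights applied to the input and previous hidden state:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(c_prev, f_pre, i_pre, g_pre, o_pre):
    """One scalar LSTM update; pre-activations are illustrative inputs."""
    f = sigmoid(f_pre)                 # forget gate: how much old memory to keep
    i = sigmoid(i_pre)                 # input gate: how much new content to write
    g = math.tanh(g_pre)               # candidate memory content
    c = f * c_prev + i * g             # memory cell: passed on without filtering
    h = sigmoid(o_pre) * math.tanh(c)  # hidden output: a *filtered* view of c
    return c, h
```

Note the asymmetry: the cell state `c` flows onward unchanged, while the output `h` is gated and squashed.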
16. Stacked LSTM
# TensorFlow 1.x API (tf.contrib was removed in TensorFlow 2).
def lstm_cell():
    return tf.contrib.rnn.BasicLSTMCell(lstm_size)

stacked_lstm = tf.contrib.rnn.MultiRNNCell(
    [lstm_cell() for _ in range(number_of_layers)])

initial_state = state = stacked_lstm.zero_state(batch_size, tf.float32)
for i in range(num_steps):
    # The value of state is updated after processing each batch of words.
    output, state = stacked_lstm(words[:, i], state)
    # The rest of the code.
    # ...
Stacking LSTM layers improves performance, but only up to about 8 layers; beyond that, the gains diminish.