• Duration: 6 hrs
• Level: Intermediate to Advanced
• Objective: For each topic, we will dig into the concepts and
math to build a theoretical understanding, followed by code
(Jupyter notebooks) to understand the implementation details.
Outline/Time Map - 4 Modules
Module 1 (30 mins)
• Introduction to Text Representation (5 mins)
• Old ways of representing text (20 mins)
• Bag-of-Words
• TF–IDF
• Co-occurrence matrix + SVD
• Pros and Cons
• Introduction to Embedding spaces (5 mins)
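As a preview of the notebook style promised above, here is a minimal sketch of the two classical representations in Module 1 — Bag-of-Words counts and TF–IDF weighting — on a toy two-document corpus (the corpus and variable names are our own, not from the tutorial):

```python
import math
from collections import Counter

docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
]

# Bag-of-Words: each document becomes a vector of raw term counts
# over a shared, sorted vocabulary.
vocab = sorted({w for d in docs for w in d.split()})
bow = [[Counter(d.split())[w] for w in vocab] for d in docs]

# TF-IDF: term frequency scaled by log inverse document frequency,
# so words appearing in every document get zero weight.
df = Counter(w for d in docs for w in set(d.split()))

def tfidf(words, n_docs):
    counts = Counter(words)
    total = len(words)
    return {w: (c / total) * math.log(n_docs / df[w]) for w, c in counts.items()}

weights = [tfidf(d.split(), len(docs)) for d in docs]
```

Note that "the", "sat", and "on" occur in both documents, so their IDF (and hence TF-IDF weight) is zero, while "cat" and "mat" keep positive weight in the first document.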
Module 2 (165 mins)
• Word-Vectors
• Introduction + Bigram model (25 mins)
• CBOW model (25 mins)
• Skip-gram model (25 mins)
[Efficient estimation of word representations in vector space. Mikolov et al.,
ICLR Workshop, 2013]
• Speed-Up (20 mins)
• Negative Sampling
• Hierarchical Softmax
[Distributed representations of words and phrases and their compositionality.
Mikolov et al., NIPS, 2013]
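Before the GloVe part of Module 2, a minimal sketch (toy sentence, hypothetical helper names) of two ideas covered above: extracting Skip-gram (center, context) training pairs from a sliding window, and drawing noise words for negative sampling instead of computing a full softmax:

```python
import random

corpus = "the quick brown fox jumps over the lazy dog".split()
vocab = sorted(set(corpus))
window = 2

# Skip-gram pairs: each center word predicts every word within the window.
pairs = []
for i, center in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if j != i:
            pairs.append((center, corpus[j]))

# Negative sampling: for one true (center, context) pair, draw k random
# "noise" words to serve as negative examples.  Real word2vec samples from
# a unigram distribution raised to the 3/4 power; uniform sampling here
# keeps the sketch short.
def negatives(true_context, k, rng):
    out = []
    while len(out) < k:
        w = rng.choice(vocab)
        if w != true_context:
            out.append(w)
    return out

rng = random.Random(0)
sample = negatives(pairs[0][1], k=2, rng=rng)
```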
• Word-Vectors (contd)
• GloVe model (30 mins)
[GloVe: Global Vectors for Word Representation. Pennington et al., EMNLP
2014]
• t-SNE (15 mins)
[Visualizing Data using t-SNE. van der Maaten and Hinton, JMLR 2008;
How to Use t-SNE Effectively – Distill]
• Pros and Cons of using pre-trained word vectors (5 mins)
• Q & A (20 mins)
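To close out Module 2, a sketch of the core of the GloVe objective from the paper cited above: the weighting function f(x) with the paper's defaults (x_max = 100, α = 3/4), and the weighted least-squares loss for a single word pair. The function names and toy inputs are our own:

```python
import math

def glove_weight(x, x_max=100.0, alpha=0.75):
    # f(x) from the GloVe paper: down-weights rare co-occurrence
    # counts and caps the weight at 1 for frequent ones.
    return (x / x_max) ** alpha if x < x_max else 1.0

def pair_loss(w_i, w_j, b_i, b_j, x_ij):
    # One term of the GloVe objective:
    #   f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2
    dot = sum(a * b for a, b in zip(w_i, w_j))
    return glove_weight(x_ij) * (dot + b_i + b_j - math.log(x_ij)) ** 2
```

With orthogonal vectors, zero biases, and a co-occurrence count of 1, the inner term is exactly log 1 = 0, so the loss for that pair vanishes.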
Module 3 (70 mins)
• Sentence2vec/Paragraph2vec/Doc2Vec
• Introduction (5 mins)
• PV-DM model (35 mins)
• PV-DBOW model
[Distributed representations of sentences and documents. Le and Mikolov, ICML,
2014]
• Skip-Thoughts model (20 mins)
[Skip-Thought Vectors. Kiros et al., arXiv preprint 2015]
• Pros and Cons (10 mins)
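A sketch of the data preparation behind PV-DM: the paragraph (document) id joins a window of preceding context words, and together they predict the next word. The toy documents and the helper name `pv_dm_examples` are ours, for illustration only:

```python
# PV-DM training examples: (doc id, context window) -> target word.
# The doc id stands in for the paragraph vector that is trained jointly
# with the word vectors.
docs = {
    "d1": "the cat sat on the mat".split(),
    "d2": "dogs chase cats in the yard".split(),
}
window = 2

def pv_dm_examples(docs, window):
    examples = []
    for doc_id, words in docs.items():
        for i in range(window, len(words)):
            context = tuple(words[i - window:i])
            examples.append((doc_id, context, words[i]))
    return examples

ex = pv_dm_examples(docs, window)
```

PV-DBOW drops the context words entirely: the paragraph vector alone is trained to predict words sampled from its document.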
Module 4 (90 mins)
• Char2Vec
• Introduction (5 mins)
• Introduction to RNNs, LSTMs (20 mins)
• One-hot encoding (30 mins)
[The Unreasonable Effectiveness of Recurrent Neural Networks. Andrej Karpathy 2015]
• Character Embeddings (20 mins)
[Character-Aware Neural Language Models. Kim et al., AAAI 2016]
• Pros and Cons (5 mins)
• Q & A (10 mins)
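For the Char2Vec module, a minimal sketch of one-hot encoding characters, the input format a char-level RNN like Karpathy's consumes (toy string and variable names are ours):

```python
text = "hello"

# Character vocabulary: every distinct character, in sorted order.
chars = sorted(set(text))
idx = {c: i for i, c in enumerate(chars)}

def one_hot(c):
    # All zeros except a single 1 at the character's vocabulary index.
    v = [0] * len(chars)
    v[idx[c]] = 1
    return v

encoded = [one_hot(c) for c in text]
```

A character-embedding layer (as in the Kim et al. paper above) replaces these sparse vectors with learned dense ones, indexed the same way.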