SlideShare une entreprise Scribd logo
1  sur  87
Télécharger pour lire hors ligne
Towards String-to-Tree
Neural Machine Translation
Roee Aharoni & Yoav Goldberg
NLP Lab, Bar Ilan University
ACL 2017
NMT is all the rage!
NMT is all the rage!
NMT is all the rage!
• Driving the current state-of-the-art (Sennrich et al., 2016)
NMT is all the rage!
• Driving the current state-of-the-art (Sennrich et al., 2016)
• Widely adopted by the industry
Seq2Seq with Attention
Bahdanau et al. (2015)
Seq2Seq with Attention
f = argmax
f0
p(f0
|e)
Bahdanau et al. (2015)
Syntax was all the rage!
Syntax was all the rage!
• The “previous” state-of-the-
art was syntax-based SMT
From Rico Sennrich, “NMT: Breaking
the Performance Plateau”, 2016
From Williams, Sennrich, Post & Koehn (2016),
“Syntax-based Statistical Machine Translation”
Syntax was all the rage!
• The “previous” state-of-the-
art was syntax-based SMT
• i.e. systems that used
linguistic information (usually
represented as parse trees)
Syntax was all the rage!
• The “previous” state-of-the-
art was syntax-based SMT
• i.e. systems that used
linguistic information (usually
represented as parse trees)
• “Beaten” by NMT in 2016
From Rico Sennrich, “NMT: Breaking
the Performance Plateau”, 2016
Syntax was all the rage!
• The “previous” state-of-the-
art was syntax-based SMT
• i.e. systems that used
linguistic information (usually
represented as parse trees)
• “Beaten” by NMT in 2016
• Can we bring the benefits
of syntax into the recent
neural systems? From Rico Sennrich, “NMT: Breaking
the Performance Plateau”, 2016
Syntax: Constituency Structure
Syntax: Constituency Structure
• A Constituency (a.k.a Phrase-Structure) grammar defines a set of
rewrite rules which describe the structure of the language.
Syntax: Constituency Structure
• A Constituency (a.k.a Phrase-Structure) grammar defines a set of
rewrite rules which describe the structure of the language.
• Groups words into larger units (constituents)
Syntax: Constituency Structure
• A Constituency (a.k.a Phrase-Structure) grammar defines a set of
rewrite rules which describe the structure of the language.
• Groups words into larger units (constituents)
• Defines a hierarchy between constituents
Syntax: Constituency Structure
• A Constituency (a.k.a Phrase-Structure) grammar defines a set of
rewrite rules which describe the structure of the language.
• Groups words into larger units (constituents)
• Defines a hierarchy between constituents
• Draws relations between different constituents (words, phrases,
clauses…)
Why Syntax Can Help MT?
• Hints as to which word sequences belong together
Why Syntax Can Help MT?
• Hints as to which word sequences belong together
• Helps in producing well structured sentences
Why Syntax Can Help MT?
• Hints as to which word sequences belong together
• Helps in producing well structured sentences
• Allows informed reordering decisions according to the
syntactic structure
Why Syntax Can Help MT?
• Hints as to which word sequences belong together
• Helps in producing well structured sentences
• Allows informed reordering decisions according to the
syntactic structure
• Encourages long-distance dependencies when
selecting translations
Why Syntax Can Help MT?
String-to-Tree Translation
source
target
Our Approach: String-to-Tree NMT
Our Approach: String-to-Tree NMT
source
Our Approach: String-to-Tree NMT
source target
Our Approach: String-to-Tree NMT
• Main idea: translate a source sentence into a linearized tree of the
target sentence
source target
Our Approach: String-to-Tree NMT
• Main idea: translate a source sentence into a linearized tree of the
target sentence
• Inspired by works on RNN-based syntactic parsing (Vinyals et. al, 

2015, Choe & Charniak, 2016)
source target
Our Approach: String-to-Tree NMT
• Main idea: translate a source sentence into a linearized tree of the
target sentence
• Inspired by works on RNN-based syntactic parsing (Vinyals et. al, 

2015, Choe & Charniak, 2016)
• Allows using the seq2seq framework as-is
source target
Experimental Details
• We used the Nematus toolkit (Sennrich et al. 2017)

• Joint BPE segmentation (Sennrich et al. 2016)
• For training, we parse the target side using the BLLIP parser
(McClosky, Charniak and Johnson, 2006)
• Requires some care about making BPE, Tokenization and Parser
work together
Experiments - Large Scale
Experiments - Large Scale
• German to English, 4.5 million parallel training sentences from WMT16
Experiments - Large Scale
• German to English, 4.5 million parallel training sentences from WMT16
• Train two NMT models using the same setup (same settings as the SOTA neural
system in WMT16)
Experiments - Large Scale
• German to English, 4.5 million parallel training sentences from WMT16
• Train two NMT models using the same setup (same settings as the SOTA neural
system in WMT16)
• syntax-aware (bpe2tree)
Experiments - Large Scale
• German to English, 4.5 million parallel training sentences from WMT16
• Train two NMT models using the same setup (same settings as the SOTA neural
system in WMT16)
• syntax-aware (bpe2tree)
• syntax-agnostic baseline (bpe2bpe)
Experiments - Large Scale
• German to English, 4.5 million parallel training sentences from WMT16
• Train two NMT models using the same setup (same settings as the SOTA neural
system in WMT16)
• syntax-aware (bpe2tree)
• syntax-agnostic baseline (bpe2bpe)
• The syntax-aware model performs better in terms of BLEU
Single
Model
5 Model
Ensemble
Experiments - Low Resource
• German/Russian/Czech to
English - 180k-140k parallel
training sentences (News
Commentary v8)
• The syntax-aware model
performs better in terms of
BLEU in all cases (12
comparisons)
• Up to 2+ BLEU improvement
Looking Beyond BLEU
Accurate Trees
• 99% of the predicted trees in the development set had valid bracketing
• Eye-balling the predicted trees found them well-formed and following
the syntax of English.
Where Syntax Helps? Alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produced
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produced
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produced
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produces
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produces
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produces
more sensible
alignments
Where Syntax Helps? Alignments
• The attention based
model induces soft
alignments between
the source and the
target
• The syntax-aware
model produces
more sensible
alignments
Attending to Source Syntax
Attending to Source Syntax
• We inspected the attention weights
during the production of the tree’s
opening brackets
Attending to Source Syntax
• We inspected the attention weights
during the production of the tree’s
opening brackets
• The model consistently attends to
the main verb (“hatte") or to
structural markers (question marks,
hyphens…) in the source sentence
Attending to Source Syntax
• We inspected the attention weights
during the production of the tree’s
opening brackets
• The model consistently attends to
the main verb (“hatte") or to
structural markers (question marks,
hyphens…) in the source sentence
• Indicates the system implicitly
learns source syntax to some
extent (Shi, Padhi and Knight, 2016)
and possibly plans the decoding
accordingly
Where Syntax Helps? Structure
Where Syntax Helps? Structure
Where Syntax Helps? Structure
Structure (I) - Reordering
Structure (I) - Reordering
• German to English translation
requires a significant amount of
reordering during translation
Structure (I) - Reordering
• German to English translation
requires a significant amount of
reordering during translation
• Quantifying reordering shows
that the syntax-aware system
performs more reordering
during the training process
Structure (I) - Reordering
Structure (I) - Reordering
• We would like to interpret the increased reordering from a syntactic
perspective
Structure (I) - Reordering
• We would like to interpret the increased reordering from a syntactic
perspective
• We extract GHKM rules (Galley et al., 2004) from the dev set using the
predicted trees and attention-induced alignments
Structure (I) - Reordering
• We would like to interpret the increased reordering from a syntactic
perspective
• We extract GHKM rules (Galley et al., 2004) from the dev set using the
predicted trees and attention-induced alignments
• The most common rules reveal linguistically sensible transformations, like
moving the verb from the end of a German constituent to the beginning of
the matching English one
German English:
Structure (I) - Reordering
• We would like to interpret the increased reordering from a syntactic
perspective
• We extract GHKM rules (Galley et al., 2004) from the dev set using the
predicted trees and attention-induced alignments
• The most common rules reveal linguistically sensible transformations, like
moving the verb from the end of a German constituent to the beginning of
the matching English one
• More examples in the paper
German English:
Structure (II) - Relative Constructions
Structure (II) - Relative Constructions
• A common linguistic structure is relative constructions, i.e. “The XXX which
YYY”, “A XXX whose YYY”…
Structure (II) - Relative Constructions
• A common linguistic structure is relative constructions, i.e. “The XXX which
YYY”, “A XXX whose YYY”…
• The words that connect the clauses in such constructions are called relative
pronouns, i.e. “who”, “which”, “whom”…
Structure (II) - Relative Constructions
• A common linguistic structure is relative constructions, i.e. “The XXX which
YYY”, “A XXX whose YYY”…
• The words that connect the clauses in such constructions are called relative
pronouns, i.e. “who”, “which”, “whom”…
• The syntax-aware system produced more relative pronouns due to the
syntactic context
Structure (II) - Relative Constructions
Source:
“Guangzhou, das in Deutschland auch Kanton genannt wird…”
Structure (II) - Relative Constructions
Source:
“Guangzhou, das in Deutschland auch Kanton genannt wird…”
Reference:
“Guangzhou, which is also known as Canton in Germany…”
Structure (II) - Relative Constructions
Syntax-Agnostic:
“Guangzhou, also known in Germany, is one of…
Source:
“Guangzhou, das in Deutschland auch Kanton genannt wird…”
Reference:
“Guangzhou, which is also known as Canton in Germany…”
Structure (II) - Relative Constructions
Syntax-Agnostic:
“Guangzhou, also known in Germany, is one of…
Syntax-Based:
“Guangzhou, which is also known as the canton in Germany,…”
Source:
“Guangzhou, das in Deutschland auch Kanton genannt wird…”
Reference:
“Guangzhou, which is also known as Canton in Germany…”
Structure (II) - Relative Constructions
Source:
“Zugleich droht der stark von internationalen Firmen abhängigen
Region ein Imageschaden…”
Structure (II) - Relative Constructions
Reference:
“At the same time, the image of the region, which is heavily reliant
on international companies…”
Source:
“Zugleich droht der stark von internationalen Firmen abhängigen
Region ein Imageschaden…”
Structure (II) - Relative Constructions
Syntax-Agnostic:
“At the same time, the region's heavily dependent region…”
Reference:
“At the same time, the image of the region, which is heavily reliant
on international companies…”
Source:
“Zugleich droht der stark von internationalen Firmen abhängigen
Region ein Imageschaden…”
Structure (II) - Relative Constructions
Syntax-Agnostic:
“At the same time, the region's heavily dependent region…”
Syntax-Based:
“At the same time, the region, which is heavily dependent on
international firms…”
Reference:
“At the same time, the image of the region, which is heavily reliant
on international companies…”
Source:
“Zugleich droht der stark von internationalen Firmen abhängigen
Region ein Imageschaden…”
Structure (II) - Relative Constructions
Human Evaluation
Human Evaluation
• We performed a small-scale human-evaluation using mechanical
turk on the first 500 sentences in newstest 2015
Human Evaluation
• We performed a small-scale human-evaluation using mechanical
turk on the first 500 sentences in newstest 2015
• Two turkers per sentence
Human Evaluation
• We performed a small-scale human-evaluation using mechanical
turk on the first 500 sentences in newstest 2015
• Two turkers per sentence
• The syntax-aware translations had an advantage over the baseline
0
95
190
2bpe better neutral 2tree better
Conclusions
Conclusions
• Neural machine translation can clearly benefit from target-side
syntax 



Other recent work include:
Conclusions
• Neural machine translation can clearly benefit from target-side
syntax 



Other recent work include:
• Eriguchi et al., 2017, Wu et al., 2017 (Dependency)
Conclusions
• Neural machine translation can clearly benefit from target-side
syntax 



Other recent work include:
• Eriguchi et al., 2017, Wu et al., 2017 (Dependency)
• Nadejde et al., 2017 (CCG)
Conclusions
• Neural machine translation can clearly benefit from target-side
syntax 



Other recent work include:
• Eriguchi et al., 2017, Wu et al., 2017 (Dependency)
• Nadejde et al., 2017 (CCG)
• A general approach - can be easily incorporated into other neural
language generation tasks like summarization, image caption
generation…
Conclusions
• Neural machine translation can clearly benefit from target-side
syntax 



Other recent work include:
• Eriguchi et al., 2017, Wu et al., 2017 (Dependency)
• Nadejde et al., 2017 (CCG)
• A general approach - can be easily incorporated into other neural
language generation tasks like summarization, image caption
generation…
• Larger picture: don’t throw away your linguistics! Neural systems
can also leverage symbolic linguistic information
Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation

Contenu connexe

En vedette

Cs231n 2017 lecture10 Recurrent Neural Networks
Cs231n 2017 lecture10 Recurrent Neural NetworksCs231n 2017 lecture10 Recurrent Neural Networks
Cs231n 2017 lecture10 Recurrent Neural NetworksYanbin Kong
 
How we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHow we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHenning Spjelkavik
 
Cs231n 2017 lecture12 Visualizing and Understanding
Cs231n 2017 lecture12 Visualizing and UnderstandingCs231n 2017 lecture12 Visualizing and Understanding
Cs231n 2017 lecture12 Visualizing and UnderstandingYanbin Kong
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRoelof Pieters
 
Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Roelof Pieters
 
iPhone5c的最后猜测
iPhone5c的最后猜测iPhone5c的最后猜测
iPhone5c的最后猜测Yanbin Kong
 
Philosophy of Deep Learning
Philosophy of Deep LearningPhilosophy of Deep Learning
Philosophy of Deep LearningMelanie Swan
 
Hackathon 2014 NLP Hack
Hackathon 2014 NLP HackHackathon 2014 NLP Hack
Hackathon 2014 NLP HackRoelof Pieters
 
Vectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchVectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchBhaskar Mitra
 
Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Jaemin Cho
 
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
State of Blockchain 2017:  Smartnetworks and the Blockchain EconomyState of Blockchain 2017:  Smartnetworks and the Blockchain Economy
State of Blockchain 2017: Smartnetworks and the Blockchain EconomyMelanie Swan
 
Blockchain Economic Theory
Blockchain Economic TheoryBlockchain Economic Theory
Blockchain Economic TheoryMelanie Swan
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document RankingBhaskar Mitra
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 

En vedette (20)

Cs231n 2017 lecture10 Recurrent Neural Networks
Cs231n 2017 lecture10 Recurrent Neural NetworksCs231n 2017 lecture10 Recurrent Neural Networks
Cs231n 2017 lecture10 Recurrent Neural Networks
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
How we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.noHow we sleep well at night using Hystrix at Finn.no
How we sleep well at night using Hystrix at Finn.no
 
Cs231n 2017 lecture12 Visualizing and Understanding
Cs231n 2017 lecture12 Visualizing and UnderstandingCs231n 2017 lecture12 Visualizing and Understanding
Cs231n 2017 lecture12 Visualizing and Understanding
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and Graphs
 
Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!
 
iPhone5c的最后猜测
iPhone5c的最后猜测iPhone5c的最后猜测
iPhone5c的最后猜测
 
Philosophy of Deep Learning
Philosophy of Deep LearningPhilosophy of Deep Learning
Philosophy of Deep Learning
 
Hackathon 2014 NLP Hack
Hackathon 2014 NLP HackHackathon 2014 NLP Hack
Hackathon 2014 NLP Hack
 
Vectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for SearchVectorland: Brief Notes from Using Text Embeddings for Search
Vectorland: Brief Notes from Using Text Embeddings for Search
 
Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)Deep Learning for Chatbot (1/4)
Deep Learning for Chatbot (1/4)
 
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
State of Blockchain 2017:  Smartnetworks and the Blockchain EconomyState of Blockchain 2017:  Smartnetworks and the Blockchain Economy
State of Blockchain 2017: Smartnetworks and the Blockchain Economy
 
Blockchain Economic Theory
Blockchain Economic TheoryBlockchain Economic Theory
Blockchain Economic Theory
 
Neural Models for Document Ranking
Neural Models for Document RankingNeural Models for Document Ranking
Neural Models for Document Ranking
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 

Similaire à Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation

Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translationkhyati gupta
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translationkhyati gupta
 
What might a spoken corpus tell us about language
What might a spoken corpus tell us about languageWhat might a spoken corpus tell us about language
What might a spoken corpus tell us about languageUCLDH
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...RajkiranVeluri
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Saurabh Kaushik
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningLena Shakurova
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...QuantInsti
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationGennadi Lembersky
 
Natural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyNatural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyAkshayaNagarajan10
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categoriesWarNik Chow
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...Dr. Haxel Consult
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Lifeng (Aaron) Han
 
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...Lifeng (Aaron) Han
 
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...Alp Öktem
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT Lifeng (Aaron) Han
 

Similaire à Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation (20)

Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
project present
project presentproject present
project present
 
What might a spoken corpus tell us about language
What might a spoken corpus tell us about languageWhat might a spoken corpus tell us about language
What might a spoken corpus tell us about language
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 
AINL 2016: Nikolenko
AINL 2016: NikolenkoAINL 2016: Nikolenko
AINL 2016: Nikolenko
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learning
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine Translation
 
Natural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyNatural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A Survey
 
Natural Language Processing using Java
Natural Language Processing using JavaNatural Language Processing using Java
Natural Language Processing using Java
 
2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories2010 PACLIC - pay attention to categories
2010 PACLIC - pay attention to categories
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
 
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
pptphrase-tagset-mapping-for-french-and-english-treebanks-and-its-application...
 
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation...
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 

Plus de Association for Computational Linguistics

Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Association for Computational Linguistics
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Association for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsAssociation for Computational Linguistics
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Association for Computational Linguistics
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Association for Computational Linguistics
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian TranslationAssociation for Computational Linguistics
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemAssociation for Computational Linguistics
 

Plus de Association for Computational Linguistics (20)

Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal TextMuis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
Muis - 2016 - Weak Semi-Markov CRFs for NP Chunking in Informal Text
 
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
Castro - 2018 - A High Coverage Method for Automatic False Friends Detection ...
 
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour AnalysisCastro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
 
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
Muthu Kumar Chandrasekaran - 2018 - Countering Position Bias in Instructor In...
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text SimplificationElior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
Elior Sulem - 2018 - Semantic Structural Evaluation for Text Simplification
 
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future DirectionsDaniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
Daniel Gildea - 2018 - The ACL Anthology: Current State and Future Directions
 
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
Wenqiang Lei - 2018 - Sequicity: Simplifying Task-oriented Dialogue Systems w...
 
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
Matthew Marge - 2017 - Exploring Variation of Natural Human Commands to a Rob...
 
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
 

Dernier

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdfssuserdda66b
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 

Dernier (20)

Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 

Roee Aharoni - 2017 - Towards String-to-Tree Neural Machine Translation

  • 1. Towards String-to-Tree Neural Machine Translation Roee Aharoni & Yoav Goldberg NLP Lab, Bar Ilan University ACL 2017
  • 2. NMT is all the rage!
  • 3. NMT is all the rage!
  • 4. NMT is all the rage! • Driving the current state-of-the-art (Sennrich et al., 2016)
  • 5. NMT is all the rage! • Driving the current state-of-the-art (Sennrich et al., 2016) • Widely adopted by the industry
  • 7. Seq2Seq with Attention f = argmax f0 p(f0 |e) Bahdanau et al. (2015)
  • 8. Syntax was all the rage!
  • 9. Syntax was all the rage! • The “previous” state-of-the- art was syntax-based SMT From Rico Sennrich, “NMT: Breaking the Performance Plateau”, 2016
  • 10. From Williams, Sennrich, Post & Koehn (2016), “Syntax-based Statistical Machine Translation” Syntax was all the rage! • The “previous” state-of-the- art was syntax-based SMT • i.e. systems that used linguistic information (usually represented as parse trees)
  • 11. Syntax was all the rage! • The “previous” state-of-the- art was syntax-based SMT • i.e. systems that used linguistic information (usually represented as parse trees) • “Beaten” by NMT in 2016 From Rico Sennrich, “NMT: Breaking the Performance Plateau”, 2016
  • 12. Syntax was all the rage! • The “previous” state-of-the- art was syntax-based SMT • i.e. systems that used linguistic information (usually represented as parse trees) • “Beaten” by NMT in 2016 • Can we bring the benefits of syntax into the recent neural systems? From Rico Sennrich, “NMT: Breaking the Performance Plateau”, 2016
  • 14. Syntax: Constituency Structure • A Constituency (a.k.a Phrase-Structure) grammar defines a set of rewrite rules which describe the structure of the language.
  • 15. Syntax: Constituency Structure • A Constituency (a.k.a Phrase-Structure) grammar defines a set of rewrite rules which describe the structure of the language. • Groups words into larger units (constituents)
  • 16. Syntax: Constituency Structure • A Constituency (a.k.a Phrase-Structure) grammar defines a set of rewrite rules which describe the structure of the language. • Groups words into larger units (constituents) • Defines a hierarchy between constituents
  • 17. Syntax: Constituency Structure • A Constituency (a.k.a Phrase-Structure) grammar defines a set of rewrite rules which describe the structure of the language. • Groups words into larger units (constituents) • Defines a hierarchy between constituents • Draws relations between different constituents (words, phrases, clauses…)
  • 18. Why Syntax Can Help MT?
  • 19. • Hints as to which word sequences belong together Why Syntax Can Help MT?
  • 20. • Hints as to which word sequences belong together • Helps in producing well structured sentences Why Syntax Can Help MT?
  • 21. • Hints as to which word sequences belong together • Helps in producing well structured sentences • Allows informed reordering decisions according to the syntactic structure Why Syntax Can Help MT?
  • 22. • Hints as to which word sequences belong together • Helps in producing well structured sentences • Allows informed reordering decisions according to the syntactic structure • Encourages long-distance dependencies when selecting translations Why Syntax Can Help MT?
  • 26. Our Approach: String-to-Tree NMT source target
  • 27. Our Approach: String-to-Tree NMT • Main idea: translate a source sentence into a linearized tree of the target sentence source target
  • 28. Our Approach: String-to-Tree NMT • Main idea: translate a source sentence into a linearized tree of the target sentence • Inspired by works on RNN-based syntactic parsing (Vinyals et. al, 
 2015, Choe & Charniak, 2016) source target
  • 29. Our Approach: String-to-Tree NMT • Main idea: translate a source sentence into a linearized tree of the target sentence • Inspired by works on RNN-based syntactic parsing (Vinyals et. al, 
 2015, Choe & Charniak, 2016) • Allows using the seq2seq framework as-is source target
  • 30. Experimental Details • We used the Nematus toolkit (Sennrich et al. 2017)
 • Joint BPE segmentation (Sennrich et al. 2016) • For training, we parse the target side using the BLLIP parser (McClosky, Charniak and Johnson, 2006) • Requires some care about making BPE, Tokenization and Parser work together
  • 32. Experiments - Large Scale • German to English, 4.5 million parallel training sentences from WMT16
  • 33. Experiments - Large Scale • German to English, 4.5 million parallel training sentences from WMT16 • Train two NMT models using the same setup (same settings as the SOTA neural system in WMT16)
  • 34. Experiments - Large Scale • German to English, 4.5 million parallel training sentences from WMT16 • Train two NMT models using the same setup (same settings as the SOTA neural system in WMT16) • syntax-aware (bpe2tree)
  • 35. Experiments - Large Scale • German to English, 4.5 million parallel training sentences from WMT16 • Train two NMT models using the same setup (same settings as the SOTA neural system in WMT16) • syntax-aware (bpe2tree) • syntax-agnostic baseline (bpe2bpe)
  • 36. Experiments - Large Scale • German to English, 4.5 million parallel training sentences from WMT16 • Train two NMT models using the same setup (same settings as the SOTA neural system in WMT16) • syntax-aware (bpe2tree) • syntax-agnostic baseline (bpe2bpe) • The syntax-aware model performs better in terms of BLEU Single Model 5 Model Ensemble
  • 37. Experiments - Low Resource • German/Russian/Czech to English - 180k-140k parallel training sentences (News Commentary v8) • The syntax-aware model performs better in terms of BLEU in all cases (12 comparisons) • Up to 2+ BLEU improvement
  • 39. Accurate Trees • 99% of the predicted trees in the development set had valid bracketing • Eye-balling the predicted trees found them well-formed and following the syntax of English.
  • 40. Where Syntax Helps? Alignments
  • 41. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target
  • 42. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produced more sensible alignments
  • 43. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produced more sensible alignments
  • 44. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produced more sensible alignments
  • 45. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produces more sensible alignments
  • 46. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produces more sensible alignments
  • 47. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produces more sensible alignments
  • 48. Where Syntax Helps? Alignments • The attention based model induces soft alignments between the source and the target • The syntax-aware model produces more sensible alignments
  • 50. Attending to Source Syntax • We inspected the attention weights during the production of the tree’s opening brackets
  • 51. Attending to Source Syntax • We inspected the attention weights during the production of the tree’s opening brackets • The model consistently attends to the main verb (“hatte") or to structural markers (question marks, hyphens…) in the source sentence
  • 52. Attending to Source Syntax • We inspected the attention weights during the production of the tree’s opening brackets • The model consistently attends to the main verb (“hatte") or to structural markers (question marks, hyphens…) in the source sentence • Indicates the system implicitly learns source syntax to some extent (Shi, Padhi and Knight, 2016) and possibly plans the decoding accordingly
  • 53. Where Syntax Helps? Structure
  • 54. Where Syntax Helps? Structure
  • 55. Where Syntax Helps? Structure
  • 56. Structure (I) - Reordering
  • 57. Structure (I) - Reordering • German to English translation requires a significant amount of reordering during translation
  • 58. Structure (I) - Reordering • German to English translation requires a significant amount of reordering during translation • Quantifying reordering shows that the syntax-aware system performs more reordering during the training process
  • 59. Structure (I) - Reordering
  • 60. Structure (I) - Reordering • We would like to interpret the increased reordering from a syntactic perspective
  • 61. Structure (I) - Reordering • We would like to interpret the increased reordering from a syntactic perspective • We extract GHKM rules (Galley et al., 2004) from the dev set using the predicted trees and attention-induced alignments
  • 62. Structure (I) - Reordering • We would like to interpret the increased reordering from a syntactic perspective • We extract GHKM rules (Galley et al., 2004) from the dev set using the predicted trees and attention-induced alignments • The most common rules reveal linguistically sensible transformations, like moving the verb from the end of a German constituent to the beginning of the matching English one German English:
  • 63. Structure (I) - Reordering • We would like to interpret the increased reordering from a syntactic perspective • We extract GHKM rules (Galley et al., 2004) from the dev set using the predicted trees and attention-induced alignments • The most common rules reveal linguistically sensible transformations, like moving the verb from the end of a German constituent to the beginning of the matching English one • More examples in the paper German English:
  • 64. Structure (II) - Relative Constructions
  • 65. Structure (II) - Relative Constructions • A common linguistic structure is relative constructions, i.e. “The XXX which YYY”, “A XXX whose YYY”…
  • 66. Structure (II) - Relative Constructions • A common linguistic structure is relative constructions, i.e. “The XXX which YYY”, “A XXX whose YYY”… • The words that connect the clauses in such constructions are called relative pronouns, i.e. “who”, “which”, “whom”…
  • 67. Structure (II) - Relative Constructions • A common linguistic structure is relative constructions, i.e. “The XXX which YYY”, “A XXX whose YYY”… • The words that connect the clauses in such constructions are called relative pronouns, i.e. “who”, “which”, “whom”… • The syntax-aware system produced more relative pronouns due to the syntactic context
  • 68. Structure (II) - Relative Constructions
  • 69. Source: “Guangzhou, das in Deutschland auch Kanton genannt wird…” Structure (II) - Relative Constructions
  • 70. Source: “Guangzhou, das in Deutschland auch Kanton genannt wird…” Reference: “Guangzhou, which is also known as Canton in Germany…” Structure (II) - Relative Constructions
  • 71. Syntax-Agnostic: “Guangzhou, also known in Germany, is one of… Source: “Guangzhou, das in Deutschland auch Kanton genannt wird…” Reference: “Guangzhou, which is also known as Canton in Germany…” Structure (II) - Relative Constructions
  • 72. Syntax-Agnostic: “Guangzhou, also known in Germany, is one of… Syntax-Based: “Guangzhou, which is also known as the canton in Germany,…” Source: “Guangzhou, das in Deutschland auch Kanton genannt wird…” Reference: “Guangzhou, which is also known as Canton in Germany…” Structure (II) - Relative Constructions
  • 73. Source: “Zugleich droht der stark von internationalen Firmen abhängigen Region ein Imageschaden…” Structure (II) - Relative Constructions
  • 74. Reference: “At the same time, the image of the region, which is heavily reliant on international companies…” Source: “Zugleich droht der stark von internationalen Firmen abhängigen Region ein Imageschaden…” Structure (II) - Relative Constructions
  • 75. Syntax-Agnostic: “At the same time, the region's heavily dependent region…” Reference: “At the same time, the image of the region, which is heavily reliant on international companies…” Source: “Zugleich droht der stark von internationalen Firmen abhängigen Region ein Imageschaden…” Structure (II) - Relative Constructions
  • 76. Syntax-Agnostic: “At the same time, the region's heavily dependent region…” Syntax-Based: “At the same time, the region, which is heavily dependent on international firms…” Reference: “At the same time, the image of the region, which is heavily reliant on international companies…” Source: “Zugleich droht der stark von internationalen Firmen abhängigen Region ein Imageschaden…” Structure (II) - Relative Constructions
  • 78. Human Evaluation • We performed a small-scale human-evaluation using mechanical turk on the first 500 sentences in newstest 2015
  • 79. Human Evaluation • We performed a small-scale human-evaluation using mechanical turk on the first 500 sentences in newstest 2015 • Two turkers per sentence
  • 80. Human Evaluation • We performed a small-scale human-evaluation using mechanical turk on the first 500 sentences in newstest 2015 • Two turkers per sentence • The syntax-aware translations had an advantage over the baseline 0 95 190 2bpe better neutral 2tree better
  • 82. Conclusions • Neural machine translation can clearly benefit from target-side syntax 
 
 Other recent work include:
  • 83. Conclusions • Neural machine translation can clearly benefit from target-side syntax 
 
 Other recent work include: • Eriguchi et al., 2017, Wu et al., 2017 (Dependency)
  • 84. Conclusions • Neural machine translation can clearly benefit from target-side syntax 
 
 Other recent work include: • Eriguchi et al., 2017, Wu et al., 2017 (Dependency) • Nadejde et al., 2017 (CCG)
  • 85. Conclusions • Neural machine translation can clearly benefit from target-side syntax 
 
 Other recent work include: • Eriguchi et al., 2017, Wu et al., 2017 (Dependency) • Nadejde et al., 2017 (CCG) • A general approach - can be easily incorporated into other neural language generation tasks like summarization, image caption generation…
  • 86. Conclusions • Neural machine translation can clearly benefit from target-side syntax 
 
 Other recent work include: • Eriguchi et al., 2017, Wu et al., 2017 (Dependency) • Nadejde et al., 2017 (CCG) • A general approach - can be easily incorporated into other neural language generation tasks like summarization, image caption generation… • Larger picture: don’t throw away your linguistics! Neural systems can also leverage symbolic linguistic information