SlideShare une entreprise Scribd logo
1  sur  29
Deep Learning for Machine
Translation
Satoshi Enoue, Jungi Kim, Jean Senellart, SYSTRAN
SYSTRAN Through Machine Translation
History
Rule Base Machine Translation
Example-Based Machine
Translation
Phrase Based Machine Translation
Syntax Based Machine Translation
Neural Machine
Translation
Hybrid Machine Translation
SYSTRAN
197
1968
SYSTRAN (SYStem
TRANslation)
founded by Dr.
Toma in La Jolla,
California (USA)
1969
Provided first
MT software for
the US Air Force,
(Russian to
English)
1975
Used by NASA
for the Apollo-
Soyuz
American-Soviet
project
1975
Translation systems for
all European languages
in the European
Commission
1986
SYSTRAN is acquired
by France’s Gachot SA,
thus becoming a
French company with
a U.S. subsidiary
1995
Pioneered development of
first Windows-based MT
software
1997
First free Web-based translation
service: Altavista Babelfish. SYSTRAN
made the Internet community aware
of the usefulness and capabilities of
machine translation
2002
SYSTRAN was used on
most major Internet
Portals: Yahoo!, Google,
AltaVista, Lycos.
1996
SYSTRAN within SEIKO’s
pocket translators.
1990’s
Port technology from mainframes to
Desktop PC’s and Client-Server environments
for personal and corporate use
2014
Following acquisition by CSLI,
SYSTRAN SA forms part of the
SYSTRAN International Group
2005
Launched embedded
translation software for
mobile devices
2009
Developed first
hybrid translation
software and
solution: SES 7
Translation Server
2011
Launch of SES 7
Training Server,
first solution for
self-learning of
MT engines
2015
SES8 Translation and Training
Server – Large Models
2016
More than 140 language Pairs.
Launch of SYSTRAN.io, the
Natural Language Processing
API platform
The new game changer
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Games
• Super Human Abilities
Sequence of fascinating results
and technologies over the last 3
years – all based on Deep Neural
Network (DNN) – covering a large
variety of domains…
20/04/2016
SYSTRAN - Copyright 2016
3
The new game changer
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Games
20/04/2016
SYSTRAN - Copyright 2016
4
The new game changer
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Games Abilities
• Google 2015 RNN voice search
recognition outperforms 2012
DNN models
• Baidu Deep Speech announces
16.5% improvement over
baseline and higher
performance than human in
noisy environment
20/04/2016
SYSTRAN - Copyright 2016
5
The new game changer - examples
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Gamesn Abilities
Les yeux prenaient des redoutables, des troncs de
feu.
Toutes les prétexticheurs par ces quatre
repentilleuses avec du sergent de Digne,
débragiffés nymoeurs sur les derniers instants à
hardis, boucher, sans dénongée en plus ennérence,
ils se refecturent encore. Ils auraient déjà mangé
ses très interses.
ShakespeareVictorHugo
Char-RNN, Andrej Karpathy, 2015
공급자는 AspNetXSprchyLibrary의 인스턴스를
만들어 다른 경고를 오버터 컴퓨터에 저장할 수
있습니다.
MSDN
20/04/2016
SYSTRAN - Copyright 2016
20/04/2016
SYSTRAN - Copyright 2016
7
20/04/2016
SYSTRAN - Copyright 2016
8
The new game changer - examples
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Games
20/04/2016 9word2vec, Google, 2013
The new game changer - examples
• Unified Neural Network
Architecture for several NLP tasks
POS tagging, chunking, NER, SRL
• Focus on avoiding task/linguistic
specific engineering
• Joint decision on the different tasks
Outperforms almost all of the state
of the art results for each individual
tasks
Natural Language Processing (Almost) from Scratch, Collobert et al., 201120/04/2016
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation
• … Gamesn Abilities
The new game changer - examples
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine Translation:
sentence encoding-decoding
• … Games
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, K. Cho et al, 2014
The new game changer - examples
• Deep Neural Network
Technologies
• Image Analysis
• Voice Recognition
• Text
• Text Generation
• Word Embeddings
• Multitask NLP
• Neural Machine : sentence encoding-
decoding
• … Games – DQN, AlphaGo
HUMAN-LEVEL CONTROL THROUGH DEEP REINFORCEMENT LEARNING, Google DeepMind, 201520/04/2016
AlphaGo, Google DeepMind, 201620/04/2016
SYSTRAN - Copyright 2016
The new game changer - examples
More and more evidence of
“super-human abilities”
Could we also reach Super-
human Machine Translation?
20/04/2016
SYSTRAN - Copyright 2016
14
The new game changer – ingredients
• MLP – multilayer perceptron
• Actually an “old concept”
• CNN
• Convolutional Neural network
• Word Embeddings
• Representing words as vectors
• RNN – GRU, LSTM
• MLP with memory
• Attention-Based models
• Ability to decide where to find
information
20/04/2016
SYSTRAN - Copyright 2016
15
The new game changer – ingredients
• MLP – multilayer perceptron
• Actually an “old concept”
• CNN
• Convolutional Neural network
• Word Embeddings
• Representing words as vectors
• RNN – GRU, LSTM
• MLP with memory
• Attention-Based models
• Ability to decide where to find
information
20/04/2016
SYSTRAN - Copyright 2016
16
The new game changer – ingredients
• MLP – multilayer perceptron
• Actually an “old concept”
• CNN
• Convolutional Neural network
• Word Embeddings
• Representing words as vectors
• RNN – GRU, LSTM
• MLP with memory
• Attention-Based models
• Ability to decide where to find
information
20/04/2016
SYSTRAN - Copyright 2016
17
The new game changer – ingredients
• MLP – multilayer perceptron
• Actually an “old concept”
• CNN
• Convolutional Neural network
• Word Embeddings
• Representing words as vectors
• RNN – GRU, LSTM
• MLP with memory
• Attention-Based models
• Ability to decide where to find
information
20/04/2016
SYSTRAN - Copyright 2016
18
The new game changer – ingredients
• MLP – multilayer perceptron
• Actually an “old concept”
• CNN
• Convolutional Neural network
• Word Embeddings
• Representing words as vectors
• RNN – GRU, LSTM
• MLP with memory
• Attention-Based models
• Ability to decide where to find
information
20/04/2016
SYSTRAN - Copyright 2016
19
All of these features are the ingredients to
Neural Machine Translation
About Neural Machine
Translation (NMT)
• The goal is to perform end-to-end translation
• Like in Speech Recognition
• The spirit is to remove all these features and have single system
• For Machine Translation – first NMT systems are encoder-decoder
• But not that magic
• Not systematic improvements over SMT baseline
• Use of ensemble systems
• Issues with sentence lengths, vocabulary size
• Solutions come back with some interest in “linguistic” characteristics
• Attention-Based model (alignment information)
• Deep Fusion with Language Model (better modelling of target language)
• Combine with word level (~ morphology)
20/04/2016
SYSTRAN - Copyright 2016
20
SYSTRAN approach to NMT
• Current Real Use-Case Requirements:
• Adaptation to (small) domain
• Help for post-editing
• Preserved speed
• Consistent results amongst multiple target languages
• Possibility to let users control translation through annotations, terminology
• …
• Toward Linguistically Motivated NN architecture
• SYSTRAN MT is composed of linguistic modules – let us start with them
• Lot of knowledge to leverage
20/04/2016
SYSTRAN - Copyright 2016
21
SYSTRAN Deep Learning Story – Part I
Language Identification
SYSTRAN LDK 1
•Statistical Classifier – 3-grams
•Heavily Feature Engineered over years
•e.g. diacritics model for latin language
•Include lexicon of frequent terms
•Quite good accuracy on news-type data
– need ~20 characters
Basic RNN
•“out-of-the-box” character level RNN
•no specific language specific
engineering
•80K words training per language
Google CLD
•Naïve Bayesian Classifier – 4-grams
•Trained on “big data”
•carefully scrapped over 100M pages
•Specific tricks for closely related
languages (Spanish/Portuguese)
•Geared for webpages - 200+ characters
Learnings: with same data RNN approach easily outperforms baseline, no
specific engineering needed… big data is not competing...
20/04/2016
SYSTRAN - Copyright 2016
22
News
Sentences
One-word
request
Ted-Talk
Sentences
Tweets
LDK 97 55.2 87.4 78.3
RNN 98.2 61.5 91.4 77.9
CLD 96.1 15.3 86 78.1
SYSTRAN Deep Learning Story – Part II
Part of Speech Tagging
Phase 1 - 1968-2014 - Handcrafting
•Manual Rule and Lexicon Coding of homography
•Closely related to Morphology description
•27 languages covered
Phase 2 - 2008-2015 – Annotating
•Train Classifier to "relearn” rules (fnTBL)
•Transfer knowledge through system output
•Maintenance through Annotation
Phase 3 - 2015- - Generalizing
•Relearn with RNN
•Joint decision (so far tokenization/part of speech
tagging) – working on morphology
•Better generalization from additional knowledge
(word embeddings)
20/04/2016
SYSTRAN - Copyright 2016
23
Learnings: Possibility to leverage ”handcrafting” and gain quality. But
learning becoming too smart – it also learns initial errors
SYSTRAN Deep Learning Story – Part III
Transliteration
20/04/2016 24
• Transliteration of person names
is depending on
• Source Language
• Target Language
• But also Name origin
• 카스파로프 = Kasparov
• 필리프 = Philippe
• Good Transliteration system
needs:
• Detection of origin
• Transliteration mechanism
•Extremely complicated – since it requires
phonetics modeling
Rule-Based
• Satisfactory but origin detection and multiple
domains
• No generalization - unseen sequence is wrong
PBMT
• Encoding-Decoding Approach
• Long distance "view" guarantee consistency of
transliteration
RNN
Learnings:
- losing reliability/traceability of the process
+ more global consistency, compactness of the solution
SYSTRAN Deep Learning Story – Part IV
Language Modeling
• RNN language model proves to overpass standard n-gram models
• No limitation in the span
• Seems to capture also better the language structure
• Better generalization due to word embedding
• Can be easily introduced in PBMT engine through rescoring
• Are still challenging pure sequence-to-sequence NMT approaches
20/04/2016 25
Learnings:
- Very long training process, several weeks of training for one language
+ Consistent quality gain, easy introduction in existing framework
Learnings from Deep Learning
• Consistent quality improvement in all the experiments/modules we
worked on
• Better leverage of existing training material
• Better generalization
• Incrementability: by design, it is immediate to feed more training data
– i.e. adapt dynamically to usage
• Globally more simple than alternative approaches and cognitively
interesting
• Fit to be combined in a global NN architecture
20/04/2016
SYSTRAN - Copyright 2016
26
Linguistically Motivated NN architecture
20/04/2016 SYSTRAN - Copyright 2016 27
Morphology
Syntactic Analysis
Sentence Encoding Sentence Decoding
RNN-LM
Word Embedding
Source Sentence …
Target Sentence …
What about Statistical Post Editing:
Learning to correct?
20/04/2016
SYSTRAN - Copyright 2016
28
• SPE was introduced as smart
alternative the SMT
• Corresponding to real MT use case for
localization
• Very little data can produce adaptation
• Reduce Human Post-Editor Work by
iteratively learning edits
• However implementation with PBMT
is not satisfactory
• PBMT does not learn to correct but to
translate
• Not incremental
• Learning to correct
• More control of the process
Toward a “translation checker”
• Change the paradigm – now human post-
editor to MT output, tomorrow
automatic post-editor to human output?
MT
HPE
Deep Learning for Machine Translation
• No doubt – it is coming:
• We will probably reach “superhuman” machine translation in coming years
• And this could become real translation assistant
• How is not yet completely clear
• From our perspective, we are working on hybrid approach = linguistically motivated
NN architecture
• More will also be coming from research world
• Still some work ahead
• Training of models is still a technological challenge
• We need the models to explain as much as to translate to become really useful – or
for language learning
• Multi-level analysis - document translation and not just sentences
• Multi-modal => could lead to full self language learning
20/04/2016
SYSTRAN - Copyright 2016
29

Contenu connexe

Tendances

Multilingual Content in Drupal 7 & 8 at DrupalCon Munich
Multilingual Content in Drupal 7 & 8 at DrupalCon MunichMultilingual Content in Drupal 7 & 8 at DrupalCon Munich
Multilingual Content in Drupal 7 & 8 at DrupalCon Munich
Suzanne Dergacheva
 
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
David Meyer
 

Tendances (15)

OWF14 - Big Data : The State of Machine Learning in 2014
OWF14 - Big Data : The State of Machine  Learning in 2014OWF14 - Big Data : The State of Machine  Learning in 2014
OWF14 - Big Data : The State of Machine Learning in 2014
 
Towards Universal Semantic Understanding of Natural Languages
Towards Universal Semantic Understanding of Natural LanguagesTowards Universal Semantic Understanding of Natural Languages
Towards Universal Semantic Understanding of Natural Languages
 
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
Omar Tawakol at AI Frontiers: The Rise Of Voice-Activated Assistants In The W...
 
"erlang, webmail and hibari" at Rakuten tech talk
"erlang, webmail and hibari" at Rakuten tech talk"erlang, webmail and hibari" at Rakuten tech talk
"erlang, webmail and hibari" at Rakuten tech talk
 
Deep Learning in NLP (BERT, ERNIE and REFORMER)
Deep Learning in NLP (BERT, ERNIE and REFORMER)Deep Learning in NLP (BERT, ERNIE and REFORMER)
Deep Learning in NLP (BERT, ERNIE and REFORMER)
 
[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music[221]똑똑한 인공지능 dj 비서 clova music
[221]똑똑한 인공지능 dj 비서 clova music
 
Building NLP solutions using Python
Building NLP solutions using PythonBuilding NLP solutions using Python
Building NLP solutions using Python
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Multilingual Content in Drupal 7 & 8 at DrupalCon Munich
Multilingual Content in Drupal 7 & 8 at DrupalCon MunichMultilingual Content in Drupal 7 & 8 at DrupalCon Munich
Multilingual Content in Drupal 7 & 8 at DrupalCon Munich
 
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
Macro Trends, Architecture, and the Hidden Nature of Complexity (and what doe...
 
Kotlin strives for Deep Learning
Kotlin strives for Deep LearningKotlin strives for Deep Learning
Kotlin strives for Deep Learning
 
65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...
65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...
65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...
 
The NLP Muppets revolution!
The NLP Muppets revolution!The NLP Muppets revolution!
The NLP Muppets revolution!
 
Chatbot ppt
Chatbot pptChatbot ppt
Chatbot ppt
 

En vedette

投影片範本 Papago!mobile ppt
投影片範本 Papago!mobile ppt投影片範本 Papago!mobile ppt
投影片範本 Papago!mobile ppt
XuanJun Lin
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
Rushdi Shams
 

En vedette (16)

Machine Translation Introduction
Machine Translation IntroductionMachine Translation Introduction
Machine Translation Introduction
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
 
Google's Multilingual Neural Machine Translation System
Google's Multilingual Neural Machine Translation SystemGoogle's Multilingual Neural Machine Translation System
Google's Multilingual Neural Machine Translation System
 
Deep Learning for Machine Translation, by Jean Senellart, SYSTRAN
Deep Learning for Machine Translation, by Jean Senellart, SYSTRANDeep Learning for Machine Translation, by Jean Senellart, SYSTRAN
Deep Learning for Machine Translation, by Jean Senellart, SYSTRAN
 
On using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translationOn using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translation
 
Deep Learning for Machine Translation
Deep Learning for Machine TranslationDeep Learning for Machine Translation
Deep Learning for Machine Translation
 
投影片範本 Papago!mobile ppt
投影片範本 Papago!mobile ppt投影片範本 Papago!mobile ppt
投影片範本 Papago!mobile ppt
 
A Simple Introduction to Word Embeddings
A Simple Introduction to Word EmbeddingsA Simple Introduction to Word Embeddings
A Simple Introduction to Word Embeddings
 
Machine translation
Machine translationMachine translation
Machine translation
 
[224] backend 개발자의 neural machine translation 개발기 김상경
[224] backend 개발자의 neural machine translation 개발기 김상경[224] backend 개발자의 neural machine translation 개발기 김상경
[224] backend 개발자의 neural machine translation 개발기 김상경
 
(Kpi summer school 2015) word embeddings and neural language modeling
(Kpi summer school 2015) word embeddings and neural language modeling(Kpi summer school 2015) word embeddings and neural language modeling
(Kpi summer school 2015) word embeddings and neural language modeling
 
[134]papago 김준석
[134]papago 김준석[134]papago 김준석
[134]papago 김준석
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
 
Papago/N2MT 개발이야기
Papago/N2MT 개발이야기Papago/N2MT 개발이야기
Papago/N2MT 개발이야기
 
Introduction to word embeddings with Python
Introduction to word embeddings with PythonIntroduction to word embeddings with Python
Introduction to word embeddings with Python
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?
 

Similaire à Deep Learning for Machine Translation, by Satoshi Enoue, SYSTRAN

Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Simplilearn
 

Similaire à Deep Learning for Machine Translation, by Satoshi Enoue, SYSTRAN (20)

Understanding deep learning
Understanding deep learningUnderstanding deep learning
Understanding deep learning
 
P1 2017 python
P1 2017 pythonP1 2017 python
P1 2017 python
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
 
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech TalksA Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
A Deeper Dive into Apache MXNet - March 2017 AWS Online Tech Talks
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekinge2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekinge
 
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
AWS re:Invent 2016: Deep Learning in Alexa (MAC202)
 
Amazon Deep Learning
Amazon Deep LearningAmazon Deep Learning
Amazon Deep Learning
 
Python for Data: Past, Present, Future (PyCon JP 2017 Keynote)
Python for Data: Past, Present, Future (PyCon JP 2017 Keynote)Python for Data: Past, Present, Future (PyCon JP 2017 Keynote)
Python for Data: Past, Present, Future (PyCon JP 2017 Keynote)
 
Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016
Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016
Hussein Mehanna, Engineering Director, ML Core - Facebook at MLconf ATL 2016
 
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
Hoe een efficiënte Machine of Deep Learning backend ontwikkelen?
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
 
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
 
AI and Innovations on AWS
AI and Innovations on AWSAI and Innovations on AWS
AI and Innovations on AWS
 
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
 
Deep Learning: Changing the Playing Field of Artificial Intelligence - MaRS G...
Deep Learning: Changing the Playing Field of Artificial Intelligence - MaRS G...Deep Learning: Changing the Playing Field of Artificial Intelligence - MaRS G...
Deep Learning: Changing the Playing Field of Artificial Intelligence - MaRS G...
 
Big data 4 webmonday
Big data 4 webmondayBig data 4 webmonday
Big data 4 webmonday
 
How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?
 
Melbourne Cocoa Heads CoreML Presentation
Melbourne Cocoa Heads CoreML PresentationMelbourne Cocoa Heads CoreML Presentation
Melbourne Cocoa Heads CoreML Presentation
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 

Plus de TAUS - The Language Data Network

Plus de TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Deep Learning for Machine Translation, by Satoshi Enoue, SYSTRAN

  • 1. Deep Learning for Machine Translation Satoshi Enoue, Jungi Kim, Jean Senellart, SYSTRAN
  • 2. SYSTRAN Through Machine Translation History Rule Base Machine Translation Example-Based Machine Translation Phrase Based Machine Translation Syntax Based Machine Translation Neural Machine Translation Hybrid Machine Translation SYSTRAN 197 1968 SYSTRAN (SYStem TRANslation) founded by Dr. Toma in La Jolla, California (USA) 1969 Provided first MT software for the US Air Force, (Russian to English) 1975 Used by NASA for the Apollo- Soyuz American-Soviet project 1975 Translation systems for all European languages in the European Commission 1986 SYSTRAN is acquired by France’s Gachot SA, thus becoming a French company with a U.S. subsidiary 1995 Pioneered development of first Windows-based MT software 1997 First free Web-based translation service: Altavista Babelfish. SYSTRAN made the Internet community aware of the usefulness and capabilities of machine translation 2002 SYSTRAN was used on most major Internet Portals: Yahoo!, Google, AltaVista, Lycos. 1996 SYSTRAN within SEIKO’s pocket translators. 1990’s Port technology from mainframes to Desktop PC’s and Client-Server environments for personal and corporate use 2014 Following acquisition by CSLI, SYSTRAN SA forms part of the SYSTRAN International Group 2005 Launched embedded translation software for mobile devices 2009 Developed first hybrid translation software and solution: SES 7 Translation Server 2011 Launch of SES 7 Training Server, first solution for self-learning of MT engines 2015 SES8 Translation and Training Server – Large Models 2016 More than 140 language Pairs. Launch of SYSTRAN.io, the Natural Language Processing API platform
  • 3. The new game changer • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Games • Super Human Abilities Sequence of fascinating results and technologies over the last 3 years – all based on Deep Neural Network (DNN) – covering a large variety of domains… 20/04/2016 SYSTRAN - Copyright 2016 3
  • 4. The new game changer • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Games 20/04/2016 SYSTRAN - Copyright 2016 4
  • 5. The new game changer • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Games Abilities • Google 2015 RNN voice search recognition outperforms 2012 DNN models • Baidu Deep Speech announces 16.5% improvement over baseline and higher performance than human in noisy environment 20/04/2016 SYSTRAN - Copyright 2016 5
  • 6. The new game changer - examples • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Gamesn Abilities Les yeux prenaient des redoutables, des troncs de feu. Toutes les prétexticheurs par ces quatre repentilleuses avec du sergent de Digne, débragiffés nymoeurs sur les derniers instants à hardis, boucher, sans dénongée en plus ennérence, ils se refecturent encore. Ils auraient déjà mangé ses très interses. ShakespeareVictorHugo Char-RNN, Andrej Karpathy, 2015 공급자는 AspNetXSprchyLibrary의 인스턴스를 만들어 다른 경고를 오버터 컴퓨터에 저장할 수 있습니다. MSDN 20/04/2016 SYSTRAN - Copyright 2016
  • 9. The new game changer - examples • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Games 20/04/2016 9word2vec, Google, 2013
  • 10. The new game changer - examples • Unified Neural Network Architecture for several NLP tasks POS tagging, chunking, NER, SRL • Focus on avoiding task/linguistic specific engineering • Joint decision on the different tasks Outperforms almost all of the state of the art results for each individual tasks Natural Language Processing (Almost) from Scratch, Collobert et al., 201120/04/2016 • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation • … Gamesn Abilities
  • 11. The new game changer - examples • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine Translation: sentence encoding-decoding • … Games Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, K. Cho et al, 2014
  • 12. The new game changer - examples • Deep Neural Network Technologies • Image Analysis • Voice Recognition • Text • Text Generation • Word Embeddings • Multitask NLP • Neural Machine : sentence encoding- decoding • … Games – DQN, AlphaGo HUMAN-LEVEL CONTROL THROUGH DEEP REINFORCEMENT LEARNING, Google DeepMind, 201520/04/2016
  • 13. AlphaGo, Google DeepMind, 201620/04/2016 SYSTRAN - Copyright 2016
  • 14. The new game changer - examples More and more evidence of “super-human abilities” Could we also reach Super- human Machine Translation? 20/04/2016 SYSTRAN - Copyright 2016 14
  • 15. The new game changer – ingredients • MLP – multilayer perceptron • Actually an “old concept” • CNN • Convolutional Neural network • Word Embeddings • Representing words as vectors • RNN – GRU, LSTM • MLP with memory • Attention-Based models • Ability to decide where to find information 20/04/2016 SYSTRAN - Copyright 2016 15
  • 16. The new game changer – ingredients • MLP – multilayer perceptron • Actually an “old concept” • CNN • Convolutional Neural network • Word Embeddings • Representing words as vectors • RNN – GRU, LSTM • MLP with memory • Attention-Based models • Ability to decide where to find information 20/04/2016 SYSTRAN - Copyright 2016 16
  • 17. The new game changer – ingredients • MLP – multilayer perceptron • Actually an “old concept” • CNN • Convolutional Neural network • Word Embeddings • Representing words as vectors • RNN – GRU, LSTM • MLP with memory • Attention-Based models • Ability to decide where to find information 20/04/2016 SYSTRAN - Copyright 2016 17
  • 18. The new game changer – ingredients • MLP – multilayer perceptron • Actually an “old concept” • CNN • Convolutional Neural network • Word Embeddings • Representing words as vectors • RNN – GRU, LSTM • MLP with memory • Attention-Based models • Ability to decide where to find information 20/04/2016 SYSTRAN - Copyright 2016 18
  • 19. The new game changer – ingredients • MLP – multilayer perceptron • Actually an “old concept” • CNN • Convolutional Neural network • Word Embeddings • Representing words as vectors • RNN – GRU, LSTM • MLP with memory • Attention-Based models • Ability to decide where to find information 20/04/2016 SYSTRAN - Copyright 2016 19 All of these features are the ingredients to Neural Machine Translation
  • 20. About Neural Machine Translation (NMT) • The goal is to perform end-to-end translation • Like in Speech Recognition • The spirit is to remove all these features and have single system • For Machine Translation – first NMT systems are encoder-decoder • But not that magic • Not systematic improvements over SMT baseline • Use of ensemble systems • Issues with sentence lengths, vocabulary size • Solutions come back with some interest in “linguistic” characteristics • Attention-Based model (alignment information) • Deep Fusion with Language Model (better modelling of target language) • Combine with word level (~ morphology) 20/04/2016 SYSTRAN - Copyright 2016 20
  • 21. SYSTRAN approach to NMT • Current Real Use-Case Requirements: • Adaptation to (small) domain • Help for post-editing • Preserved speed • Consistent results amongst multiple target languages • Possibility to let users control translation through annotations, terminology • … • Toward Linguistically Motivated NN architecture • SYSTRAN MT is composed of linguistic modules – let us start with them • Lot of knowledge to leverage 20/04/2016 SYSTRAN - Copyright 2016 21
  • 22. SYSTRAN Deep Learning Story – Part I Language Identification SYSTRAN LDK 1 •Statistical Classifier – 3-grams •Heavily Feature Engineered over years •e.g. diacritics model for latin language •Include lexicon of frequent terms •Quite good accuracy on news-type data – need ~20 characters Basic RNN •“out-of-the-box” character level RNN •no specific language specific engineering •80K words training per language Google CLD •Naïve Bayesian Classifier – 4-grams •Trained on “big data” •carefully scrapped over 100M pages •Specific tricks for closely related languages (Spanish/Portuguese) •Geared for webpages - 200+ characters Learnings: with same data RNN approach easily outperforms baseline, no specific engineering needed… big data is not competing... 20/04/2016 SYSTRAN - Copyright 2016 22 News Sentences One-word request Ted-Talk Sentences Tweets LDK 97 55.2 87.4 78.3 RNN 98.2 61.5 91.4 77.9 CLD 96.1 15.3 86 78.1
  • 23. SYSTRAN Deep Learning Story – Part II Part of Speech Tagging Phase 1 - 1968-2014 - Handcrafting •Manual Rule and Lexicon Coding of homography •Closely related to Morphology description •27 languages covered Phase 2 - 2008-2015 – Annotating •Train Classifier to "relearn” rules (fnTBL) •Transfer knowledge through system output •Maintenance through Annotation Phase 3 - 2015- - Generalizing •Relearn with RNN •Joint decision (so far tokenization/part of speech tagging) – working on morphology •Better generalization from additional knowledge (word embeddings) 20/04/2016 SYSTRAN - Copyright 2016 23 Learnings: Possibility to leverage ”handcrafting” and gain quality. But learning becoming too smart – it also learns initial errors
  • 24. SYSTRAN Deep Learning Story – Part III Transliteration 20/04/2016 24 • Transliteration of person names is depending on • Source Language • Target Language • But also Name origin • 카스파로프 = Kasparov • 필리프 = Philippe • Good Transliteration system needs: • Detection of origin • Transliteration mechanism •Extremely complicated – since it requires phonetics modeling Rule-Based • Satisfactory but origin detection and multiple domains • No generalization - unseen sequence is wrong PBMT • Encoding-Decoding Approach • Long distance "view" guarantee consistency of transliteration RNN Learnings: - losing reliability/traceability of the process + more global consistency, compactness of the solution
  • 25. SYSTRAN Deep Learning Story – Part IV Language Modeling • RNN language model proves to overpass standard n-gram models • No limitation in the span • Seems to capture also better the language structure • Better generalization due to word embedding • Can be easily introduced in PBMT engine through rescoring • Are still challenging pure sequence-to-sequence NMT approaches 20/04/2016 25 Learnings: - Very long training process, several weeks of training for one language + Consistent quality gain, easy introduction in existing framework
  • 26. Learnings from Deep Learning • Consistent quality improvement in all the experiments/modules we worked on • Better leverage of existing training material • Better generalization • Incrementability: by design, it is immediate to feed more training data – i.e. adapt dynamically to usage • Globally more simple than alternative approaches and cognitively interesting • Fit to be combined in a global NN architecture 20/04/2016 SYSTRAN - Copyright 2016 26
  • 27. Linguistically Motivated NN architecture 20/04/2016 SYSTRAN - Copyright 2016 27 Morphology Syntactic Analysis Sentence Encoding Sentence Decoding RNN-LM Word Embedding Source Sentence … Target Sentence …
  • 28. What about Statistical Post Editing: Learning to correct? 20/04/2016 SYSTRAN - Copyright 2016 28 • SPE was introduced as smart alternative the SMT • Corresponding to real MT use case for localization • Very little data can produce adaptation • Reduce Human Post-Editor Work by iteratively learning edits • However implementation with PBMT is not satisfactory • PBMT does not learn to correct but to translate • Not incremental • Learning to correct • More control of the process Toward a “translation checker” • Change the paradigm – now human post- editor to MT output, tomorrow automatic post-editor to human output? MT HPE
  • 29. Deep Learning for Machine Translation • No doubt – it is coming: • We will probably reach “superhuman” machine translation in coming years • And this could become real translation assistant • How is not yet completely clear • From our perspective, we are working on hybrid approach = linguistically motivated NN architecture • More will also be coming from research world • Still some work ahead • Training of models is still a technological challenge • We need the models to explain as much as to translate to become really useful – or for language learning • Multi-level analysis - document translation and not just sentences • Multi-modal => could lead to full self language learning 20/04/2016 SYSTRAN - Copyright 2016 29

Notes de l'éditeur

  1. The last 3 years…
  2. In Image recognition
  3. In Voice Recognition
  4. Show X is to Y what Z is to …
  5. M
  6. M
  7. M
  8. Road Sign Recognition For some tasks
  9. Actually it is not one single technology but a mix of different technologies – what is very seducing is this remains relatively simple, and appealing
  10. Convolution Neural Network are very used in the image processing – and can be seen as consecutive layers of processing that progressively extract more and more advanced features
  11. Actually it is not one single technology but a mix of different technologies – what is very seducing is this remains relatively simple, and appealing
  12. Actually it is not one single technology but a mix of different technologies – what is very seducing is this remains relatively simple, and appealing
  13. End-to-end – is also called “sequence-to-sequence”
  14. Requirements from our customer are actually quite strong – and our goal is not to produce a generic academic NMT engine, but actual solutions for our customer requirements
  15. So we would like to share with you findings of these moves to DNN and we took for that several modules
  16. Example on Chinese
  17. So we are not yet there – but what we foresee and work on is to establish a NN architecture preserving the actual traditional linguistic workflow with specialized NN stacking up to produce machine translation From this specialization – we except several things - first we would be able to use the existing knowledge, second we would still have “checkpoints” in the process allowing to monitor translation process
  18. Alternatively, the other important research directions for us – is to improve modeling on Statistical Post-Editing introduced in 2007 as an alternative to raising SMT. SPE is corresponding to real user-case: very little data, an existing system performing well but not really adapted to the task.
  19. So SYSTRANN