This presentation is a part of the MosesCore project that encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit.
MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme.
For the latest updates, follow us on Twitter - #MosesCore
TAUS MT SHOWCASE, I Used to Be a Translator, Now I Run MT, Manuel Herranz, Pangeanic, 12 June 2013
1. TAUS Machine Translation Showcase
I used to be a translator, now I run MT
11:10 – 11:30, Wednesday, 12 June 2013
Manuel Herranz, Pangeanic
2. PangeaMT
Manuel Herranz
I used to be a translator, now I run Machine Translation
#manuelhrrnz #pangeanic
E: m.herranz@pangeanic.com
pangeanic
3. Why Machine Translation?
The Data Deluge
✓ As of May 2009: 487 billion gigabytes, or 1,000,000,000 × 487,000,000,000 = 4.87 × 10^20 bytes
✓ Estimates
  ▪ Up 50% a year (Oracle)
  ▪ Doubles every 11 hours (IBM)
✓ Language translation as a job is becoming unmanageable. Increasing demands, increasing volumes, shorter deadlines. Human production is not sufficient (why is TR so labour intensive? Film production, publishing, XF are highly computerised!).
As Content Volume Explodes, Machine Translation Becomes an Inevitable Part of Global Content Strategy
http://ow.ly/jVuhZ
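The data-volume arithmetic on this slide can be checked with a short sketch. It assumes the decimal gigabyte (1 GB = 10^9 bytes), which is what the slide's multiplication implies. The `project` helper and the 1, 5 and 10 year horizons are hypothetical illustrations and are not taken from the talk.

```python
# Assumption: decimal (SI) gigabyte, 1 GB = 10**9 bytes, as implied by the slide.
BYTES_PER_GB = 10**9
gigabytes_2009 = 487 * 10**9  # 487 billion gigabytes (the slide's May 2009 figure)

total_bytes = BYTES_PER_GB * gigabytes_2009
print(f"{total_bytes:.2e} bytes")  # 4.87e+20 bytes


def project(start, annual_growth, years):
    """Compound a starting volume by a fixed annual growth rate (hypothetical helper)."""
    return start * (1 + annual_growth) ** years


# Illustrative only: apply the "up 50% a year" estimate over a few horizons.
for years in (1, 5, 10):
    print(years, f"{project(gigabytes_2009, 0.50, years):.3e} GB")
```

At a constant 50% a year the volume roughly doubles every 1.7 years, so the two estimates quoted on the slide describe very different growth rates.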
4. MT as a translator’s tool
2007: an EU post-editing contract won by Pangeanic triggers in-house translators' thinking
- Is this the best technology can do?
- Why is the result so unpredictable?
- Why is it so… BAD?
- Can we do something to improve?
5. MT as a translator’s tool
2007: an EU post-editing contract won by Pangeanic triggers in-house translators' thinking
- Is this the best technology can do?
- Why is the result so unpredictable?
- Why is it so… BAD?
- Can we do something to improve?
Since 2011: Pangeanic DIY SMT http://ow.ly/lQXSv
Test, feedback, install, iteration
euromatrixplus.net/moses-decoder/
6. Challenges
- Translator resistance, depending on language: French, Japanese the most reluctant
- Translation industry has lagged behind in many areas, even in ISO or technology adoption
- Free web-based MT services a double-edged sword: familiarize users, but unrealistic expectations
Miguel Ángel Díaz, Chairman of the Spanish Association for Emotional Intelligence, June 2013: “It makes a lot of sense to talk about emotional intelligence in education since the current system prepares the new generations for a world which is not going to exist. The job market has changed very much in the last 20 or 30 years – however the educational model has not.”
7. MT Usage
Machine Translation application, usage and success depend on:
✓ MT for assimilation: “gisting” or “understanding”
  • Practically unlimited demand; but free web-based services reduce incentive to improve technology
  • Coverage + important. Instant quality
✓ MT for dissemination: “publication”
  • Publishable quality that can only be achieved by humans. MT & tools a productivity booster
✓ MT for direct communication
  • Current R&D, Military uses systems for spoken MT, first applications for smartphones
[Diagram labels: Sports, Politics, Social, etc.; Output format]
8. How does Statistical MT work?
Example sentence: “Unrest is continuing in Cairo as protesters set up their demand for Egypt’s military rulers to resign”
+ specific language rules
+ job or client glossary
+ hybrid technologies
9. Back to Challenges
Translator resistance:
- I DON’T LIKE IT
- IT TAKES ME MORE TIME THAN DOING IT “THE NORMAL WAY”
- NO CONTROL
- MANAGEMENT-IMPOSED
- THREAT TO OWN JOB
- “DE-HUMANIZATION”
Management view:
- COST-EFFICIENCY
- TIGHTER BUDGETS, MORE MONEY FOR VALUE, LOWER COSTS
- COMPETITION ALREADY AHEAD
- TECHNOLOGY CAN BE APPLIED ELSEWHERE, NOT JUST DOCUMENTATION (REVENUE OPPORTUNITY)
COMPROMISE: USER EMPOWERMENT
10. Back to Challenges
Translator resistance: A MOANING PROFESSION?
COMPROMISE: FAMILIAR PE ENVIRONMENT
15. Myth: MT will never be as good as humans
“We cannot solve the problem using the same tools and the way of thinking that created it” (A. Einstein)
uhmmm, it is going to get really good...
1st stage (2009-2015 or 2020): We are creating usable engines, first PE experiences
2nd stage: PE material and more data make engines even more predictable. More specialist engines
3rd stage (beyond 2030...): no predictions
16. A Success Story
My translation company.com
Françoise Moreau
f.moreau@mytranslationcompany.com
87 Rue Legendre, 75017 Paris
Tel +33 1 42 25 55 77
Translation Project Manager, MT Customization Specialist