TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE


Moses from the point of view of
an LSP: The Trusted
Translations Experience

By Gustavo Lucardi




TAUS Open Source Machine Translation
Pre Conference Workshops
GALA MONACO 2012
Sunday 25 March 13:30 - 17:30

Gustavo Lucardi
COO Trusted Translations, Inc.
@glucardi



  An On-going Story
   • Not a Success Story (Yet)
   • An On-going Story (Since October 2010)
   • Moses from the Point of View of an LSP:
      ▫ Why Moses? Why Now?
      ▫ What exactly are we doing?
      ▫ Our Expectations (What we are looking for)
      ▫ Unexpected Experiences and Issues
      ▫ Next Steps
      ▫ Our Conclusions (Till Now)




  Why Moses? Why Now?
   • Because we have to
      ▫ In Friedman’s The World Is Flat, the refrigeration industry was
        transformed by the advent of the electric refrigerator, which killed
        the ice block industry. Transform or die.
      ▫ Every LSP has to start learning about MT or prepare to suffer the same
        fate as the ice block makers.
   • We believe that the future of the professional translation
     industry will include MT as a part (a central part?) of the process.
   • To expose all the members of our company to this
     nascent model and its consequences in all areas of the
     business cycle, from sales to production, administration,
     in-house resources, freelancers and, of course, IT.




  What exactly are we doing?
   • We built two Moses engines and are running tests:
      ▫ Legal English to Spanish (based on legal specialization of many clients)
      ▫ Technical English to Spanish (based on technical experience from one
        specific client)
   • We are also testing other MT solutions like:
      ▫ Pangea MT (Pangeanic)
      ▫ DoMT (Precision Translation Tools)
      ▫ Language Studio (Asia Online)
      ▫ Smart MATE (Applied Languages)
      ▫ LetsMT
      ▫ Manuel Herranz from Pangeanic was especially helpful in our understanding of MT





  Our Expectations (What we are looking for)
   • To be able to deliver higher volumes in less time while
     maintaining quality and consistency (Quality and Time, not
     Price)
   • The price of doing MT+PE+E+P is similar to the price of T+P
     (i.e., PE+E+P ≈ T+P: 0.6 T + 0.33 T + 0.16 T = 1.09 T,
     versus T + 0.16 T = 1.16 T)
   • Introduce MT culture and concepts within the entire company
   • Be ready for the New New thing in the translation industry,
     which of course is MT (as CAT tools were the New New thing 20
     years ago)
   • Have working knowledge and hands-on experience to be able to
     choose the best MT solution for each particular project
     (Language Pair, Area of Specialization, Volume)
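
The price comparison in the second bullet can be checked with a few lines of arithmetic. This is a minimal sketch that takes the slide's fractions at face value, with T as the price of human translation. Reading PE as post-editing, E as editing and P as proofreading is an assumption, as are the function names and the sample price of 100.

```python
# Fractions of the human translation price T, as given on the slide.
PE_RATE = 0.60  # PE, assumed to mean post-editing
E_RATE = 0.33   # E, assumed to mean editing
P_RATE = 0.16   # P, assumed to mean proofreading


def mt_workflow_price(t: float) -> float:
    """Price of MT + PE + E + P; the slide's formula adds no cost for the MT step."""
    return t * (PE_RATE + E_RATE + P_RATE)


def human_workflow_price(t: float) -> float:
    """Price of T + P: human translation followed by proofreading."""
    return t * (1.0 + P_RATE)


if __name__ == "__main__":
    t = 100.0  # hypothetical translation price, for illustration only
    print(f"MT+PE+E+P: {mt_workflow_price(t):.2f}")
    print(f"T+P:       {human_workflow_price(t):.2f}")
```

With these fractions the two workflows land within a few percent of each other, which matches the point of the bullet: the expected gain is in volume and turnaround time, not in price.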





  Unexpected Experiences and Issues
   • The lack of information and documented procedures on
     how post-editing should be done (we found only one paper,
     from TAUS)
   • The difficulty of measuring human post-editing quality and
     productivity
   • The difficulty of measuring the quality of an engine with
     BLEU (for example, an engine with a BLEU of 70 can produce
     worse output than engines with a BLEU of 40)
   • The difficulty of aligning existing parallel data without
     human intervention
   • Translators' fear of being displaced by MT
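
One possible reason BLEU scores are hard to compare across engines (our reading, not something the slide states) is that the score depends heavily on the reference translations it is computed against. The sketch below is a simplified corpus-level BLEU with a single reference per segment, whitespace tokenization and no smoothing. It is not the scoring script shipped with Moses, and the function names are illustrative.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(hypotheses, references, max_n=4):
    """Simplified corpus-level BLEU on a 0-100 scale."""
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            h_counts, r_counts = ngrams(h, n), ngrams(r, n)
            # Clipped counts: an n-gram is credited at most as often
            # as it occurs in the reference.
            matches[n - 1] += sum((h_counts & r_counts).values())
            totals[n - 1] += sum(h_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0  # without smoothing, one empty n-gram order zeroes the score
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    output = ["the contract shall be governed by the laws of spain"]
    same_wording = ["the contract shall be governed by the laws of spain"]
    other_wording = ["this agreement is governed by spanish law"]
    # The same MT output, scored against two acceptable references.
    print(bleu(output, same_wording))
    print(bleu(output, other_wording))
```

The same output scores very differently depending on how the reference happens to be worded, so a BLEU figure says little unless the engines being compared are scored on the same test set.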




  Next steps
   • Scripts to train (automate the pre-processing of data used to train
     Moses engines: take out placeables such as numbers, acronyms, XML tags,
     e-mail addresses and URLs / remove short and long segments /
     spell-check source and target / remove segments where source and target
     have different structures)
   • Scripts to translate (automate the pre- and post-processing for
     translations with Moses engines: go from TMX to TXT and from TXT to TMX)
   • Plug-in for Moses interaction with CAT tools (create or find plug-ins to
     use Moses directly from CAT tools, like the ones for Okapi or GlobalSight.
     This could save us from going back and forth between TMX and TXT. Plug-ins
     for Moses like the plug-ins that connect Trados with FreeTranslation,
     Systran and others)
   • Moses GUI (create or find a user interface that lets less tech-savvy
     users interact directly with Moses)
   • Keep testing more mature MT implementations
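
The training pre-processing described in the first bullet could be sketched roughly as follows. This is a minimal illustration, not the scripts we actually use. Replacing placeables with placeholder tokens is one way to read "take out", and the token names, the regular expressions and the length thresholds are all assumptions. Acronym handling and spell checking are left out because they need language-specific resources.

```python
import re

# Order matters: tags, URLs and e-mail addresses are masked before bare numbers.
# The placeholder tokens (@TAG@ etc.) are hypothetical names.
PLACEABLES = [
    (re.compile(r"<[^>]+>"), "@TAG@"),
    (re.compile(r"https?://\S+|www\.\S+"), "@URL@"),
    (re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"), "@EMAIL@"),
    (re.compile(r"\b\d+(?:[.,]\d+)*\b"), "@NUM@"),
]


def mask_placeables(segment: str) -> str:
    """Replace XML tags, URLs, e-mail addresses and numbers with placeholders."""
    for pattern, token in PLACEABLES:
        segment = pattern.sub(token, segment)
    return segment


def keep_pair(source: str, target: str,
              min_tokens: int = 3, max_tokens: int = 80,
              max_ratio: float = 2.0) -> bool:
    """Decide whether a masked segment pair is usable for training."""
    src_len, tgt_len = len(source.split()), len(target.split())
    # Remove short and long segments (min_tokens must be at least 1).
    if not (min_tokens <= src_len <= max_tokens
            and min_tokens <= tgt_len <= max_tokens):
        return False
    # A large length mismatch suggests a misaligned pair.
    if max(src_len, tgt_len) / min(src_len, tgt_len) > max_ratio:
        return False
    # Different placeable counts suggest a different structure.
    return all(source.count(token) == target.count(token)
               for _, token in PLACEABLES)


def clean_corpus(pairs):
    """Mask placeables and drop the pairs that fail the filters."""
    cleaned = []
    for source, target in pairs:
        source = mask_placeables(source.strip())
        target = mask_placeables(target.strip())
        if keep_pair(source, target):
            cleaned.append((source, target))
    return cleaned
```

Calling clean_corpus on a list of (source, target) tuples returns only the pairs that pass every filter, with placeables already masked. The thresholds would need tuning per language pair and domain.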

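The TMX to TXT round trip mentioned under "Scripts to translate" could look roughly like this, using only the Python standard library. It is a sketch under stated assumptions: one source and one target language per translation unit, inline markup flattened to plain text, and hypothetical header values. The function names are illustrative, not part of Moses or of any CAT tool.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def tmx_to_pairs(tmx_text, src_lang="en", tgt_lang="es"):
    """Extract (source, target) segment pairs from a TMX document string."""
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.iter("tuv"):
            # Older TMX files use a plain "lang" attribute instead of xml:lang.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() flattens inline elements; collapsing whitespace
                # keeps each segment on a single line of the TXT file.
                text = " ".join("".join(seg.itertext()).split())
                segs[lang.split("-")[0]] = text
        if segs.get(src_lang) and segs.get(tgt_lang):
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


def pairs_to_tmx(pairs, src_lang="en", tgt_lang="es"):
    """Build a minimal TMX document string from (source, target) pairs."""
    tmx = ET.Element("tmx", {"version": "1.4"})
    ET.SubElement(tmx, "header", {
        "creationtool": "tmx-sketch",  # hypothetical tool name
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plaintext",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in pairs:
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")
```

The pairs returned by tmx_to_pairs can be written line by line to two parallel text files, one per language, and the translated lines can be fed back through pairs_to_tmx to rebuild a TMX for the CAT tool.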




  Our Conclusions (Till Now)
   • Moses has great potential, but it is not yet optimized for LSPs
   • Its interface is not user-friendly (in fact there is no GUI yet)
   • There are no reliable sources of information for certain niches (a
     lot of work is still needed to obtain a reliable corpus in some
     niches with the tools that exist at the moment)
   • We still have to wait for the development of the new features
     that LSPs require (glossaries, blacklist words, labels)
   • It is necessary to optimize the internal processes of the LSPs to
     adapt to the needs of Moses (Management of Memories, product
     names and trademarks, languages)
   • It is better to work with an engine for a specific client than with
     one for a specific industry or area of specialization

Thank you!




Gustavo Lucardi
@glucardi
glucardi@trustedtranslations.com
http://www.trustedtranslations.com
http://translation-blog.trustedtranslations.com

Dev Dives: Streamline document processing with UiPath Studio Web
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trusted Translations, 25 March 2012

  • 1. TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE Moses from the point of view of an LSP: The Trusted Translations Experience By Gustavo Lucardi
  • 2. Moses from the point of view of an LSP: The Trusted Translations Experience TAUS Open Source Machine Translation Pre Conference Workshops GALA MONACO 2012 Sunday 25 March 13:30 - 17:30 Gustavo Lucardi COO Trusted Translations, Inc. @glucardi
  • 3. An On-going Story
     • Not a Success Story (Yet)
     • An On-going Story (Since October 2010)
     • Moses from the Point of View of an LSP:
        ▫ Why Moses? Why Now?
        ▫ What exactly are we doing?
        ▫ Our Expectations (What we are looking for)
        ▫ Unexpected Experiences and Issues
        ▫ Next Steps
        ▫ Our Conclusions (Till Now)
  • 4. Why Moses? Why Now?
     • Because we have to
        ▫ In Friedman’s The World is Flat, the refrigeration industry was transformed by the advent of the electric refrigerator, which killed the ice block industry. Transform or die.
        ▫ Every LSP has to start learning about MT or prepare to suffer the same fate as the ice block makers.
     • We believe that the future of the professional translation industry will include MT as a part (central?) of the process.
     • To expose all the members of our company to this nascent model and its consequences in all the areas of the business cycle, from sales to production, administration, in-house resources, freelancers, and of course, IT.
  • 5. What exactly are we doing?
     • We built two Moses engines and are running tests:
        ▫ Legal English to Spanish (based on the legal specialization of many clients)
        ▫ Technical English to Spanish (based on technical experience from one specific client)
     • We are also testing other MT solutions like:
        ▫ Pangea MT (Pangeanic)
        ▫ DoMY (Translation Precision Tools)
        ▫ Language Studio (Asia Online)
        ▫ Smart MATE (Applied Languages)
        ▫ LetsMT
     • Manuel Herranz from Pangeanic in particular helped us understand MT
  • 6. Our Expectations (What we are looking for)
     • To be able to deliver higher volumes in less time while maintaining quality and consistency (Quality and Time, not Price)
     • The price of doing MT+PE+E+P is similar to the price of T+P (i.e. PE+E+P ≈ T+P: T*0.6 + T*0.33 + T*0.16 = 1.09 T, versus T + T*0.16 = 1.16 T)
     • Introduce MT culture and concepts within the entire company
     • Be ready for the New New Thing in the translation industry, which of course is MT (as CAT tools were the New New Thing 20 years ago)
     • Have working knowledge and hands-on experience to be able to choose the best MT solution for each particular project (Language Pair, Area of Specialization, Volume)
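The price comparison on slide 6 can be checked with a few lines of arithmetic. This is only a sketch: the rates are the fractions of the translation price T quoted on the slide, while reading PE, E, P and T as post-editing, editing, proofreading and translation, and treating the raw MT output as free, are assumptions not stated in the deck.

```python
# Fractions of the full translation price T, as quoted on slide 6.
# Assumption: PE = post-editing, E = editing, P = proofreading.
RATES = {"PE": 0.60, "E": 0.33, "P": 0.16}


def mt_workflow_cost(t):
    """Cost of MT + PE + E + P; the MT step itself is assumed to cost nothing."""
    return t * (RATES["PE"] + RATES["E"] + RATES["P"])


def human_workflow_cost(t):
    """Cost of T + P: full-price translation plus proofreading."""
    return t * (1 + RATES["P"])


t = 1.0  # normalise the translation price to 1
mt_cost = mt_workflow_cost(t)
human_cost = human_workflow_cost(t)
print(f"MT+PE+E+P = {mt_cost:.2f} T")
print(f"T+P       = {human_cost:.2f} T")
```

Under these rates the two workflows differ by well under a tenth of T, which matches the slide's point that the expected gain is in volume and turnaround rather than price.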
  • 7. Unexpected Experiences and Issues
     • The lack of information and documented procedures on how post-editing should be done (we found only one paper, from TAUS)
     • The difficulty of measuring human post-editing quality and productivity
     • The difficulty of measuring the quality of an engine with BLEU (for example, an engine with a BLEU of 70 can produce worse output than engines with a BLEU of 40)
     • The difficulty of aligning existing parallel data without human intervention
     • Translators' fear of being displaced by MT
  • 8. Next steps
     • Scripts to Train: automate through scripts the pre-processing of data to train Moses engines
        ▫ Take out placeables like numbers, acronyms, XML tags, e-mail addresses and URLs
        ▫ Remove short segments and long segments
        ▫ Spell check on source and target
        ▫ Remove segments where target and source have different structures
     • Scripts to Translate: automate through scripts the pre- and post-processing for translations with Moses engines (go from TMX to TXT and from TXT to TMX)
     • Plug-in for Moses interaction with CAT tools: create or find plug-ins to use Moses directly through CAT tools, like the ones for Okapi or Globalsight. This could spare us the conversion back and forth between TMX and TXT. We are looking for plug-ins for Moses like the plug-ins for Trados that connect with FreeTranslation, Systran and others
     • Moses GUI: create or find a user interface to allow the less tech-savvy to interact directly with Moses
     • Keep testing more mature MT implementations
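The "Scripts to Train" and "Scripts to Translate" items on slide 8 could be prototyped along the following lines. This is a rough sketch under stated assumptions, not the scripts Trusted Translations wrote: the placeholder tokens, the regular expressions, the length thresholds and the function names are all hypothetical, "different structures" is approximated by a length ratio plus matching placeholder counts, and acronym handling and spell checking are left out.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical placeholder tokens. URLs and e-mails are masked before tags
# and bare numbers so that their parts are not masked piecemeal.
PLACEABLES = [
    ("__URL__", re.compile(r'https?://[^\s<>"]+|www\.[^\s<>"]+')),
    ("__EMAIL__", re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")),
    ("__TAG__", re.compile(r"<[^<>]+>")),
    ("__NUM__", re.compile(r"\d+(?:[.,]\d+)*")),
]
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def mask_placeables(segment):
    """Replace URLs, e-mail addresses, XML tags and numbers with tokens."""
    for token, pattern in PLACEABLES:
        segment = pattern.sub(token, segment)
    return segment


def placeable_counts(masked):
    """Count each placeholder token in an already masked segment."""
    return {token: masked.count(token) for token, _ in PLACEABLES}


def keep_pair(source, target, min_tokens=3, max_tokens=80, max_ratio=2.0):
    """Drop very short or long pairs and pairs whose shape differs too much."""
    s_len, t_len = len(source.split()), len(target.split())
    if not (min_tokens <= s_len <= max_tokens):
        return False
    if not (min_tokens <= t_len <= max_tokens):
        return False
    if max(s_len, t_len) / min(s_len, t_len) > max_ratio:
        return False
    # Crude structural check: both sides must carry the same placeables.
    return placeable_counts(source) == placeable_counts(target)


def clean_corpus(pairs):
    """Mask placeables in every pair, then keep only pairs that pass the filter."""
    cleaned = []
    for source, target in pairs:
        src = mask_placeables(source.strip())
        tgt = mask_placeables(target.strip())
        if keep_pair(src, tgt):
            cleaned.append((src, tgt))
    return cleaned


def tmx_to_pairs(tmx_text, src_lang="en", tgt_lang="es"):
    """Extract (source, target) pairs from a TMX string; inline markup is flattened."""
    pairs = []
    for tu in ET.fromstring(tmx_text).iter("tu"):
        segs = {}
        for tuv in tu.iter("tuv"):
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                segs[lang.split("-")[0]] = "".join(seg.itertext())
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


sample = [
    ("The contract was signed on 12 March 2011 by both parties.",
     "El contrato fue firmado el 12 de marzo de 2011 por ambas partes."),
    ("Yes.", "Sí."),  # too short
    ("Contact us at info@example.com for details.",
     "Escríbanos para obtener más detalles."),  # e-mail missing in target
]
cleaned = clean_corpus(sample)
for src, tgt in cleaned:
    print(src, "|||", tgt)
```

Only the first sample pair survives the filter. Writing the source and target sides of the surviving pairs to two line-aligned text files, one segment per line, gives the TXT form used for Moses training. The reverse step (TXT back to TMX) is not shown.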
  • 9. Our Conclusions (Till Now)
     • Moses has great potential, but it is not yet optimized for LSPs
     • Its interface is not user-friendly (in fact there is no GUI yet)
     • There are no reliable sources of information for certain niches (a lot of work is still needed to obtain a reliable corpus in some niches with the tools that exist at the moment)
     • We still have to wait for the development of the new features that LSPs require (glossaries, blacklisted words, labels)
     • It is necessary to optimize the internal processes of the LSPs to adapt to the needs of Moses (management of memories, product names and trademarks, languages)
     • It's better to work with an engine for a specific client than with one for a specific industry or area of specialization