What is Quality? A Machine Translation Perspective

•Télécharger en tant que PPTX, PDF•

0 j'aime•655 vues

Presentation given by Tony O'Dowd, Founder and Chief Architect, KantanMT at the Gala Roundtable in Carton House, Ireland.

Technologie Business

What we aim to cover?
 The MT & Quality Relationship
 What is quality?
 Possible ways of measuring it
 Automated evaluation methods
 Who needs to measure quality
 Localisation stakeholders
 Conclusion

Machine Translation & Quality

The Quality & MT Relationship

Machine Translation & Quality

Attributes of Quality
 Language Attributes
 Adequacy



Accuracy of generated texts
Based on word recall & precision

 Fluency




Comprehensibility of texts
Readability, understandability
Based on phrase reuse and
assembly

 Task-oriented Attributes
 Productivity


Post-editing speed

 Acceptability



Fit-for-purpose measurement
Usable translations within the
context of the end user

Machine Translation & Quality

Automated Evaluations
 Many difference techniques available



All compute similarity of generated texts to reference texts
The smaller the difference => the better the quality!

NIST

Fluency

Usability

GTM

F-Measure

Productivity

TER

Adequacy

BLEU

Acceptability
METEOR

Language

Task
Machine Translation & Quality

Who needs to measure Quality?
 The Localisation Stakeholder Dilemma
 Developers of MT Engines




Automated BLEU, METEOR, F-MEASURE, TER ideal and practical
No individual measurement has absolute meaning


but points quality curve in the right direction within a domain

Machine Translation & Quality

Who needs to measure Quality?
 The Localisation Stakeholder Dilemma
 Production Teams (PMs, LEs and QEs)


Need segment measurements on quality and PE efforts



Determine tiered segment post-edit rate
Distribution of post-editing tasks based on segment quality

 Localisation Managers


Need productivity measurements to predict budget and schedule



Aka Project Segment Reports
MT Measurements need to ‘fit’ business planning and charge models

 Translators


Unfortunately, don’t get a fair deal


No segment information, just top level project
Machine Translation & Quality

F-Measure

TER

BLEU

GTM

METEOR

NIST

MT Developers

Production

The Quality & MT Relationship

Machine Translation & Quality

Conclusions
 There are many automated MT quality measurements




Mostly suitable for MT developers
Not optimal for production teams
Of no use to translators

 All rely on reference texts to compute measurements

 What’s needed?
 Segment level measurements



Drive project schedule and charge model
High correlation to human effort

 Do not rely on reference texts to compute measurements

Machine Translation & Quality

Recommandé

5. bleuHiroshi Matsumoto

Deview2013 naver labs_nsmt_외부공개버전_김준석NAVER D2

[2A4]DeepLearningAtNAVERNAVER D2

KantanFest: Mindaugas Kazlauskaskantanmt

Kantanfest: Dimitar Shterionov - Part 2kantanmt

Kantanfest: Laura Casanellaskantanmt

Kantanfest: Dimitar Shterionov - Part 1kantanmt

KantanFest: Andy Waykantanmt

Recommandé

5. bleuHiroshi Matsumoto

Deview2013 naver labs_nsmt_외부공개버전_김준석NAVER D2

[2A4]DeepLearningAtNAVERNAVER D2

KantanFest: Mindaugas Kazlauskaskantanmt

Kantanfest: Dimitar Shterionov - Part 2kantanmt

Kantanfest: Laura Casanellaskantanmt

Kantanfest: Dimitar Shterionov - Part 1kantanmt

KantanFest: Andy Waykantanmt

KantanFest: Tony O'Dowdkantanmt

Get Started with KantanNeuralkantanmt

You Asked, We Will Answerkantanmt

ATC Summit 2016: The 7th Habit of 7 Habits of Effective MT Systemskantanmt

Cross Border Selling: Breaking the Language Barrier with Automated Translationkantanmt

Go global with this Winning Combination – Content strategy and Machine Transl...kantanmt

Webinar automotive and engineering content 16.06.16kantanmt

IC4 Cloud Security Workshop 2016kantanmt

New Ways to Engage Clients with Custom Machine Translationkantanmt

Improving your Bottom Line with Custom Machine Translationkantanmt

How to Achieve Agile Localization for High-Volume Content with Machine Transl...kantanmt

How to Improve Translation Productivitykantanmt

How to save 16 million euro for your start up businesskantanmt

What is the Economic Case for Machine Translation?kantanmt

Tips for Preparing Training Data for High Quality Machine Translationkantanmt

EAMT Workshop 2015 - KantanMTkantanmt

Breaking Language Barriers: Machine Translation for eCommercekantanmt

Cloud Computing: IC4 Cloud On-Boarding Clinic, DCUkantanmt

How to set up a high tech business in the Cloud for 2,000 EURkantanmt

How Does Your MT System Measure Up? tekom/tcworld 2014 kantanmt

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Contenu connexe

Plus de kantanmt

KantanFest: Tony O'Dowdkantanmt

Get Started with KantanNeuralkantanmt

You Asked, We Will Answerkantanmt

ATC Summit 2016: The 7th Habit of 7 Habits of Effective MT Systemskantanmt

Cross Border Selling: Breaking the Language Barrier with Automated Translationkantanmt

Go global with this Winning Combination – Content strategy and Machine Transl...kantanmt

Webinar automotive and engineering content 16.06.16kantanmt

IC4 Cloud Security Workshop 2016kantanmt

New Ways to Engage Clients with Custom Machine Translationkantanmt

Improving your Bottom Line with Custom Machine Translationkantanmt

How to Achieve Agile Localization for High-Volume Content with Machine Transl...kantanmt

How to Improve Translation Productivitykantanmt

How to save 16 million euro for your start up businesskantanmt

What is the Economic Case for Machine Translation?kantanmt

Tips for Preparing Training Data for High Quality Machine Translationkantanmt

EAMT Workshop 2015 - KantanMTkantanmt

Breaking Language Barriers: Machine Translation for eCommercekantanmt

Cloud Computing: IC4 Cloud On-Boarding Clinic, DCUkantanmt

How to set up a high tech business in the Cloud for 2,000 EURkantanmt

How Does Your MT System Measure Up? tekom/tcworld 2014 kantanmt

Plus de kantanmt (20)

KantanFest: Tony O'Dowd

Get Started with KantanNeural

You Asked, We Will Answer

ATC Summit 2016: The 7th Habit of 7 Habits of Effective MT Systems

Cross Border Selling: Breaking the Language Barrier with Automated Translation

Go global with this Winning Combination – Content strategy and Machine Transl...

Webinar automotive and engineering content 16.06.16

IC4 Cloud Security Workshop 2016

New Ways to Engage Clients with Custom Machine Translation

Improving your Bottom Line with Custom Machine Translation

How to Achieve Agile Localization for High-Volume Content with Machine Transl...

How to Improve Translation Productivity

How to save 16 million euro for your start up business

What is the Economic Case for Machine Translation?

Tips for Preparing Training Data for High Quality Machine Translation

EAMT Workshop 2015 - KantanMT

Breaking Language Barriers: Machine Translation for eCommerce

Cloud Computing: IC4 Cloud On-Boarding Clinic, DCU

How to set up a high tech business in the Cloud for 2,000 EUR

How Does Your MT System Measure Up? tekom/tcworld 2014

Dernier

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Developing An App To Navigate The Roads of BrazilV3cube

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Scaling API-first – The story of a global engineering organizationRadu Cotescu

GenAI Risks & Security Meetup 01052024.pdflior mazor

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Partners Life - Insurer Innovation Award 2024The Digital Insurer

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous

Dernier (20)

Advantages of Hiring UIUX Design Service Providers for Your Business

How to Troubleshoot Apps for the Modern Connected Worker

Exploring the Future Potential of AI-Enabled Smartphone Processors

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

Powerful Google developer tools for immediate impact! (2023-24 C)

What Are The Drone Anti-jamming Systems Technology?

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Developing An App To Navigate The Roads of Brazil

Boost PC performance: How more available memory can improve productivity

Scaling API-first – The story of a global engineering organization

GenAI Risks & Security Meetup 01052024.pdf

GenCyber Cyber Security Day Presentation

Partners Life - Insurer Innovation Award 2024

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

AWS Community Day CPH - Three problems of Terraform

Driving Behavioral Change for Information Management through Data-Driven Gree...

Axa Assurance Maroc - Insurer Innovation Award 2024

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

What is Quality? A Machine Translation Perspective

1. No Hardware. No Software. No Hassle MT.

2. Machine Translation & Quality

3. What we aim to cover?  The MT & Quality Relationship  What is quality?  Possible ways of measuring it  Automated evaluation methods  Who needs to measure quality  Localisation stakeholders  Conclusion Machine Translation & Quality

4. The Quality & MT Relationship Machine Translation & Quality

5. Attributes of Quality  Language Attributes  Adequacy   Accuracy of generated texts Based on word recall & precision  Fluency    Comprehensibility of texts Readability, understandability Based on phrase reuse and assembly  Task-oriented Attributes  Productivity  Post-editing speed  Acceptability   Fit-for-purpose measurement Usable translations within the context of the end user Machine Translation & Quality

6. Automated Evaluations  Many difference techniques available   All compute similarity of generated texts to reference texts The smaller the difference => the better the quality! NIST Fluency Usability GTM F-Measure Productivity TER Adequacy BLEU Acceptability METEOR Language Task Machine Translation & Quality

7. Who needs to measure Quality?  The Localisation Stakeholder Dilemma  Developers of MT Engines   Automated BLEU, METEOR, F-MEASURE, TER ideal and practical No individual measurement has absolute meaning  but points quality curve in the right direction within a domain Machine Translation & Quality

8. Who needs to measure Quality?  The Localisation Stakeholder Dilemma  Production Teams (PMs, LEs and QEs)  Need segment measurements on quality and PE efforts   Determine tiered segment post-edit rate Distribution of post-editing tasks based on segment quality  Localisation Managers  Need productivity measurements to predict budget and schedule   Aka Project Segment Reports MT Measurements need to ‘fit’ business planning and charge models  Translators  Unfortunately, don’t get a fair deal  No segment information, just top level project Machine Translation & Quality

9. F-Measure TER BLEU GTM METEOR NIST MT Developers Production The Quality & MT Relationship Machine Translation & Quality

10. Conclusions  There are many automated MT quality measurements    Mostly suitable for MT developers Not optimal for production teams Of no use to translators  All rely on reference texts to compute measurements  What’s needed?  Segment level measurements   Drive project schedule and charge model High correlation to human effort  Do not rely on reference texts to compute measurements Machine Translation & Quality