Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT)

Type of publication:	Idiap-RR
Citation:	Werlen_Idiap-RR-29-2016
Number:	Idiap-RR-29-2016
Year:	2016
Month:	11
Institution:	Idiap
Abstract:	In this paper, we define and assess a reference-based metric to evaluate the accuracy of pronoun translation (APT). The metric automatically aligns a candidate and a reference translation using GIZA++ augmented with specific heuristics, and then counts the number of identical or different pronouns, with provision for legitimate variations and omitted pronouns. All counts are then combined into one score. The metric is applied to the results of seven systems (including the baseline) that participated in the DiscoMT 2015 shared task on pronoun translation from English to French. The APT metric reaches around 0.993-0.999 Pearson correlation with human judges (depending on the parameters of APT), while other automatic metrics such as BLEU, METEOR, or those specific to pronouns used at DiscoMT 2015 reach only 0.972-0.986 Pearson correlation.
Keywords:
Projects:	Idiap, SUMMA
Authors:	Miculicich, Lesly; Popescu-Belis, Andrei
Crossref by:	MiculicichWerlen_DISCOMTATEMNLP_2017
Attachments
Werlen_Idiap-RR-29-2016.pdf (MD5: 2de270e263228c7281cfeee9e943ebfc)
Notes
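The abstract describes three steps: counting aligned pronoun pairs by case, combining the counts into one score, and correlating metric scores with human judgments. The following Python sketch illustrates those steps under stated assumptions and is not the authors' implementation. The alignment step (GIZA++ with heuristics) is taken as already done, so the input is a list of (candidate, reference) pronoun pairs. The case names, the EQUIVALENT table, the weights, and the weighted-ratio form of the score are all hypothetical choices made for illustration; the record itself does not specify them.

```python
from math import sqrt

# Hypothetical table of pronoun pairs treated as legitimate variations.
# Illustrative only; the report's actual equivalence rules are not given here.
EQUIVALENT = {frozenset({"ce", "il"}), frozenset({"ça", "cela"})}


def count_cases(pairs):
    """Tally aligned (candidate, reference) pronoun pairs by case.

    None marks a pronoun that is missing on one side (an omission).
    """
    counts = {"identical": 0, "equivalent": 0, "different": 0, "omitted": 0}
    for cand, ref in pairs:
        if cand is None or ref is None:
            counts["omitted"] += 1
        elif cand.lower() == ref.lower():
            counts["identical"] += 1
        elif frozenset({cand.lower(), ref.lower()}) in EQUIVALENT:
            counts["equivalent"] += 1
        else:
            counts["different"] += 1
    return counts


def combine(counts, weights):
    """Combine the case counts into one score (assumed form: weighted ratio)."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(weights[case] * n for case, n in counts.items()) / total


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)


# Made-up example pairs, one per case.
pairs = [("il", "il"), ("ce", "il"), ("elle", "il"), (None, "il")]
# Hypothetical weights: full credit for identical, half for equivalent.
weights = {"identical": 1.0, "equivalent": 0.5, "different": 0.0, "omitted": 0.0}
score = combine(count_cases(pairs), weights)
```

To validate such a metric, one would compute one score per system and pass the list of metric scores and the list of human scores to pearson. The weights play the role of the "parameters of APT" mentioned in the abstract only loosely; their real definition is in the attached report.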
