BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue, Jacob Parnell, Massimo Piccardi

TL;DR
This paper introduces BERTTune, a novel fine-tuning method for neural machine translation that uses BERTScore as a training objective, enabling models to better handle paraphrases and synonyms, improving translation quality.
Contribution
It proposes three methods for integrating BERTScore into training, allowing end-to-end differentiable fine-tuning of translation models with improved performance.
Findings
Up to 0.58 BLEU score improvement
Up to 0.76 BERTScore (F_BERT) increase
Effective across four diverse language pairs
Abstract
Neural machine translation models are often biased toward the limited translation references seen during training. To amend this form of overfitting, in this paper we propose fine-tuning the models with a novel training objective based on the recently-proposed BERTScore evaluation metric. BERTScore is a scoring function based on contextual embeddings that overcomes the typical limitations of n-gram-based metrics (e.g. synonyms, paraphrases), allowing translations that are different from the references, yet close in the contextual embedding space, to be treated as substantially correct. To be able to use BERTScore as a training objective, we propose three approaches for generating soft predictions, allowing the network to remain completely differentiable end-to-end. Experiments carried out over four, diverse language pairs have achieved improvements of up to 0.58 pp (3.28%) in BLEU score…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
