Machine Translation Evaluation with BERT Regressor

Hiroki Shimanaka; Tomoyuki Kajiwara; Mamoru Komachi

arXiv:1907.12679·cs.CL·July 31, 2019·23 cites

Machine Translation Evaluation with BERT Regressor

Hiroki Shimanaka, Tomoyuki Kajiwara, Mamoru Komachi

PDF

Open Access

TL;DR

This paper presents a new machine translation evaluation metric based on BERT, achieving state-of-the-art results in segment-level evaluation for English translation pairs.

Contribution

It introduces a BERT-based regression model for automatic translation quality assessment, outperforming existing metrics on WMT-2017 data.

Findings

01

Achieves state-of-the-art performance on WMT-2017 dataset

02

Effective at segment-level translation evaluation

03

Applicable across multiple English translation pairs

Abstract

We introduce the metric using BERT (Bidirectional Encoder Representations from Transformers) (Devlin et al., 2019) for automatic machine translation evaluation. The experimental results of the WMT-2017 Metrics Shared Task dataset show that our metric achieves state-of-the-art performance in segment-level metrics task for all to-English language pairs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax