Discourse Cohesion Evaluation for Document-Level Neural Machine   Translation

Xin Tan; Longyin Zhang; Guodong Zhou

arXiv:2208.09118·cs.CL·August 22, 2022

Discourse Cohesion Evaluation for Document-Level Neural Machine Translation

Xin Tan, Longyin Zhang, Guodong Zhou

PDF

Open Access

TL;DR

This paper introduces DCoEM, a new evaluation method for document-level neural machine translation that assesses discourse cohesion across four manners, addressing the limitations of traditional sentence-level metrics like BLEU.

Contribution

The paper presents a novel discourse cohesion evaluation method and a comprehensive test suite to better measure document-level translation quality.

Findings

01

DCoEM effectively evaluates discourse cohesion in document translations.

02

The test suite covers four cohesive manners: reference, conjunction, substitution, and lexical cohesion.

03

Results show DCoEM's practicality and importance in assessing document-level NMT performance.

Abstract

It is well known that translations generated by an excellent document-level neural machine translation (NMT) model are consistent and coherent. However, existing sentence-level evaluation metrics like BLEU can hardly reflect the model's performance at the document level. To tackle this issue, we propose a Discourse Cohesion Evaluation Method (DCoEM) in this paper and contribute a new test suite that considers four cohesive manners (reference, conjunction, substitution, and lexical cohesion) to measure the cohesiveness of document translations. The evaluation results on recent document-level NMT systems show that our method is practical and essential in estimating translations at the document level.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsTest