DiscoTK: Using Discourse Structure for Machine Translation Evaluation

Shafiq Joty; Francisco Guzman; Lluis Marquez; Preslav Nakov

arXiv:1911.12547·cs.CL·December 2, 2019

DiscoTK: Using Discourse Structure for Machine Translation Evaluation

Shafiq Joty, Francisco Guzman, Lluis Marquez, Preslav Nakov

PDF

Open Access

TL;DR

DiscoTK introduces discourse-structure-based automatic metrics for machine translation evaluation, outperforming previous systems in correlating with human judgments on WMT datasets.

Contribution

It proposes a novel approach using discourse trees and convolution kernels, combined with existing metrics, to improve translation quality assessment.

Findings

01

Outperforms previous metrics in correlation with human judgments.

02

Effective use of discourse structure enhances evaluation accuracy.

03

Validated on WMT12 and WMT13 datasets.

Abstract

We present novel automatic metrics for machine translation evaluation that use discourse structure and convolution kernels to compare the discourse tree of an automatic translation with that of the human reference. We experiment with five transformations and augmentations of a base discourse tree representation based on the rhetorical structure theory, and we combine the kernel scores for each of them into a single score. Finally, we add other metrics from the ASIYA MT evaluation toolkit, and we tune the weights of the combination on actual human judgments. Experiments on the WMT12 and WMT13 metrics shared task datasets show correlation with human judgments that outperforms what the best systems that participated in these years achieved, both at the segment and at the system level.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsConvolution