SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window
Vikas Raunak, Tom Kocmi, Matt Post

TL;DR
SLIDE is a reference-free, document-level evaluation metric for machine translation that uses a sliding window approach to leverage source context, significantly improving accuracy over sentence-level metrics and rivaling reference-based methods.
Contribution
Introduces SLIDE, a novel reference-free evaluation metric that uses a sliding window to incorporate source context, enhancing document-level translation quality assessment.
Findings
SLIDE outperforms sentence-level baselines in pairwise system accuracy.
SLIDE can match the performance of reference-based metrics in some cases.
Source context can substitute references in disambiguating source ambiguities.
Abstract
Reference-based metrics that operate at the sentence-level typically outperform quality estimation metrics, which have access only to the source and system output. This is unsurprising, since references resolve ambiguities that may be present in the source. In this paper, we investigate whether additional source context can effectively substitute for a reference. We present a metric named SLIDE (SLIding Document Evaluator), which operates on blocks of sentences. SLIDE leverages a moving window that slides over each document in the test set, feeding each chunk of sentences into an unmodified, off-the-shelf quality estimation model. We find that SLIDE obtains significantly higher pairwise system accuracy than its sentence-level baseline, in some cases even eliminating the gap with reference-base metrics. This suggests that source context may provide the same information as a human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
