An Automatic Quality Metric for Evaluating Simultaneous Interpretation
Mana Makinae, Katsuhito Sudoh, Masaru Yamada, Satoshi Nakamura

TL;DR
This paper introduces an automatic evaluation metric for simultaneous interpretation that measures word order synchronization using cross-lingual models, addressing latency-quality trade-offs especially in distant language pairs.
Contribution
It proposes a novel metric based on rank correlation and pre-trained models to evaluate SI and SiMT focusing on word order alignment.
Findings
Effective in measuring word order synchronization
Validated on NAIST-SIC-Aligned and JNPC datasets
Correlates well with human judgment
Abstract
Simultaneous interpretation (SI), the translation of one language to another in real time, starts translation before the original speech has finished. Its evaluation needs to consider both latency and quality. This trade-off is challenging especially for distant word order language pairs such as English and Japanese. To handle this word order gap, interpreters maintain the word order of the source language as much as possible to keep up with original language to minimize its latency while maintaining its quality, whereas in translation reordering happens to keep fluency in the target language. This means outputs synchronized with the source language are desirable based on the real SI situation, and it's a key for further progress in computational SI and simultaneous machine translation (SiMT). In this work, we propose an automatic evaluation metric for SI and SiMT focusing on word order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
