Towards the evaluation of automatic simultaneous speech translation from a communicative perspective
Claudio Fantinuoli, Bianca Prandi

TL;DR
This paper evaluates a real-time speech translation system by comparing it to professional interpreters using a communicative framework, highlighting differences in intelligibility and informativeness from a user-centric perspective.
Contribution
It introduces a novel evaluation approach for speech translation systems based on human interpreter assessment frameworks, emphasizing communication quality.
Findings
Humans outperform machines in intelligibility.
Machines are slightly better in informativeness.
Framework offers a user-centric evaluation perspective.
Abstract
In recent years, automatic speech-to-speech and speech-to-text translation has gained momentum thanks to advances in artificial intelligence, especially in the domains of speech recognition and machine translation. The quality of such applications is commonly tested with automatic metrics, such as BLEU, primarily with the goal of assessing improvements of releases or in the context of evaluation campaigns. However, little is known about how the output of such systems is perceived by end users or how they compare to human performances in similar communicative tasks. In this paper, we present the results of an experiment aimed at evaluating the quality of a real-time speech translation engine by comparing it to the performance of professional simultaneous interpreters. To do so, we adopt a framework developed for the assessment of human interpreters and use it to perform a manual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
