TL;DR
REINA is a novel loss function based on information theory that improves the latency and quality tradeoff in simultaneous speech translation systems, achieving state-of-the-art results.
Contribution
This paper introduces REINA, a new information-theoretic loss for training adaptive policies in SimulST, enhancing the latency/quality tradeoff over previous methods.
Findings
REINA improves the latency/quality tradeoff by up to 21%.
State-of-the-art streaming results on multiple language pairs.
Effective training using only open source or synthetic data.
Abstract
Simultaneous Speech Translation (SimulST) systems stream in audio while simultaneously emitting translated text or speech. Such systems face the significant challenge of balancing translation quality and latency. We introduce a strategy to optimize this tradeoff: wait for more input only if you gain information by doing so. Based on this strategy, we present Regularized Entropy INformation Adaptation (REINA), a novel loss to train an adaptive policy using an existing non-streaming translation model. We derive REINA from information theory principles and show that REINA helps push the reported Pareto frontier of the latency/quality tradeoff over prior works. Utilizing REINA, we train a SimulST model on French, Spanish and German, both from and into English. Training on only open source or synthetically generated data, we achieve state-of-the-art (SOTA) streaming results for models of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
