TL;DR
This paper presents FBK's direct architecture approach for simultaneous translation and automatic subtitling, achieving lower latency and higher translation quality compared to previous systems.
Contribution
Introduces a unified direct model architecture for both simultaneous translation and subtitling, leveraging offline-trained models for real-time inference and subtitle synchronization.
Findings
Reduced computational latency compared to previous top systems.
Achieved up to 3.5 BLEU improvement in English-German translation.
Outperformed existing direct subtitling solutions in SubER metrics.
Abstract
This paper describes the FBK's participation in the Simultaneous Translation and Automatic Subtitling tracks of the IWSLT 2023 Evaluation Campaign. Our submission focused on the use of direct architectures to perform both tasks: for the simultaneous one, we leveraged the knowledge already acquired by offline-trained models and directly applied a policy to obtain the real-time inference; for the subtitling one, we adapted the direct ST model to produce well-formed subtitles and exploited the same architecture to produce timestamps needed for the subtitle synchronization with audiovisual content. Our English-German SimulST system shows a reduced computational-aware latency compared to the one achieved by the top-ranked systems in the 2021 and 2022 rounds of the task, with gains of up to 3.5 BLEU. Our automatic subtitling system outperforms the only existing solution based on a direct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
