SBAAM! Eliminating Transcript Dependency in Automatic Subtitling

Marco Gaido; Sara Papi; Matteo Negri; Mauro Cettolo; Luisa Bentivogli

arXiv:2405.10741·cs.CL·May 20, 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling

Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a novel model that generates subtitles directly from audio without relying on transcripts, improving accessibility and performance across languages.

Contribution

The first direct subtitle generation model that eliminates the need for intermediate transcripts for translation, segmentation, and timestamp prediction.

Findings

01

Achieves state-of-the-art performance across multiple languages

02

Outperforms transcript-dependent methods in accuracy

03

Validated through manual evaluation

Abstract

Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration. Past attempts to automate this process rely, to varying degrees, on automatic transcripts, employed diversely for the three subtasks. In response to the acknowledged limitations associated with this reliance on transcripts, recent research has shifted towards transcription-free solutions for translation and segmentation, leaving the direct generation of timestamps as uncharted territory. To fill this gap, we introduce the first direct model capable of producing automatic subtitles, entirely eliminating any dependence on intermediate transcripts also for timestamp prediction. Experimental results, backed by manual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling· underline

Taxonomy

TopicsNatural Language Processing Techniques · Translation Studies and Practices · Subtitles and Audiovisual Media