Machine Translation Verbosity Control for Automatic Dubbing

Surafel M. Lakew; Marcello Federico; Yue Wang; Cuong Hoang; Yogesh; Virkar; Roberto Barra-Chicote; Robert Enyedi

arXiv:2110.03847·cs.CL·October 11, 2021

Machine Translation Verbosity Control for Automatic Dubbing

Surafel M. Lakew, Marcello Federico, Yue Wang, Cuong Hoang, Yogesh, Virkar, Roberto Barra-Chicote, Robert Enyedi

PDF

TL;DR

This paper presents methods to control the verbosity of machine translation output to improve the quality of automatic dubbing, demonstrating benefits through both intrinsic and extrinsic evaluations on multiple languages.

Contribution

It introduces new techniques for MT verbosity control specifically tailored for automatic dubbing, enhancing translation alignment and dubbing quality.

Findings

01

Verbosity control improves dubbing synchronization

02

Proposed methods outperform state-of-the-art in intrinsic evaluations

03

Subjective tests show increased dubbing quality with verbosity control

Abstract

Automatic dubbing aims at seamlessly replacing the speech in a video document with synthetic speech in a different language. The task implies many challenges, one of which is generating translations that not only convey the original content, but also match the duration of the corresponding utterances. In this paper, we focus on the problem of controlling the verbosity of machine translation output, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. We propose new methods to control the verbosity of MT output and compare them against the state of the art with both intrinsic and extrinsic evaluations. For our experiments we use a public data set to dub English speeches into French, Italian, German and Spanish. Finally, we report extensive subjective tests that measure the impact of MT verbosity control on the final quality of dubbed video clips.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.