Automatic Song Translation for Tonal Languages
Fenfei Guo, Chen Zhang, Zhirui Zhang, Qixin He, Kejun Zhang, Jun Xie,, Jordan Boyd-Graber

TL;DR
This paper introduces GagaST, an unsupervised system for translating songs into tonal languages like Mandarin, effectively balancing meaning, singability, and intelligibility through novel metrics and a new benchmark.
Contribution
It presents the first unsupervised approach for English-Mandarin song translation that aligns tones with melody while preserving meaning and singability.
Findings
GagaST outperforms baseline models in balancing semantics and singability.
The new benchmark enables standardized evaluation of song translation quality.
Both automatic and human evaluations confirm GagaST's effectiveness.
Abstract
This paper develops automatic song translation (AST) for tonal languages and addresses the unique challenge of aligning words' tones with melody of a song in addition to conveying the original meaning. We propose three criteria for effective AST -- preserving meaning, singability and intelligibility -- and design metrics for these criteria. We develop a new benchmark for English--Mandarin song translation and develop an unsupervised AST system, Guided AliGnment for Automatic Song Translation (GagaST), which combines pre-training with three decoding constraints. Both automatic and human evaluations show GagaST successfully balances semantics and singability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Natural Language Processing Techniques
