End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu, Hao Xiong, Zhongjun He, Jiajun Zhang, Hua Wu, Haifeng, Wang, Chengqing Zong

TL;DR
This paper introduces a knowledge distillation method to enhance end-to-end speech translation models by leveraging a trained text translation teacher, resulting in significant BLEU score improvements on multiple language pairs.
Contribution
The paper presents a novel application of knowledge distillation from text translation models to improve end-to-end speech translation performance.
Findings
End-to-end speech translation is feasible for similar and dissimilar language pairs.
Knowledge distillation improves BLEU scores by over 3.5 points.
The approach reduces error propagation and model size.
Abstract
End-to-end speech translation (ST), which directly translates from source language speech into target language text, has attracted intensive attentions in recent years. Compared to conventional pipeline systems, end-to-end ST models have advantages of lower latency, smaller model size and less error propagation. However, the combination of speech recognition and text translation in one model is more difficult than each of these two tasks. In this paper, we propose a knowledge distillation approach to improve ST model by transferring the knowledge from text translation model. Specifically, we first train a text translation model, regarded as a teacher model, and then ST model is trained to learn output probabilities from teacher model through knowledge distillation. Experiments on English- French Augmented LibriSpeech and English-Chinese TED corpus show that end-to-end ST is possible to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
MethodsKnowledge Distillation
