Recent Advances in End-to-End Simultaneous Speech Translation
Xiaoqian Liu, Guoqiang Hu, Yangfan Du, Erfeng He, Yingfeng Luo, Chen, Xu, Tong Xiao, Jingbo Zhu

TL;DR
This paper reviews recent progress in end-to-end simultaneous speech translation, highlighting key challenges like processing continuous speech, real-time constraints, quality-latency trade-offs, and data scarcity, and discusses potential solutions.
Contribution
It provides a comprehensive overview of recent advances in SimulST, identifying major challenges and proposing directions for future research in the field.
Findings
Identified key challenges in SimulST such as latency and data scarcity.
Summarized recent solutions and approaches to improve SimulST.
Highlighted future research directions for advancing SimulST.
Abstract
Simultaneous speech translation (SimulST) is a demanding task that involves generating translations in real-time while continuously processing speech input. This paper offers a comprehensive overview of the recent developments in SimulST research, focusing on four major challenges. Firstly, the complexities associated with processing lengthy and continuous speech streams pose significant hurdles. Secondly, satisfying real-time requirements presents inherent difficulties due to the need for immediate translation output. Thirdly, striking a balance between translation quality and latency constraints remains a critical challenge. Finally, the scarcity of annotated data adds another layer of complexity to the task. Through our exploration of these challenges and the proposed solutions, we aim to provide valuable insights into the current landscape of SimulST research and suggest promising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
