Self-Modifying State Modeling for Simultaneous Machine Translation
Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong

TL;DR
The paper introduces SM², a novel training paradigm for Simultaneous Machine Translation that optimizes decisions at each state independently, enabling better policy learning and compatibility with bidirectional encoders.
Contribution
SM² eliminates the need for decision path exploration, allowing precise decision optimization and improved translation quality in SiMT models.
Findings
SM² outperforms strong baselines in SiMT tasks.
SM² enables offline models to acquire SiMT capabilities through fine-tuning.
SM² achieves higher translation quality with bidirectional encoders.
Abstract
Simultaneous Machine Translation (SiMT) generates target outputs while receiving stream source inputs and requires a read/write policy to decide whether to wait for the next source token or generate a new target token, whose decisions form a \textit{decision path}. Existing SiMT methods, which learn the policy by exploring various decision paths in training, face inherent limitations. These methods not only fail to precisely optimize the policy due to the inability to accurately assess the individual impact of each decision on SiMT performance, but also cannot sufficiently explore all potential paths because of their vast number. Besides, building decision paths requires unidirectional encoders to simulate streaming source inputs, which impairs the translation quality of SiMT models. To solve these issues, we propose \textbf{S}elf-\textbf{M}odifying \textbf{S}tate \textbf{M}odeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques
