Self-Modifying State Modeling for Simultaneous Machine Translation

Donglei Yu; Xiaomian Kang; Yuchen Liu; Yu Zhou; Chengqing Zong

arXiv:2406.02237·cs.CL·June 5, 2024

Self-Modifying State Modeling for Simultaneous Machine Translation

Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces SM², a novel training paradigm for Simultaneous Machine Translation that optimizes decisions at each state independently, enabling better policy learning and compatibility with bidirectional encoders.

Contribution

SM² eliminates the need for decision path exploration, allowing precise decision optimization and improved translation quality in SiMT models.

Findings

01

SM² outperforms strong baselines in SiMT tasks.

02

SM² enables offline models to acquire SiMT capabilities through fine-tuning.

03

SM² achieves higher translation quality with bidirectional encoders.

Abstract

Simultaneous Machine Translation (SiMT) generates target outputs while receiving stream source inputs and requires a read/write policy to decide whether to wait for the next source token or generate a new target token, whose decisions form a \textit{decision path}. Existing SiMT methods, which learn the policy by exploring various decision paths in training, face inherent limitations. These methods not only fail to precisely optimize the policy due to the inability to accurately assess the individual impact of each decision on SiMT performance, but also cannot sufficiently explore all potential paths because of their vast number. Besides, building decision paths requires unidirectional encoders to simulate streaming source inputs, which impairs the translation quality of SiMT models. To solve these issues, we propose \textbf{S}elf-\textbf{M}odifying \textbf{S}tate \textbf{M}odeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EurekaForNLP/SM2
pytorchOfficial

Videos

Self-Modifying State Modeling for Simultaneous Machine Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques