ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

Yuzhe Shang; Pengzhi Gao; Yazheng Yang; Jiayao Ma; Wei Liu; Jian Luan; Jinsong Su

arXiv:2603.14903·cs.CL·March 31, 2026

ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

Yuzhe Shang, Pengzhi Gao, Yazheng Yang, Jiayao Ma, Wei Liu, Jian Luan, Jinsong Su

PDF

TL;DR

ExPosST introduces an explicit position allocation framework for LLM-based simultaneous translation, improving efficiency and consistency across various positional encodings and policies.

Contribution

The paper proposes a novel explicit position allocation method and a policy-consistent fine-tuning strategy to enhance LLM-based simultaneous translation.

Findings

01

Supports diverse translation policies effectively.

02

Enables efficient decoding with fixed positional slots.

03

Bridges the gap between training and inference behaviors.

Abstract

Large language models (LLMs) have recently demonstrated promising performance in simultaneous machine translation (SimulMT). However, applying decoder-only LLMs to SimulMT introduces a positional mismatch, which leads to a dilemma between decoding efficiency and positional consistency. Existing approaches often rely on specific positional encodings or carefully designed prompting schemes, and thus fail to simultaneously achieve inference efficiency, positional consistency, and broad model compatibility. In this work, we propose ExPosST, a general framework that resolves this dilemma through explicit position allocation. ExPosST reserves fixed positional slots for incoming source tokens, enabling efficient decoding with KV cache across different positional encoding methods. To further bridge the gap between fine-tuning and inference, we introduce a policy-consistent fine-tuning strategy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.