DSSP: Diffusion State Space Policy with Full-History Encoding

Zhiyuan Guan; Jianshu Hu; Han Fang; Yunpeng Jiang; Yize Huang; Shujia Li; Xiao Li; and Yutong Ban

arXiv:2605.14598·cs.RO·May 22, 2026

DSSP: Diffusion State Space Policy with Full-History Encoding

Zhiyuan Guan, Jianshu Hu, Han Fang, Yunpeng Jiang, Yize Huang, Shujia Li, Xiao Li, and Yutong Ban

PDF

TL;DR

DSSP introduces a history-conditioned diffusion policy using state space models to improve long-horizon robot manipulation tasks, achieving state-of-the-art results with efficient full-history encoding.

Contribution

The paper proposes DSSP, a novel diffusion policy with full-history encoding via state space models, enhancing long-term task performance in robot manipulation.

Findings

01

DSSP outperforms existing methods on simulation benchmarks.

02

DSSP achieves state-of-the-art results with smaller model size.

03

Hierarchical conditioning effectively captures long-term dependencies.

Abstract

Diffusion-based imitation learning has shown strong promise for robot manipulation. However, most existing policies condition only on the current observation or a short window of recent observations, limiting their ability to resolve history-dependent ambiguities in long-horizon tasks. To address this, we introduce DSSP, a history-conditioned Diffusion State Space Policy that enables efficient, full-history conditioning for robot manipulation. Leveraging the continuous sequence modeling properties of State Space Models (SSMs), our history encoder effectively compresses the entire observation stream into a compact context representation. To ensure this context preserves critical information regarding future state evolution, the encoder is optimized with a dynamics-aware auxiliary training objective. This high-level context representation is then seamlessly fused with recent state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.