DSSP: Diffusion State Space Policy with Full-History Encoding
Zhiyuan Guan, Jianshu Hu, Han Fang, Yunpeng Jiang, Yize Huang, Shujia Li, Xiao Li, and Yutong Ban

TL;DR
DSSP introduces a history-conditioned diffusion policy using state space models to improve long-horizon robot manipulation tasks, achieving state-of-the-art results with efficient full-history encoding.
Contribution
The paper proposes DSSP, a novel diffusion policy with full-history encoding via state space models, enhancing long-term task performance in robot manipulation.
Findings
DSSP outperforms existing methods on simulation benchmarks.
DSSP achieves state-of-the-art results with smaller model size.
Hierarchical conditioning effectively captures long-term dependencies.
Abstract
Diffusion-based imitation learning has shown strong promise for robot manipulation. However, most existing policies condition only on the current observation or a short window of recent observations, limiting their ability to resolve history-dependent ambiguities in long-horizon tasks. To address this, we introduce DSSP, a history-conditioned Diffusion State Space Policy that enables efficient, full-history conditioning for robot manipulation. Leveraging the continuous sequence modeling properties of State Space Models (SSMs), our history encoder effectively compresses the entire observation stream into a compact context representation. To ensure this context preserves critical information regarding future state evolution, the encoder is optimized with a dynamics-aware auxiliary training objective. This high-level context representation is then seamlessly fused with recent state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
