Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models
Jiahang Cao, Qiang Zhang, Jingkai Sun, Jiaxu Wang, Hao Cheng, Yulin Li, Jun Ma, Kun Wu, Zhiyuan Xu, Yecheng Shao, Wen Zhao, Gang Han, Yijie Guo, Renjing Xu

TL;DR
The paper introduces the Mamba Policy, a lightweight yet high-performing 3D diffusion policy that reduces parameter count significantly while maintaining or improving task performance, suitable for resource-limited devices.
Contribution
It proposes the XMamba Block and Mamba Policy framework, achieving over 80% parameter reduction with superior performance in 3D manipulation tasks.
Findings
Outperforms baselines on Adroit, Dexart, and MetaWorld datasets
Requires significantly less computational resources
Shows enhanced robustness in long-horizon scenarios
Abstract
Diffusion models have been widely employed in the field of 3D manipulation due to their efficient capability to learn distributions, allowing for precise prediction of action trajectories. However, diffusion models typically rely on large parameter UNet backbones as policy networks, which can be challenging to deploy on resource-constrained devices. Recently, the Mamba model has emerged as a promising solution for efficient modeling, offering low computational complexity and strong performance in sequence modeling. In this work, we propose the Mamba Policy, a lighter but stronger policy that reduces the parameter count by over 80% compared to the original policy network while achieving superior performance. Specifically, we introduce the XMamba Block, which effectively integrates input information with conditional features and leverages a combination of Mamba and Attention mechanisms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsClimate Change Policy and Economics · European Monetary and Fiscal Policies · Stochastic processes and financial applications
MethodsSoftmax · Attention Is All You Need · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Diffusion
