Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation
Minjie Zhu, Yichen Zhu, Jinming Li, Junjie Wen, Zhiyuan Xu, Ning Liu,, Ran Cheng, Chaomin Shen, Yaxin Peng, Feifei Feng, Jian Tang

TL;DR
This paper introduces extbf{ extsc{ScaleDP}}, a scalable diffusion transformer policy that effectively increases model size to 1 billion parameters for robotic manipulation, improving performance and stability in visuomotor tasks.
Contribution
The paper proposes extbf{ extsc{ScaleDP}}, a novel method with modules that stabilize training and enable diffusion policies to scale to 1 billion parameters for robotic control.
Findings
extbf{ extsc{ScaleDP}} outperforms previous diffusion policies by 21.6% on MetaWorld tasks.
It achieves a 36.25% average improvement on real-world robot tasks.
Successfully scales diffusion policies from 10 million to 1 billion parameters.
Abstract
Diffusion Policy is a powerful technique tool for learning end-to-end visuomotor robot control. It is expected that Diffusion Policy possesses scalability, a key attribute for deep neural networks, typically suggesting that increasing model size would lead to enhanced performance. However, our observations indicate that Diffusion Policy in transformer architecture (\DP) struggles to scale effectively; even minor additions of layers can deteriorate training outcomes. To address this issue, we introduce Scalable Diffusion Transformer Policy for visuomotor learning. Our proposed method, namely \textbf{\methodname}, introduces two modules that improve the training dynamic of Diffusion Policy and allow the network to better handle multimodal action distribution. First, we identify that \DP~suffers from large gradient issues, making the optimization of Diffusion Policy unstable. To resolve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Surface Polishing Techniques · Copper Interconnects and Reliability · Magnetic Properties and Applications
