On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
Yiming Wu, Huan Wang, Zhenghao Chen, Jianxin Pang, Dong Xu

TL;DR
This paper introduces LightDP, a framework that accelerates diffusion policies for robot manipulation on mobile devices by combining network compression, pruning, and distillation, enabling real-time, resource-efficient deployment.
Contribution
LightDP presents a unified pruning and retraining pipeline with consistency distillation to significantly reduce diffusion model complexity for mobile robot manipulation.
Findings
Achieves real-time action prediction on mobile devices.
Maintains competitive performance with state-of-the-art diffusion policies.
Effective model compression without significant accuracy loss.
Abstract
Diffusion Policies have significantly advanced robotic manipulation tasks via imitation learning, but their application on resource-constrained mobile platforms remains challenging due to computational inefficiency and extensive memory footprint. In this paper, we propose LightDP, a novel framework specifically designed to accelerate Diffusion Policies for real-time deployment on mobile devices. LightDP addresses the computational bottleneck through two core strategies: network compression of the denoising modules and reduction of the required sampling steps. We first conduct an extensive computational analysis on existing Diffusion Policy architectures, identifying the denoising network as the primary contributor to latency. To overcome performance degradation typically associated with conventional pruning methods, we introduce a unified pruning and retraining pipeline, optimizing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Neural Network Applications
