TL;DR
This paper introduces a real-time iteration scheme for diffusion policies in robotic manipulation, significantly reducing inference time without retraining, enabling efficient use of large pre-trained models in latency-critical tasks.
Contribution
It adapts the Real-Time Iteration scheme from optimal control to diffusion inference, providing a resource-efficient method to accelerate robotic policies without additional training or policy redesign.
Findings
Substantial reduction in inference time demonstrated in simulations
Maintains comparable performance to full-step denoising methods
Effective handling of discrete actions like grasping in manipulation tasks
Abstract
Diffusion Policies have demonstrated impressive performance in robotic manipulation tasks. However, their long inference time, resulting from an extensive iterative denoising process, and the need to execute an action chunk before the next prediction to maintain consistent actions limit their applicability to latency-critical tasks or simple tasks with a short cycle time. While recent methods explored distillation or alternative policy structures to accelerate inference, these often demand additional training, which can be resource-intensive for large robotic models. In this paper, we introduce a novel approach inspired by the Real-Time Iteration (RTI) Scheme, a method from optimal control that accelerates optimization by leveraging solutions from previous time steps as initial guesses for subsequent iterations. We explore the application of this scheme in diffusion inference and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
