Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training

Sodtavilan Odonchimed; Tatsuya Matsushima; Simon Holk; Yusuke Iwasawa; Yutaka Matsuo

arXiv:2507.21452·cs.LG·July 30, 2025

Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training

Sodtavilan Odonchimed, Tatsuya Matsushima, Simon Holk, Yusuke Iwasawa, Yutaka Matsuo

PDF

TL;DR

This paper introduces RAGDP, a retrieval-augmented framework that speeds up diffusion policies in imitation learning without extra training, maintaining high accuracy even with significant acceleration.

Contribution

RAGDP leverages a knowledge base and retrieval to reduce diffusion steps, eliminating additional training and improving speed-accuracy trade-offs in diffusion policies.

Findings

01

RAGDP improves inference speed by up to 20 times.

02

RAGDP achieves 7% higher accuracy than distillation methods.

03

RAGDP maintains performance without extra training.

Abstract

Diffusion Policies (DPs) have attracted attention for their ability to achieve significant accuracy improvements in various imitation learning tasks. However, DPs depend on Diffusion Models, which require multiple noise removal steps to generate a single action, resulting in long generation times. To solve this problem, knowledge distillation-based methods such as Consistency Policy (CP) have been proposed. However, these methods require a significant amount of training time, especially for difficult tasks. In this study, we propose RAGDP (Retrieve-Augmented Generation for Diffusion Policies) as a novel framework that eliminates the need for additional training using a knowledge base to expedite the inference of pre-trained DPs. In concrete, RAGDP encodes observation-action pairs through the DP encoder to construct a vector database of expert demonstrations. During inference, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.