ReDiF: Reinforced Distillation for Few Step Diffusion

Amirhossein Tighkhorshid; Zahra Dehghanian; Gholamali Aminian; Chengchun Shi; Hamid R. Rabiee

arXiv:2512.22802·cs.LG·December 30, 2025

ReDiF: Reinforced Distillation for Few Step Diffusion

Amirhossein Tighkhorshid, Zahra Dehghanian, Gholamali Aminian, Chengchun Shi, Hamid R. Rabiee

PDF

Open Access

TL;DR

This paper introduces ReDiF, a reinforcement learning-based distillation method for diffusion models that optimizes the denoising process, enabling high-quality generation with fewer steps and less computation.

Contribution

ReDiF is the first RL-driven distillation framework for diffusion models that dynamically guides the denoising process, improving efficiency and performance.

Findings

01

Achieves superior performance with fewer inference steps.

02

Reduces computational resources compared to existing methods.

03

Model-agnostic and applicable to various diffusion models.

Abstract

Distillation addresses the slow sampling problem in diffusion models by creating models with smaller size or fewer steps that approximate the behavior of high-step teachers. In this work, we propose a reinforcement learning based distillation framework for diffusion models. Instead of relying on fixed reconstruction or consistency losses, we treat the distillation process as a policy optimization problem, where the student is trained using a reward signal derived from alignment with the teacher's outputs. This RL driven approach dynamically guides the student to explore multiple denoising paths, allowing it to take longer, optimized steps toward high-probability regions of the data distribution, rather than relying on incremental refinements. Our framework utilizes the inherent ability of diffusion models to handle larger steps and effectively manage the generative process. Experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis