Your Pre-trained Diffusion Model Secretly Knows Restoration
Sudarshan Rajagopalan, Vishal M. Patel

TL;DR
This paper reveals that pre-trained diffusion models inherently possess restoration capabilities that can be unlocked through prompt learning, enabling effective image and video restoration without fine-tuning or additional modules.
Contribution
The authors demonstrate that direct prompt learning within a diffusion bridge framework unlocks inherent restoration behavior in pre-trained diffusion models, avoiding fine-tuning.
Findings
Achieves competitive restoration performance across diverse degradations.
Enables high-quality restoration without fine-tuning or control modules.
Extends approach to both image and video diffusion models.
Abstract
Pre-trained diffusion models have enabled significant advancements in All-in-One Restoration (AiOR), offering improved perceptual quality and generalization. However, diffusion-based restoration methods primarily rely on fine-tuning or Control-Net style modules to leverage the pre-trained diffusion model's priors for AiOR. In this work, we show that these pre-trained diffusion models inherently possess restoration behavior, which can be unlocked by directly learning prompt embeddings at the output of the text encoder. Interestingly, this behavior is largely inaccessible through text prompts and text-token embedding optimization. Furthermore, we observe that naive prompt learning is unstable because the forward noising process using degraded images is misaligned with the reverse sampling trajectory. To resolve this, we train prompts within a diffusion bridge formulation that aligns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
