One-Step Diffusion-based Real-World Image Super-Resolution with Visual Perception Distillation
Xue Wu, Jingwei Xin, Zhijun Tu, Jie Hu, Jie Li, Nannan Wang, and Xinbo Gao

TL;DR
This paper introduces VPD-SR, a one-step diffusion-based super-resolution framework that combines semantic and high-frequency perception guidance to produce high-quality, perceptually faithful images efficiently.
Contribution
The paper proposes a novel one-step diffusion model for super-resolution that integrates semantic supervision and high-frequency perception loss, improving perceptual quality and semantic fidelity.
Findings
VPD-SR outperforms previous methods in perceptual quality.
Achieves comparable or better results with just one-step sampling.
Enhances semantic consistency and high-frequency detail restoration.
Abstract
Diffusion-based models have been widely used in various visual generation tasks, showing promising results in image super-resolution (SR), while typically being limited by dozens or even hundreds of sampling steps. Although existing methods aim to accelerate the inference speed of multi-step diffusion-based SR methods through knowledge distillation, their generated images exhibit insufficient semantic alignment with real images, resulting in suboptimal perceptual quality reconstruction, specifically reflected in the CLIPIQA score. These methods still have many challenges in perceptual quality and semantic fidelity. Based on the challenges, we propose VPD-SR, a novel visual perception diffusion distillation framework specifically designed for SR, aiming to construct an effective and efficient one-step SR model. Specifically, VPD-SR consists of two components: Explicit Semantic-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Image Fusion Techniques · Image Processing Techniques and Applications
