One Step Diffusion-based Super-Resolution with Time-Aware Distillation
Xiao He, Huaao Tang, Zhijun Tu, Junchao Zhang, Kun Cheng, Hanting, Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu

TL;DR
This paper introduces TAD-SR, a novel diffusion-based super-resolution method that uses time-aware distillation and a score-based strategy to produce high-quality images efficiently in just one sampling step.
Contribution
The paper proposes a time-aware diffusion distillation approach with a new score strategy and a latent adversarial loss to improve super-resolution quality and efficiency.
Findings
Achieves comparable or better performance than SOTA with one sampling step.
Effectively focuses on high-frequency details through score distillation.
Demonstrates superior results on synthetic and real-world datasets.
Abstract
Diffusion-based image super-resolution (SR) methods have shown promise in reconstructing high-resolution images with fine details from low-resolution counterparts. However, these approaches typically require tens or even hundreds of iterative samplings, resulting in significant latency. Recently, techniques have been devised to enhance the sampling efficiency of diffusion-based SR models via knowledge distillation. Nonetheless, when aligning the knowledge of student and teacher models, these solutions either solely rely on pixel-level loss constraints or neglect the fact that diffusion models prioritize varying levels of information at different time steps. To accomplish effective and efficient image super-resolution, we propose a time-aware diffusion distillation method, named TAD-SR. Specifically, we introduce a novel score distillation strategy to align the data distribution between…
Peer Reviews
Decision·Submitted to ICLR 2025
1. The paper targets at an important problem of distillation of SR diffusion models. While diffusion distillation is a popular research area, it is interesting to see some insight particularly designed for SR models 2. The paper introduces a novel technique to reduce the bias of the score estimate of generated samples in SDS, which particularly fits in the insights from SR. 3. Empirical results shows promising improvements.
1. The biggest concern is insufficient baselines. The method compare against a large number of non-diffusion based methods or diffusion based iterative methods, but it lacks comparisons against the most closely related methods: other diffusion distillation algorithms. This method distill a pre-trained SR diffusion model into one step with some specific design for SR, but there are many distillation methods designed for general diffusion models, such as consistency model and the family of distrib
* The paper is well-written. * Experimental results demonstrate that the proposed method achieves state-of-the-art performance with high efficiency.
* The evaluation is not comprehensive. Some image fidelity metrics are lacking, such as PSNR and SSIM on ImageNet-Test, where the competing methods ResShift and SinSR all reported. * The improvement over the previous single-step distillation method SinSR is minor. Considering that LPIPS—a crucial metric for perceptual quality—is very important, the increase from 0.221 to 0.227 represents a big drop in quality and is not slight. * The ablation study examines only the presence or absence of the
1. This paper proposes a time-aware distillation method that accelerates diffusion-based SR models into a single inference step. 2. The writing of this paper is good.
See the questions.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Fluorescence Microscopy Techniques · Photoacoustic and Ultrasonic Imaging · Optical Coherence Tomography Applications
MethodsDiffusion · ALIGN
