AddSR: Accelerating Diffusion-based Blind Super-Resolution with   Adversarial Diffusion Distillation

Rui Xie; Chen Zhao; Kai Zhang; Zhenyu Zhang; Jun Zhou and; Jian Yang; Ying Tai

arXiv:2404.01717·cs.CV·December 30, 2024·2 cites

AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation

Rui Xie, Chen Zhao, Kai Zhang, Zhenyu Zhang, Jun Zhou and, Jian Yang, Ying Tai

PDF

Open Access 1 Repo

TL;DR

AddSR significantly accelerates diffusion-based blind super-resolution by integrating adversarial diffusion distillation and ControlNet, resulting in faster processing and improved image restoration quality.

Contribution

The paper introduces AddSR, a novel method combining distillation and ControlNet to enhance efficiency and robustness in diffusion-based blind super-resolution.

Findings

01

Achieves 7x faster speed than previous state-of-the-art models.

02

Produces better image restoration results.

03

Demonstrates robustness with HR-based training constraints.

Abstract

Blind super-resolution methods based on stable diffusion showcase formidable generative capabilities in reconstructing clear high-resolution images with intricate details from low-resolution inputs. However, their practical applicability is often hampered by poor efficiency, stemming from the requirement of thousands or hundreds of sampling steps. Inspired by the efficient adversarial diffusion distillation (ADD), we design~\name~to address this issue by incorporating the ideas of both distillation and ControlNet. Specifically, we first propose a prediction-based self-refinement strategy to provide high-frequency information in the student model output with marginal additional time cost. Furthermore, we refine the training process by employing HR images, rather than LR images, to regulate the teacher model, providing a more robust constraint for distillation. Second, we introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NJU-PCALab/AddSR
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion