SAM-DiffSR: Structure-Modulated Diffusion Model for Image   Super-Resolution

Chengcheng Wang; Zhiwei Hao; Yehui Tang; Jianyuan Guo; Yujie Yang; Kai; Han; Yunhe Wang

arXiv:2402.17133·cs.CV·February 13, 2025·5 cites

SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution

Chengcheng Wang, Zhiwei Hao, Yehui Tang, Jianyuan Guo, Yujie Yang, Kai, Han, Yunhe Wang

PDF

Open Access 1 Repo

TL;DR

SAM-DiffSR enhances diffusion-based image super-resolution by integrating structural information from SAM during training, improving detail recovery and artifact suppression without increasing inference computational cost.

Contribution

The paper introduces a novel method that encodes structural information from SAM into the diffusion process, boosting super-resolution quality without extra inference costs.

Findings

01

Outperforms existing diffusion-based SR methods by 0.74 dB PSNR on DIV2K.

02

Effectively suppresses artifacts in super-resolved images.

03

Does not require SAM during inference, reducing computational load.

Abstract

Diffusion-based super-resolution (SR) models have recently garnered significant attention due to their potent restoration capabilities. But conventional diffusion models perform noise sampling from a single distribution, constraining their ability to handle real-world scenes and complex textures across semantic regions. With the success of segment anything model (SAM), generating sufficiently fine-grained region masks can enhance the detail recovery of diffusion-based SR model. However, directly integrating SAM into SR models will result in much higher computational cost. In this paper, we propose the SAM-DiffSR model, which can utilize the fine-grained structure information from SAM in the process of sampling noise to improve the image quality without additional computational cost during inference. In the process of training, we encode structural position information into the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lose4578/SAM-DiffSR
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhotoacoustic and Ultrasonic Imaging · Image and Signal Denoising Methods · Advanced Image Processing Techniques

MethodsDiffusion · Segment Anything Model