StructSR: Refuse Spurious Details in Real-World Image Super-Resolution
Yachao Li, Dong Liang, Tianyu Ding, Sheng-Jun Huang

TL;DR
StructSR is a plug-and-play method that enhances structural fidelity and suppresses spurious details in diffusion-based real-world image super-resolution, significantly improving image quality without extra training or external priors.
Contribution
It introduces the Structure-Aware Screening mechanism for real-time structural fidelity enhancement in diffusion-based super-resolution models.
Findings
Improves PSNR and SSIM by over 5% and 9% on synthetic datasets.
Enhances image structure and texture fidelity across multiple diffusion-based methods.
Operates without additional fine-tuning or external priors.
Abstract
Diffusion-based models have shown great promise in real-world image super-resolution (Real-ISR), but often generate content with structural errors and spurious texture details due to the empirical priors and illusions of these models. To address this issue, we introduce StructSR, a simple, effective, and plug-and-play method that enhances structural fidelity and suppresses spurious details for diffusion-based Real-ISR. StructSR operates without the need for additional fine-tuning, external model priors, or high-level semantic knowledge. At its core is the Structure-Aware Screening (SAS) mechanism, which identifies the image with the highest structural similarity to the low-resolution (LR) input in the early inference stage, allowing us to leverage it as a historical structure knowledge to suppress the generation of spurious details. By intervening in the diffusion inference process,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Cell Image Analysis Techniques
MethodsDiffusion
