Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution

Hao Chen; Junyang Chen; Jinshan Pan; Jiangxin Dong

arXiv:2512.14061·cs.CV·December 17, 2025

Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution

Hao Chen, Junyang Chen, Jinshan Pan, Jiangxin Dong

PDF

Open Access

TL;DR

This paper introduces CODSR, a controllable one-step diffusion model for image super-resolution that improves fidelity, perceptual quality, and semantic alignment by leveraging uncompressed input information, region-adaptive priors, and text guidance.

Contribution

The paper proposes a novel controllable diffusion framework with LQ-guided modulation, region-adaptive priors, and text guidance to address fidelity and semantic alignment issues in super-resolution.

Findings

01

Achieves superior perceptual quality over state-of-the-art methods.

02

Maintains competitive fidelity with efficient one-step inference.

03

Effectively aligns text prompts with semantic regions in images.

Abstract

Recent diffusion-based one-step methods have shown remarkable progress in the field of image super-resolution, yet they remain constrained by three critical limitations: (1) inferior fidelity performance caused by the information loss from compression encoding of low-quality (LQ) inputs; (2) insufficient region-discriminative activation of generative priors; (3) misalignment between text prompts and their corresponding semantic regions. To address these limitations, we propose CODSR, a controllable one-step diffusion network for image super-resolution. First, we propose an LQ-guided feature modulation module that leverages original uncompressed information from LQ inputs to provide high-fidelity conditioning for the diffusion process. We then develop a region-adaptive generative prior activation method to effectively enhance perceptual richness without sacrificing local structural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment