TL;DR
Bird-SR introduces a bidirectional reward-guided diffusion framework that enhances real-world image super-resolution by balancing structural fidelity and perceptual quality through trajectory-level optimization and reward strategies.
Contribution
The paper proposes Bird-SR, a novel diffusion-based super-resolution method that jointly leverages synthetic and real-world data with reward-guided optimization and dynamic weighting.
Findings
Outperforms state-of-the-art methods in perceptual quality on real-world benchmarks.
Effectively preserves structural consistency in super-resolved images.
Balances perceptual enhancement and structural fidelity through trajectory-specific strategies.
Abstract
Powered by multimodal text-to-image priors, diffusion-based super-resolution excels at synthesizing intricate details; however, models trained on synthetic low-resolution (LR) and high-resolution (HR) image pairs often degrade when applied to real-world LR images due to significant distribution shifts. We propose Bird-SR, a bidirectional reward-guided diffusion framework that formulates super-resolution as trajectory-level preference optimization via reward feedback learning (ReFL), jointly leveraging synthetic LR-HR pairs and real-world LR images. For structural fidelity easily affected in ReFL, the model is directly optimized on synthetic pairs at early diffusion steps, which also facilitates structure preservation for real-world inputs under smaller distribution gap in structure levels. For perceptual enhancement, quality-guided rewards are applied to both synthetic and real LR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
