Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception   Priors

Jiangang Wang; Qingnan Fan; Qi Zhang; Haigen Liu; Yuhang Yu; Jinwei; Chen; Wenqi Ren

arXiv:2412.07152·cs.CV·December 11, 2024

Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors

Jiangang Wang, Qingnan Fan, Qi Zhang, Haigen Liu, Yuhang Yu, Jinwei, Chen, Wenqi Ren

PDF

Open Access

TL;DR

Hero-SR is a diffusion-based super-resolution framework that enhances perceptual naturalness and semantic consistency by adaptively selecting diffusion steps and integrating multi-modal guidance, achieving state-of-the-art results.

Contribution

It introduces two novel modules, DTSM and OWMS, to improve human perception alignment in super-resolution, a significant advancement over prior methods.

Findings

01

Achieves state-of-the-art performance in Real-SR tasks.

02

Effectively preserves intricate details and perceptual quality.

03

Demonstrates improved semantic consistency with human perception standards.

Abstract

Owing to the robust priors of diffusion models, recent approaches have shown promise in addressing real-world super-resolution (Real-SR). However, achieving semantic consistency and perceptual naturalness to meet human perception demands remains difficult, especially under conditions of heavy degradation and varied input complexities. To tackle this, we propose Hero-SR, a one-step diffusion-based SR framework explicitly designed with human perception priors. Hero-SR consists of two novel modules: the Dynamic Time-Step Module (DTSM), which adaptively selects optimal diffusion steps for flexibly meeting human perceptual standards, and the Open-World Multi-modality Supervision (OWMS), which integrates guidance from both image and text domains through CLIP to improve semantic consistency and perceptual naturalness. Through these modules, Hero-SR generates high-resolution images that not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging

MethodsDiffusion · Contrastive Language-Image Pre-training