S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and   Blind Super-Resolution

Minghao She; Wendong Mao; Huihong Shi; Zhongfeng Wang

arXiv:2308.08142·cs.CV·August 17, 2023

S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution

Minghao She, Wendong Mao, Huihong Shi, Zhongfeng Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces S2R, a lightweight transformer-based framework with a coarse-to-fine training strategy that excels in both ideal and blind super-resolution tasks, significantly improving performance and convergence speed.

Contribution

The paper proposes a novel double-win framework combining a lightweight transformer model and a two-stage training strategy for superior ideal and blind SR performance.

Findings

01

Outperforms other SR models in ideal conditions with 578K parameters.

02

Achieves better results than blind SR models with only 10 gradient updates.

03

Accelerates transfer learning in real-world SR by 300 times.

Abstract

Nowadays, deep learning based methods have demonstrated impressive performance on ideal super-resolution (SR) datasets, but most of these methods incur dramatically performance drops when directly applied in real-world SR reconstruction tasks with unpredictable blur kernels. To tackle this issue, blind SR methods are proposed to improve the visual results on random blur kernels, which causes unsatisfactory reconstruction effects on ideal low-resolution images similarly. In this paper, we propose a double-win framework for ideal and blind SR task, named S2R, including a light-weight transformer-based SR model (S2R transformer) and a novel coarse-to-fine training strategy, which can achieve excellent visual results on both ideal and random fuzzy conditions. On algorithm level, S2R transformer smartly combines some efficient and light-weight blocks to enhance the representation ability of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

berumotto-vermouth/S2R
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings