S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Minghao She, Wendong Mao, Huihong Shi, Zhongfeng Wang

TL;DR
This paper introduces S2R, a lightweight transformer-based framework with a coarse-to-fine training strategy that excels in both ideal and blind super-resolution tasks, significantly improving performance and convergence speed.
Contribution
The paper proposes a novel double-win framework combining a lightweight transformer model and a two-stage training strategy for superior ideal and blind SR performance.
Findings
Outperforms other SR models in ideal conditions with 578K parameters.
Achieves better results than blind SR models with only 10 gradient updates.
Accelerates transfer learning in real-world SR by 300 times.
Abstract
Nowadays, deep learning based methods have demonstrated impressive performance on ideal super-resolution (SR) datasets, but most of these methods incur dramatically performance drops when directly applied in real-world SR reconstruction tasks with unpredictable blur kernels. To tackle this issue, blind SR methods are proposed to improve the visual results on random blur kernels, which causes unsatisfactory reconstruction effects on ideal low-resolution images similarly. In this paper, we propose a double-win framework for ideal and blind SR task, named S2R, including a light-weight transformer-based SR model (S2R transformer) and a novel coarse-to-fine training strategy, which can achieve excellent visual results on both ideal and random fuzzy conditions. On algorithm level, S2R transformer smartly combines some efficient and light-weight blocks to enhance the representation ability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
