Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
Gang Wu, Junjun Jiang, Junpeng Jiang, Xianming Liu

TL;DR
This paper introduces ConvFormer, a novel lightweight super-resolution network that combines convolution and transformer techniques to achieve high performance with reduced computational costs, suitable for resource-limited devices.
Contribution
The paper proposes ConvFormer-based Super-Resolution (CFSR), a new efficient architecture that replaces self-attention with large kernel convolutions and includes an edge-preserving network for high-frequency detail retention.
Findings
CFSR outperforms existing lightweight SR methods in accuracy and efficiency.
Achieves 0.39 dB higher PSNR on Urban100 dataset for x2 super-resolution.
Reduces parameters by 26% and FLOPs by 31% compared to state-of-the-art methods.
Abstract
Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. In particular, transformer-based methods, which leverage self-attention mechanisms, have led to significant breakthroughs but also introduce substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and propose a ConvFormer-based Super-Resolution network (CFSR), offering an effective and efficient solution for lightweight image super-resolution. The proposed method inherits the advantages of both convolution-based and transformer-based approaches. Specifically, CFSR utilizes large kernel convolutions as a feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Photoacoustic and Ultrasonic Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Adam · Absolute Position Encodings · Layer Normalization · Residual Connection · Byte Pair Encoding · Dropout
