Transforming Image Super-Resolution: A ConvFormer-based Efficient   Approach

Gang Wu; Junjun Jiang; Junpeng Jiang; Xianming Liu

arXiv:2401.05633·cs.CV·October 22, 2024·1 cites

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

Gang Wu, Junjun Jiang, Junpeng Jiang, Xianming Liu

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces ConvFormer, a novel lightweight super-resolution network that combines convolution and transformer techniques to achieve high performance with reduced computational costs, suitable for resource-limited devices.

Contribution

The paper proposes ConvFormer-based Super-Resolution (CFSR), a new efficient architecture that replaces self-attention with large kernel convolutions and includes an edge-preserving network for high-frequency detail retention.

Findings

01

CFSR outperforms existing lightweight SR methods in accuracy and efficiency.

02

Achieves 0.39 dB higher PSNR on Urban100 dataset for x2 super-resolution.

03

Reduces parameters by 26% and FLOPs by 31% compared to state-of-the-art methods.

Abstract

Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. In particular, transformer-based methods, which leverage self-attention mechanisms, have led to significant breakthroughs but also introduce substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and propose a ConvFormer-based Super-Resolution network (CFSR), offering an effective and efficient solution for lightweight image super-resolution. The proposed method inherits the advantages of both convolution-based and transformer-based approaches. Specifically, CFSR utilizes large kernel convolutions as a feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aitical/cfsr
pytorchOfficial

Models

🤗
GWu/CFSR
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Photoacoustic and Ultrasonic Imaging

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Adam · Absolute Position Encodings · Layer Normalization · Residual Connection · Byte Pair Encoding · Dropout