Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image   Super-Resolution

Zhenyu Hu; Wanjie Sun

arXiv:2409.17597·cs.CV·September 27, 2024

Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image Super-Resolution

Zhenyu Hu, Wanjie Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces LAMNet, a lightweight convolution-based Transformer for image super-resolution that combines adaptive spatial aggregation with efficient design, achieving high performance with significantly reduced inference latency.

Contribution

The paper proposes a novel convolution-based linear focal separable attention mechanism and a dual-branch structure with an information exchange module, enabling efficient long-range modeling in super-resolution.

Findings

01

LAMNet outperforms existing SA-based Transformers in accuracy.

02

LAMNet achieves a 3x speedup in inference time.

03

The proposed method maintains high performance with lower computational cost.

Abstract

Window-based transformers have demonstrated outstanding performance in super-resolution tasks due to their adaptive modeling capabilities through local self-attention (SA). However, they exhibit higher computational complexity and inference latency than convolutional neural networks. In this paper, we first identify that the adaptability of the Transformers is derived from their adaptive spatial aggregation and advanced structural design, while their high latency results from the computational costs and memory layout transformations associated with the local SA. To simulate this aggregation approach, we propose an effective convolution-based linear focal separable attention (FSA), allowing for long-range dynamic modeling with linear complexity. Additionally, we introduce an effective dual-branch structure combined with an ultra-lightweight information exchange module (IEM) to enhance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zononhzy/lamnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Dense Connections