Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image Super-Resolution
Zhenyu Hu, Wanjie Sun

TL;DR
This paper introduces LAMNet, a lightweight convolution-based Transformer for image super-resolution that combines adaptive spatial aggregation with efficient design, achieving high performance with significantly reduced inference latency.
Contribution
The paper proposes a novel convolution-based linear focal separable attention mechanism and a dual-branch structure with an information exchange module, enabling efficient long-range modeling in super-resolution.
Findings
LAMNet outperforms existing SA-based Transformers in accuracy.
LAMNet achieves a 3x speedup in inference time.
The proposed method maintains high performance with lower computational cost.
Abstract
Window-based transformers have demonstrated outstanding performance in super-resolution tasks due to their adaptive modeling capabilities through local self-attention (SA). However, they exhibit higher computational complexity and inference latency than convolutional neural networks. In this paper, we first identify that the adaptability of the Transformers is derived from their adaptive spatial aggregation and advanced structural design, while their high latency results from the computational costs and memory layout transformations associated with the local SA. To simulate this aggregation approach, we propose an effective convolution-based linear focal separable attention (FSA), allowing for long-range dynamic modeling with linear complexity. Additionally, we introduce an effective dual-branch structure combined with an ultra-lightweight information exchange module (IEM) to enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Dense Connections
