Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho

TL;DR
This paper introduces ASID, a lightweight Transformer-based super-resolution network that reduces computational complexity through attention-sharing and information distillation, achieving competitive performance with fewer parameters.
Contribution
The paper proposes a novel lightweight SR network, ASID, that incorporates attention-sharing and an adapted information distillation scheme for Transformer-based methods.
Findings
ASID requires only around 300K parameters.
ASID outperforms state-of-the-art SR methods with similar parameter counts.
ASID achieves competitive performance with significantly fewer parameters.
Abstract
Transformer-based Super-Resolution (SR) methods have demonstrated superior performance compared to convolutional neural network (CNN)-based SR approaches due to their capability to capture long-range dependencies. However, their high computational complexity necessitates the development of lightweight approaches for practical use. To address this challenge, we propose the Attention-Sharing Information Distillation (ASID) network, a lightweight SR network that integrates attention-sharing and an information distillation structure specifically designed for Transformer-based SR methods. We modify the information distillation scheme, originally designed for efficient CNN operations, to reduce the computational load of stacked self-attention layers, effectively addressing the efficiency bottleneck. Additionally, we introduce attention-sharing across blocks to further minimize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image Processing Techniques · CCD and CMOS Imaging Sensors · Image Processing Techniques and Applications
