How Powerful Potential of Attention on Image Restoration?
Cong Wang, Jinshan Pan, Yeying Jin, Liyan Wang, Wei Wang, Gang Fu,, Wenqi Ren, Xiaochun Cao

TL;DR
This paper investigates the potential of attention mechanisms in image restoration by removing the feed-forward network component, introducing Continuous Scaling Attention (CSAttn), and demonstrating competitive performance across tasks.
Contribution
The paper presents a novel attention structure, CSAttn, that operates without FFN, offering a flexible alternative for image restoration tasks and providing new insights into attention mechanisms.
Findings
CSAttn outperforms CNN and Transformer-based methods in image restoration.
Removing FFN is feasible and effective for image restoration.
Simple operations within attention significantly impact performance.
Abstract
Transformers have demonstrated their effectiveness in image restoration tasks. Existing Transformer architectures typically comprise two essential components: multi-head self-attention and feed-forward network (FFN). The former captures long-range pixel dependencies, while the latter enables the model to learn complex patterns and relationships in the data. Previous studies have demonstrated that FFNs are key-value memories \cite{geva2020transformer}, which are vital in modern Transformer architectures. In this paper, we conduct an empirical study to explore the potential of attention mechanisms without using FFN and provide novel structures to demonstrate that removing FFN is flexible for image restoration. Specifically, we propose Continuous Scaling Attention (\textbf{CSAttn}), a method that computes attention continuously in three stages without using FFN. To achieve competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotoacoustic and Ultrasonic Imaging · Image Processing Techniques and Applications · CCD and CMOS Imaging Sensors
MethodsAttention Is All You Need · Absolute Position Encodings · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization · Dropout · Linear Layer · Multi-Head Attention · Byte Pair Encoding
