HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, and Chao Dong

TL;DR
HAT is a novel hybrid attention transformer that combines channel and window-based self-attention to enhance input utilization and cross-window information aggregation, significantly improving performance across various image restoration tasks.
Contribution
The paper introduces a Hybrid Attention Transformer (HAT) that integrates channel and window-based self-attention with an overlapping cross-attention module and a pre-training strategy for superior image restoration.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Effectively utilizes more input pixels for better restoration.
Extends to diverse applications like super-resolution and denoising.
Abstract
Transformer-based methods have shown impressive performance in image restoration tasks, such as image super-resolution and denoising. However, we find that these networks can only utilize a limited spatial range of input information through attribution analysis. This implies that the potential of Transformer is still not fully exploited in existing networks. In order to activate more input pixels for better restoration, we propose a new Hybrid Attention Transformer (HAT). It combines both channel attention and window-based self-attention schemes, thus making use of their complementary advantages. Moreover, to better aggregate the cross-window information, we introduce an overlapping cross-attention module to enhance the interaction between neighboring window features. In the training stage, we additionally adopt a same-task pre-training strategy to further exploit the potential of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Photoacoustic and Ultrasonic Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Concatenated Skip Connection
