Joint multi-dimensional dynamic attention and transformer for general image restoration
Huan Zhang, Xu Zhang, Nian Cai, Jianglei Di, and Yun Zhang

TL;DR
This paper presents a novel image restoration architecture combining multi-dimensional dynamic attention and transformers within a U-Net framework, effectively handling complex outdoor image degradations with improved performance and efficiency.
Contribution
It introduces a new architecture integrating dynamic attention and transformers, with selective CNNs and transformers in different parts, for versatile and efficient image restoration.
Findings
Outperforms existing methods across five restoration tasks.
Achieves a better balance between performance and computational complexity.
Enhances high-level vision tasks with improved restoration quality.
Abstract
Outdoor images often suffer from severe degradation due to rain, haze, and noise, impairing image quality and challenging high-level tasks. Current image restoration methods struggle to handle complex degradation while maintaining efficiency. This paper introduces a novel image restoration architecture that combines multi-dimensional dynamic attention and self-attention within a U-Net framework. To leverage the global modeling capabilities of transformers and the local modeling capabilities of convolutions, we integrate sole CNNs in the encoder-decoder and sole transformers in the latent layer. Additionally, we design convolutional kernels with selected multi-dimensional dynamic attention to capture diverse degraded inputs efficiently. A transformer block with transposed self-attention further enhances global feature extraction while maintaining efficiency. Extensive experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Medical Image Segmentation Techniques · Image Processing Techniques and Applications
MethodsSoftmax · Attention Is All You Need · Concatenated Skip Connection · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · U-Net
