AMSA-UNet: An Asymmetric Multiple Scales U-net Based on Self-attention for Deblurring
Yingying Wang

TL;DR
AMSA-UNet introduces a multi-scale U-Net with self-attention for image deblurring, enhancing accuracy and efficiency by capturing long-range dependencies and focusing on blurry regions.
Contribution
The paper proposes a novel asymmetric multi-scale U-Net with integrated self-attention and frequency domain computation for improved deblurring performance.
Findings
Significant accuracy improvements over existing methods
Enhanced focus on blurry regions through multi-scale architecture
Reduced computational complexity via frequency domain techniques
Abstract
The traditional ingle-scale U-Net often leads to the loss of spatial information during deblurring, which affects the deblurring accracy. Additionally, due to the convolutional method's limitation in capturing long-range dependencies, the quality of the recovered image is degraded. To address the above problems, an asymmetric multiple scales U-net based on self-attention (AMSA-UNet) is proposed to improve the accuracy and computational complexity. By introducing a multiple-scales U shape architecture, the network can focus on blurry regions at the global level and better recover image details at the local level. In order to overcome the limitations of traditional convolutional methods in capturing the long-range dependencies of information, a self-attention mechanism is introduced into the decoder part of the backbone network, which significantly increases the model's receptive field,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Max Pooling · Focus · U-Net
