AMSA-UNet: An Asymmetric Multiple Scales U-net Based on Self-attention for Deblurring

Yingying Wang

arXiv:2406.09015·cs.CV·June 14, 2024·1 cites

TL;DR

AMSA-UNet introduces a multi-scale U-Net with self-attention for image deblurring, enhancing accuracy and efficiency by capturing long-range dependencies and focusing on blurry regions.

Contribution

The paper proposes a novel asymmetric multi-scale U-Net with integrated self-attention and frequency domain computation for improved deblurring performance.

Findings

01. Significant accuracy improvements over existing methods

02. Enhanced focus on blurry regions through multi-scale architecture

03. Reduced computational complexity via frequency domain techniques
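
The third finding can be illustrated with the convolution theorem: a circular convolution in the spatial domain equals an elementwise product in the frequency domain, which replaces a per-pixel kernel loop with FFTs. The sketch below is only a generic NumPy illustration of that identity; the function names are hypothetical, and it is not the paper's frequency-domain module, whose details are not given here.

```python
import numpy as np


def circular_conv2d_direct(image, kernel):
    """Circular 2-D convolution computed directly in the spatial domain."""
    out = np.zeros(image.shape, dtype=np.float64)
    kh, kw = kernel.shape
    for i in range(kh):
        for j in range(kw):
            # np.roll(image, (i, j))[n, m] == image[n - i, m - j] (wrap-around),
            # so this accumulates sum_{i,j} kernel[i, j] * image[n - i, m - j].
            out += kernel[i, j] * np.roll(image, shift=(i, j), axis=(0, 1))
    return out


def circular_conv2d_fft(image, kernel):
    """Same circular convolution via an elementwise product of spectra."""
    h, w = image.shape
    # s=(h, w) zero-pads the kernel to the image size before the transform.
    spectrum = np.fft.rfft2(image) * np.fft.rfft2(kernel, s=(h, w))
    return np.fft.irfft2(spectrum, s=(h, w))


rng = np.random.default_rng(0)
image = rng.standard_normal((16, 16))   # stand-in for one feature channel
kernel = rng.standard_normal((3, 3))    # stand-in for a learned filter

direct = circular_conv2d_direct(image, kernel)
via_fft = circular_conv2d_fft(image, kernel)
print(np.allclose(direct, via_fft))
```

The direct version costs on the order of H·W·k² multiply-adds, while the FFT version costs on the order of H·W·log(H·W) regardless of kernel size, which is where savings for large effective kernels would come from.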

Abstract

The traditional single-scale U-Net often loses spatial information during deblurring, which reduces deblurring accuracy. Additionally, because convolution is limited in capturing long-range dependencies, the quality of the recovered image is degraded. To address these problems, an asymmetric multiple scales U-net based on self-attention (AMSA-UNet) is proposed to improve accuracy and reduce computational complexity. By introducing a multi-scale U-shaped architecture, the network can focus on blurry regions at the global level and better recover image details at the local level. To overcome the limitations of traditional convolutional methods in capturing long-range dependencies, a self-attention mechanism is introduced into the decoder part of the backbone network, which significantly increases the model's receptive field,…
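
To make the receptive-field point concrete, here is a minimal sketch of generic scaled dot-product self-attention over a flattened feature map, assuming random projection matrices in place of learned ones. The function name, shapes, and parameters are hypothetical and chosen for illustration; the abstract does not specify the exact attention variant used in the AMSA-UNet decoder.

```python
import numpy as np


def self_attention(features, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a (H, W, C) feature map.

    w_q, w_k, w_v are (C, D) projection matrices (learned in a real model,
    random here). Returns the attended map (H, W, D) and the attention
    weights (H*W, H*W).
    """
    h, w, c = features.shape
    tokens = features.reshape(h * w, c)           # one token per pixel
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])       # pairwise similarities
    scores = scores - scores.max(axis=-1, keepdims=True)  # stable softmax
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    out = weights @ v                             # mix values from all pixels
    return out.reshape(h, w, -1), weights


rng = np.random.default_rng(0)
features = rng.standard_normal((8, 8, 16))        # stand-in decoder features
w_q, w_k, w_v = (rng.standard_normal((16, 16)) * 0.1 for _ in range(3))

attended, weights = self_attention(features, w_q, w_k, w_v)
print(attended.shape, weights.shape)
```

Every row of `weights` assigns a nonzero weight to every spatial position, so each output pixel can draw on the whole feature map in a single layer, whereas a 3×3 convolution sees only its immediate neighbourhood. The price is the (H·W)×(H·W) weight matrix, which is one reason attention is often applied at reduced resolution.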

Taxonomy

Topics: Advanced Image Processing Techniques

Methods: Concatenated Skip Connection · Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Max Pooling · Focus · U-Net