LoFormer: Local Frequency Transformer for Image Deblurring

Xintian Mao; Jiansheng Wang; Xingran Xie; Qingli Li; Yan Wang

arXiv:2407.16993·cs.CV·July 25, 2024

LoFormer: Local Frequency Transformer for Image Deblurring

Xintian Mao, Jiansheng Wang, Xingran Xie, Qingli Li, Yan Wang

PDF

2 Repos

TL;DR

LoFormer is a novel transformer architecture that models long-range dependencies in image deblurring by combining local frequency domain self-attention with an MLP gating mechanism, improving detail preservation and global context understanding.

Contribution

Introduces LoFormer, a local frequency domain self-attention transformer with an MLP gating mechanism for enhanced image deblurring performance.

Findings

01

Achieves 34.09 dB PSNR on GoPro dataset

02

Significantly improves deblurring quality over existing methods

03

Efficiently balances coarse and fine-grained feature learning

Abstract

Due to the computational complexity of self-attention (SA), prevalent techniques for image deblurring often resort to either adopting localized SA or employing coarse-grained global SA methods, both of which exhibit drawbacks such as compromising global modeling or lacking fine-grained correlation. In order to address this issue by effectively modeling long-range dependencies without sacrificing fine-grained details, we introduce a novel approach termed Local Frequency Transformer (LoFormer). Within each unit of LoFormer, we incorporate a Local Channel-wise SA in the frequency domain (Freq-LC) to simultaneously capture cross-covariance within low- and high-frequency local windows. These operations offer the advantage of (1) ensuring equitable learning opportunities for both coarse-grained structures and fine-grained details, and (2) exploring a broader range of representational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections