DnSwin: Toward Real-World Denoising via Continuous Wavelet Sliding-Transformer
Hao Li, Zhijing Yang, Xiaobin Hong, Ziying Zhao, Junyang Chen, Yukai, Shi, Jinshan Pan

TL;DR
DnSwin introduces a novel wavelet-based sliding-transformer approach for real-world image denoising, effectively capturing frequency dependencies and outperforming existing methods on benchmark datasets.
Contribution
The paper proposes a continuous wavelet sliding-transformer that leverages wavelet transforms and self-attention to improve real-world image denoising performance.
Findings
Outperforms state-of-the-art denoising methods on real-world benchmarks
Effectively captures frequency dependencies in noisy images
Enhances denoising quality by integrating wavelet transforms with transformers
Abstract
Real-world image denoising is a practical image restoration problem that aims to obtain clean images from in-the-wild noisy inputs. Recently, the Vision Transformer (ViT) has exhibited a strong ability to capture long-range dependencies, and many researchers have attempted to apply the ViT to image denoising tasks. However, a real-world image is an isolated frame that makes the ViT build long-range dependencies based on the internal patches, which divides images into patches, disarranges noise patterns and damages gradient continuity. In this article, we propose to resolve this issue by using a continuous Wavelet Sliding-Transformer that builds frequency correspondences under real-world scenes, called DnSwin. Specifically, we first extract the bottom features from noisy input images by using a convolutional neural network (CNN) encoder. The key to DnSwin is to extract high-frequency and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Image Processing Techniques and Applications · Photoacoustic and Ultrasonic Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Vision Transformer
