TL;DR
This paper introduces a novel network architecture combining Swin transformer and convolutional layers within a UNet framework, along with a comprehensive noise synthesis model, to improve blind image denoising for real-world images.
Contribution
It proposes a Swin-Conv-UNet architecture and a versatile noise synthesis method to enhance blind denoising performance on real images.
Findings
Achieves state-of-the-art results in blind denoising tasks.
Effectively models various real-world noise types.
Improves practical denoising performance with the new data synthesis strategy.
Abstract
While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved. In this paper, we attempt to solve this problem from the perspective of network architecture design and training data synthesis. Specifically, for the network architecture design, we propose a swin-conv block to incorporate the local modeling ability of residual convolutional layer and non-local modeling ability of swin transformer block, and then plug it as the main building block into the widely-used image-to-image translation UNet architecture. For the training data synthesis, we design a practical noise degradation model which takes into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Residual Connection · Stochastic Depth · Swin Transformer
