TL;DR
TM-BSN introduces a novel triangular-masked convolutional architecture that models real-world noise correlation in self-supervised image denoising, outperforming existing methods without downsampling.
Contribution
The paper proposes a triangular-masked blind-spot network that accurately captures spatially correlated noise, improving denoising performance while maintaining full resolution and efficiency.
Findings
Achieves state-of-the-art results on real-world benchmarks.
Effectively models spatially correlated noise without downsampling.
Improves accuracy and efficiency through knowledge distillation.
Abstract
Blind-spot networks (BSNs) enable self-supervised image denoising by preventing access to the target pixel, allowing clean signal estimation without ground-truth supervision. However, this approach assumes pixel-wise noise independence, which is violated in real-world sRGB images due to spatially correlated noise from the camera's image signal processing (ISP) pipeline. While several methods employ downsampling to decorrelate noise, they alter noise statistics and limit the network's ability to utilize full contextual information. In this paper, we propose the Triangular-Masked Blind-Spot Network (TM-BSN), a novel blind-spot architecture that accurately models the spatial correlation of real sRGB noise. This correlation originates from demosaicing, where each pixel is reconstructed from neighboring samples with spatially decaying weights, resulting in a diamond-shaped pattern. To align…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
