Unconditional CNN denoisers contain sparse semantic representation of images
Zahra Kadkhodaie, St\'ephane Mallat, Eero Simoncelli

TL;DR
This paper reveals that unconditional CNN denoisers develop sparse, semantically meaningful image representations internally, enabling self-guided image reconstruction and understanding of learned features without supervision.
Contribution
It demonstrates that a middle block in a fully-convolutional UNet forms a sparse, semantic representation of images, and introduces a novel self-guided reconstruction algorithm based on this insight.
Findings
Sparse semantic representations emerge in the UNet middle block.
Euclidean distances in the representation space are semantically meaningful.
The proposed algorithm enables self-guided image reconstruction from the learned representation.
Abstract
Generative diffusion models learn probability densities over diverse image datasets by estimating the score with a neural network trained to remove noise. Despite their remarkable success in generating high-quality images, the internal mechanisms of the underlying score networks are not well understood. Here, we examine the image representation that arises from score estimation in a {fully-convolutional unconditional UNet}. We show that the middle block of the UNet decomposes individual images into sparse subsets of active channels, and that the vector of spatial averages of these channels can provide a nonlinear representation of the underlying clean images. Euclidean distances in this representation space are semantically meaningful, even though no conditioning information is provided during training. We develop a novel algorithm for stochastic reconstruction of images conditioned on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Neural Networks and Applications
