Mixtures of conditional Gaussian scale mixtures applied to multiscale image representations
Lucas Theis, Reshad Hosseini, Matthias Bethge

TL;DR
This paper introduces a probabilistic model using Gaussian scale mixtures and multiscale representations for natural images, outperforming previous models in likelihood evaluation and generating images with higher-order correlations.
Contribution
It presents a directed graphical model for natural images based on Gaussian scale mixtures, enabling both high-quality image generation and principled evaluation.
Findings
Achieves the best reported cross-entropy rate for natural image models
Generates images with interesting higher-order correlations
Outperforms Markov random field based models in likelihood evaluation
Abstract
We present a probabilistic model for natural images which is based on Gaussian scale mixtures and a simple multiscale representation. In contrast to the dominant approach to modeling whole images focusing on Markov random fields, we formulate our model in terms of a directed graphical model. We show that it is able to generate images with interesting higher-order correlations when trained on natural images or samples from an occlusion based model. More importantly, the directed model enables us to perform a principled evaluation. While it is easy to generate visually appealing images, we demonstrate that our model also yields the best performance reported to date when evaluated with respect to the cross-entropy rate, a measure tightly linked to the average log-likelihood.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
