Image Modeling with Deep Convolutional Gaussian Mixture Models
Alexander Gepperth, Benedikt Pf\"ulb

TL;DR
This paper introduces Deep Convolutional Gaussian Mixture Models (DCGMMs), a hierarchical approach that models images efficiently, enabling end-to-end training and high-quality image generation, outperforming traditional GMMs in clustering and sampling tasks.
Contribution
The paper proposes DCGMMs, a novel deep hierarchical GMM architecture that leverages convolutional layers for improved image modeling and training via stochastic gradient descent.
Findings
DCGMMs outperform flat GMMs in clustering accuracy.
DCGMMs generate sharper images than traditional GMMs.
DCGMMs effectively detect outliers in image datasets.
Abstract
In this conceptual work, we present Deep Convolutional Gaussian Mixture Models (DCGMMs): a new formulation of deep hierarchical Gaussian Mixture Models (GMMs) that is particularly suitable for describing and generating images. Vanilla (i.e., flat) GMMs require a very large number of components to describe images well, leading to long training times and memory issues. DCGMMs avoid this by a stacked architecture of multiple GMM layers, linked by convolution and pooling operations. This allows to exploit the compositionality of images in a similar way as deep CNNs do. DCGMMs can be trained end-to-end by Stochastic Gradient Descent. This sets them apart from vanilla GMMs which are trained by Expectation-Maximization, requiring a prior k-means initialization which is infeasible in a layered structure. For generating sharp images with DCGMMs, we introduce a new gradient-based technique for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
