Multivariate Density Estimation with Deep Neural Mixture Models
Edmondo Trentin (DIISM, University of Siena, Italy)

TL;DR
This paper introduces Deep Neural Mixture Models (DNMMs) for multivariate density estimation, providing a novel approach that combines neural networks with mixture models, ensuring probabilistic validity and demonstrating superior performance over traditional methods.
Contribution
The paper extends Neural Mixture Densities to multivariate DNN mixtures, proposing a maximum-likelihood training algorithm and an automatic architecture selection procedure.
Findings
DNMMs effectively model complex multivariate densities.
Experimental results show DNMMs outperform traditional density estimation methods.
The approach satisfies Kolmogorov's axioms numerically.
Abstract
Albeit worryingly underrated in the recent literature on machine learning in general (and, on deep learning in particular), multivariate density estimation is a fundamental task in many applications, at least implicitly, and still an open issue. With a few exceptions, deep neural networks (DNNs) have seldom been applied to density estimation, mostly due to the unsupervised nature of the estimation task, and (especially) due to the need for constrained training algorithms that ended up realizing proper probabilistic models that satisfy Kolmogorov's axioms. Moreover, in spite of the well-known improvement in terms of modeling capabilities yielded by mixture models over plain single-density statistical estimators, no proper mixtures of multivariate DNN-based component densities have been investigated so far. The paper fills this gap by extending our previous work on Neural Mixture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Neural Networks and Applications · Gaussian Processes and Bayesian Inference
