Model Building for Semiparametric Mixtures
Ramani S. Pilla, Francesco Bartolucci, Bruce G. Lindsay

TL;DR
This paper introduces a unified theoretical framework and algorithms for estimating the complexity of multivariate mixture models, enabling flexible high-dimensional data fitting and data reduction.
Contribution
It develops a novel approach using concave optimization and penalty functions to find nonparametric maximum likelihood estimators for multivariate mixtures, including the use of sieve parameters.
Findings
Proved convergence of the proposed algorithms.
Demonstrated the method on yeast microarray data.
Showed how to reduce dimensionality with sieve parameters.
Abstract
An important and yet difficult problem in fitting multivariate mixture models is determining the mixture complexity. We develop theory and a unified framework for finding the nonparametric maximum likelihood estimator of a multivariate mixing distribution and consequently estimating the mixture complexity. Multivariate mixtures provide a flexible approach to fitting high-dimensional data while offering data reduction through the number, location and shape of the component densities. The central principle of our method is to cast the mixture maximization problem in the concave optimization framework with finitely many linear inequality constraints and turn it into an unconstrained problem using a "penalty function". We establish the existence of parameter estimators and prove the convergence properties of the proposed algorithms. The role of a "sieve parameter'' in reducing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Soil Geostatistics and Mapping
