Estimation of Gaussian mixtures in small sample studies using $l_1$   penalization

Stephane Chretien

arXiv:0901.4752·stat.CO·October 9, 2014

Estimation of Gaussian mixtures in small sample studies using $l_1$ penalization

Stephane Chretien

PDF

Open Access

TL;DR

This paper introduces a robust penalized EM algorithm for estimating Gaussian mixture components from small datasets, outperforming standard maximum likelihood methods in experiments.

Contribution

It develops a novel $l_1$ penalized EM approach with proven convergence properties for small sample Gaussian mixture estimation.

Findings

01

The proposed method outperforms maximum likelihood estimation in small sample scenarios.

02

The penalized EM algorithm converges to solutions satisfying KKT conditions.

03

Monte Carlo experiments validate the robustness and effectiveness of the new estimator.

Abstract

Many experiments in medicine and ecology can be conveniently modeled by finite Gaussian mixtures but face the problem of dealing with small data sets. We propose a robust version of the estimator based on self-regression and sparsity promoting penalization in order to estimate the components of Gaussian mixtures in such contexts. A space alternating version of the penalized EM algorithm is obtained and we prove that its cluster points satisfy the Karush-Kuhn-Tucker conditions. Monte Carlo experiments are presented in order to compare the results obtained by our method and by standard maximum likelihood estimation. In particular, our estimator is seen to perform better than the maximum likelihood estimator.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference