Detection and Feature Selection in Sparse Mixture Models

Nicolas Verzelen; Ery Arias-Castro

arXiv:1405.1478·math.ST·October 4, 2016

Detection and Feature Selection in Sparse Mixture Models

Nicolas Verzelen, Ery Arias-Castro

PDF

TL;DR

This paper investigates detection and feature selection in high-dimensional Gaussian mixture models, deriving theoretical bounds and evaluating procedures like eigenvalue-based tests and moment-based tests under sparsity assumptions.

Contribution

It provides new information bounds and compares the performance of various detection procedures in sparse high-dimensional Gaussian mixtures.

Findings

01

Eigenvalue-based tests perform well under certain sparsity conditions

02

Moment-based tests like skewness and kurtosis are effective with improved null control

03

Theoretical bounds guide the choice of detection methods in high-dimensional settings

Abstract

We consider Gaussian mixture models in high dimensions and concentrate on the twin tasks of detection and feature selection. Under sparsity assumptions on the difference in means, we derive information bounds and establish the performance of various procedures, including the top sparse eigenvalue of the sample covariance matrix and other projection tests based on moments, such as the skewness and kurtosis tests of Malkovich and Afifi (1973), and other variants which we were better able to control under the null.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.