Regularized EM Algorithms: A Unified Framework and Statistical Guarantees
Xinyang Yi, Constantine Caramanis

TL;DR
This paper introduces a unified regularized EM framework for high-dimensional latent variable models, balancing optimization and statistical errors, with guarantees demonstrated on various models like sparse Gaussian mixtures and high-dimensional regression.
Contribution
It develops a general regularized EM algorithm framework that addresses high-dimensional challenges and provides statistical guarantees for multiple models.
Findings
The framework achieves linear local convergence in high-dimensional settings.
Regularization balances optimization progress and structure identification.
Statistical guarantees are established for Gaussian mixtures, mixed regression, and missing data models.
Abstract
Latent variable models are a fundamental modeling tool in machine learning applications, but they present significant computational and analytical challenges. The popular EM algorithm and its variants, is a much used algorithmic tool; yet our rigorous understanding of its performance is highly incomplete. Recently, work in Balakrishnan et al. (2014) has demonstrated that for an important class of problems, EM exhibits linear local convergence. In the high-dimensional setting, however, the M-step may not be well defined. We address precisely this setting through a unified treatment using regularization. While regularization for high-dimensional problems is by now well understood, the iterative EM algorithm requires a careful balancing of making progress towards the solution while identifying the right structure (e.g., sparsity or low-rank). In particular, regularizing the M-step using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Bayesian Methods and Mixture Models · Statistical Methods and Inference
