Expectation Maximization (EM) Converges for General Agnostic Mixtures
Avishek Ghosh

TL;DR
This paper extends the convergence analysis of gradient EM algorithms to a broad class of parametric function fitting problems in the agnostic setting, beyond traditional mixture linear regression.
Contribution
It demonstrates exponential convergence of gradient EM for fitting arbitrary parametric functions with strongly convex and smooth loss functions under certain conditions.
Findings
Gradient EM converges exponentially to population loss minimizers.
The framework includes mixed linear regression, classifiers, and generalized linear models.
Proper initialization and separation conditions are crucial for convergence.
Abstract
Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground truth regressors for this problem. Recently, in \cite{pal2022learning,ghosh_agnostic} the mixed linear regression problem is studied in the agnostic setting, where no generative model on data is assumed. Rather, given a set of data points, the objective is \emph{fit} lines by minimizing a suitable loss function. It is shown that a modification of EM, namely gradient EM converges exponentially to appropriately defined loss minimizer even in the agnostic setting. In this paper, we study the problem of \emph{fitting} parametric functions to given set of data points. We adhere to the agnostic setup. However, instead of fitting lines equipped…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
