Expectation Maximization (EM) Converges for General Agnostic Mixtures

Avishek Ghosh

arXiv:2604.05842·cs.LG·April 8, 2026

Expectation Maximization (EM) Converges for General Agnostic Mixtures

Avishek Ghosh

PDF

TL;DR

This paper extends the convergence analysis of gradient EM algorithms to a broad class of parametric function fitting problems in the agnostic setting, beyond traditional mixture linear regression.

Contribution

It demonstrates exponential convergence of gradient EM for fitting arbitrary parametric functions with strongly convex and smooth loss functions under certain conditions.

Findings

01

Gradient EM converges exponentially to population loss minimizers.

02

The framework includes mixed linear regression, classifiers, and generalized linear models.

03

Proper initialization and separation conditions are crucial for convergence.

Abstract

Mixture of linear regression is well studied in statistics and machine learning, where the data points are generated probabilistically using $k$ linear models. Algorithms like Expectation Maximization (EM) may be used to recover the ground truth regressors for this problem. Recently, in \cite{pal2022learning,ghosh_agnostic} the mixed linear regression problem is studied in the agnostic setting, where no generative model on data is assumed. Rather, given a set of data points, the objective is \emph{fit} $k$ lines by minimizing a suitable loss function. It is shown that a modification of EM, namely gradient EM converges exponentially to appropriately defined loss minimizer even in the agnostic setting. In this paper, we study the problem of \emph{fitting} $k$ parametric functions to given set of data points. We adhere to the agnostic setup. However, instead of fitting lines equipped…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.