Characterizing Evolution in Expectation-Maximization Estimates for Overspecified Mixed Linear Regression

Zhankun Luo; Abolfazl Hashemi

arXiv:2508.10154·cs.LG·March 9, 2026

Characterizing Evolution in Expectation-Maximization Estimates for Overspecified Mixed Linear Regression

Zhankun Luo, Abolfazl Hashemi

PDF

TL;DR

This paper provides a theoretical analysis of the EM algorithm's convergence behavior and statistical accuracy in overspecified mixed linear regression, highlighting differences based on initial weight balance and extending to low SNR scenarios.

Contribution

It characterizes the convergence rates and statistical accuracy of EM in overspecified 2MLR, revealing how initial weight balance affects efficiency and accuracy, and extends analysis to low SNR regimes.

Findings

01

Linear convergence with unbalanced weights at population level

02

Sublinear convergence with balanced weights at population level

03

Statistical accuracy depends on weight balance and sample size

Abstract

Mixture models have attracted significant attention due to practical effectiveness and comprehensive theoretical foundations. A persisting challenge is model misspecification, which occurs when the model to be fitted has more mixture components than those in the data distribution. In this paper, we develop a theoretical understanding of the Expectation-Maximization (EM) algorithm's behavior in the context of targeted model misspecification for overspecified two-component Mixed Linear Regression (2MLR) with unknown $d$ -dimensional regression parameters and mixing weights. In Theorem 5.1 at the population level, with an unbalanced initial guess for mixing weights, we establish linear convergence of regression parameters in $O (lo g (1/ ϵ))$ steps. Conversely, with a balanced initial guess for mixing weights, we observe sublinear convergence in $O (ϵ^{- 2})$ steps to achieve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.