Characterizing Evolution in Expectation-Maximization Estimates for Overspecified Mixed Linear Regression
Zhankun Luo, Abolfazl Hashemi

TL;DR
This paper provides a theoretical analysis of the EM algorithm's convergence behavior and statistical accuracy in overspecified mixed linear regression, highlighting differences based on initial weight balance and extending to low SNR scenarios.
Contribution
It characterizes the convergence rates and statistical accuracy of EM in overspecified 2MLR, revealing how initial weight balance affects efficiency and accuracy, and extends analysis to low SNR regimes.
Findings
Linear convergence with unbalanced weights at population level
Sublinear convergence with balanced weights at population level
Statistical accuracy depends on weight balance and sample size
Abstract
Mixture models have attracted significant attention due to practical effectiveness and comprehensive theoretical foundations. A persisting challenge is model misspecification, which occurs when the model to be fitted has more mixture components than those in the data distribution. In this paper, we develop a theoretical understanding of the Expectation-Maximization (EM) algorithm's behavior in the context of targeted model misspecification for overspecified two-component Mixed Linear Regression (2MLR) with unknown -dimensional regression parameters and mixing weights. In Theorem 5.1 at the population level, with an unbalanced initial guess for mixing weights, we establish linear convergence of regression parameters in steps. Conversely, with a balanced initial guess for mixing weights, we observe sublinear convergence in steps to achieve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
