Big Learning Expectation Maximization
Yulai Cong, Sijia Li

TL;DR
This paper introduces Big Learning EM, an enhanced EM algorithm inspired by foundation models, which improves training robustness for mixture models by better avoiding poor local optima, demonstrated through experiments and benchmarks.
Contribution
We propose Big Learning EM, a novel EM upgrade leveraging big learning principles to enhance mixture model training and avoid bad local optima.
Findings
Empirically achieves near-optimal solutions with high probability.
Outperforms existing techniques on benchmark clustering datasets.
Demonstrates effectiveness and advantages through simulated and real data experiments.
Abstract
Mixture models serve as one fundamental tool with versatile applications. However, their training techniques, like the popular Expectation Maximization (EM) algorithm, are notoriously sensitive to parameter initialization and often suffer from bad local optima that could be arbitrarily worse than the optimal. To address the long-lasting bad-local-optima challenge, we draw inspiration from the recent ground-breaking foundation models and propose to leverage their underlying big learning principle to upgrade the EM. Specifically, we present the Big Learning EM (BigLearn-EM), an EM upgrade that simultaneously performs joint, marginal, and orthogonally transformed marginal matchings between data and model distributions. Through simulated experiments, we empirically show that the BigLearn-EM is capable of delivering the optimal with high probability; comparisons on benchmark clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Domain Adaptation and Few-Shot Learning
