Big Learning Expectation Maximization

Yulai Cong; Sijia Li

arXiv:2312.11926·cs.LG·December 20, 2023·1 cites

Big Learning Expectation Maximization

Yulai Cong, Sijia Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces Big Learning EM, an enhanced EM algorithm inspired by foundation models, which improves training robustness for mixture models by better avoiding poor local optima, demonstrated through experiments and benchmarks.

Contribution

We propose Big Learning EM, a novel EM upgrade leveraging big learning principles to enhance mixture model training and avoid bad local optima.

Findings

01

Empirically achieves near-optimal solutions with high probability.

02

Outperforms existing techniques on benchmark clustering datasets.

03

Demonstrates effectiveness and advantages through simulated and real data experiments.

Abstract

Mixture models serve as one fundamental tool with versatile applications. However, their training techniques, like the popular Expectation Maximization (EM) algorithm, are notoriously sensitive to parameter initialization and often suffer from bad local optima that could be arbitrarily worse than the optimal. To address the long-lasting bad-local-optima challenge, we draw inspiration from the recent ground-breaking foundation models and propose to leverage their underlying big learning principle to upgrade the EM. Specifically, we present the Big Learning EM (BigLearn-EM), an EM upgrade that simultaneously performs joint, marginal, and orthogonally transformed marginal matchings between data and model distributions. Through simulated experiments, we empirically show that the BigLearn-EM is capable of delivering the optimal with high probability; comparisons on benchmark clustering…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yulaicong/big-learning-expectation-maximization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Domain Adaptation and Few-Shot Learning