$k$-MLE: A fast algorithm for learning statistical mixture models
Frank Nielsen

TL;DR
The paper introduces $k$-MLE, a fast local search algorithm for learning mixture models that leverages Bregman divergences and can be efficiently initialized with $k$-MLE++ for improved convergence.
Contribution
It presents a novel, efficient local search algorithm for mixture models, connecting Bregman divergences with hard clustering, and proposes a probabilistic initialization method.
Findings
$k$-MLE converges reliably to high-likelihood solutions.
The algorithm is faster than traditional EM methods.
Initialization with $k$-MLE++ improves convergence quality.
Abstract
We describe -MLE, a fast and efficient local search algorithm for learning finite statistical mixtures of exponential families such as Gaussian mixture models. Mixture models are traditionally learned using the expectation-maximization (EM) soft clustering technique that monotonically increases the incomplete (expected complete) likelihood. Given prescribed mixture weights, the hard clustering -MLE algorithm iteratively assigns data to the most likely weighted component and update the component models using Maximum Likelihood Estimators (MLEs). Using the duality between exponential families and Bregman divergences, we prove that the local convergence of the complete likelihood of -MLE follows directly from the convergence of a dual additively weighted Bregman hard clustering. The inner loop of -MLE can be implemented using any -means heuristic like the celebrated Lloyd's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
