Gradient-based training of Gaussian Mixture Models for High-Dimensional Streaming Data
Alexander Gepperth, Benedikt Pf\"ulb

TL;DR
This paper introduces a novel SGD-based method for training Gaussian Mixture Models efficiently on high-dimensional streaming data without the need for data-driven initialization, addressing common issues like local optima and numerical instability.
Contribution
The approach enables training GMMs with minimal batch sizes and introduces an adaptive annealing and exponential-free likelihood approximation for stability and performance.
Findings
Outperforms stochastic EM on high-dimensional data
Eliminates need for k-means initialization
Handles non-stationary streaming data effectively
Abstract
We present an approach for efficiently training Gaussian Mixture Model (GMM) by Stochastic Gradient Descent (SGD) with non-stationary, high-dimensional streaming data. Our training scheme does not require data-driven parameter initialization (e.g., k-means) and can thus be trained based on a random initialization. Furthermore, the approach allows mini-batch sizes as low as 1, which are typical for streaming-data settings. Major problems in such settings are undesirable local optima during early training phases and numerical instabilities due to high data dimensionalities. We introduce an adaptive annealing procedure to address the first problem, whereas numerical instabilities are eliminated by using an exponential-free approximation to the standard GMM log-likelihood. Experiments on a variety of visual and non-visual benchmarks show that our SGD approach can be trained completely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Gaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models
MethodsStochastic Gradient Descent
