Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts

TrungKhang Tran; TrungTin Nguyen; Gersende Fort; Tung Doan; Hien Duy Nguyen; Binh T. Nguyen; Florence Forbes; Christopher Drovandi

arXiv:2601.19811·stat.ML·January 28, 2026

Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts

TrungKhang Tran, TrungTin Nguyen, Gersende Fort, Tung Doan, Hien Duy Nguyen, Binh T. Nguyen, Florence Forbes, Christopher Drovandi

PDF

Open Access

TL;DR

This paper introduces a generalized incremental stochastic Majorization-Minimization algorithm that relaxes EM constraints, providing theoretical guarantees and outperforming standard optimizers in mixture of experts models, with applications to real-world data.

Contribution

It develops a flexible incremental stochastic MM algorithm with convergence guarantees, extending EM applicability and demonstrating superior empirical performance.

Findings

01

Algorithm converges to stationary points with vanishing gradients.

02

Outperforms stochastic gradient descent, RMSProp, Adam, and second-order methods.

03

Effective on real-world bioinformatics and ecophysiological datasets.

Abstract

Processing high-volume, streaming data is increasingly common in modern statistics and machine learning, where batch-mode algorithms are often impractical because they require repeated passes over the full dataset. This has motivated incremental stochastic estimation methods, including the incremental stochastic Expectation-Maximization (EM) algorithm formulated via stochastic approximation. In this work, we revisit and analyze an incremental stochastic variant of the Majorization-Minimization (MM) algorithm, which generalizes incremental stochastic EM as a special case. Our approach relaxes key EM requirements, such as explicit latent-variable representations, enabling broader applicability and greater algorithmic flexibility. We establish theoretical guarantees for the incremental stochastic MM algorithm, proving consistency in the sense that the iterates converge to a stationary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Metaheuristic Optimization Algorithms Research · Gaussian Processes and Bayesian Inference