A Dynamical Systems Approach for Convergence of the Bayesian EM Algorithm
Orlando Romero, Subhro Das, Pin-Yu Chen, S\'ergio Pequito

TL;DR
This paper applies Lyapunov stability theory from systems and control to analyze and establish convergence conditions for the Bayesian EM algorithm in machine learning, revealing potential for faster convergence under certain assumptions.
Contribution
It introduces a dynamical systems framework using Lyapunov stability to analyze convergence of the MAP-EM algorithm, extending the theoretical understanding of its behavior.
Findings
Provided convergence conditions for MAP-EM using Lyapunov theory
Shown that under certain assumptions, EM can achieve linear or quadratic convergence
Expanded the set of sufficient conditions for EM algorithm applications
Abstract
Out of the recent advances in systems and control (S\&C)-based analysis of optimization algorithms, not enough work has been specifically dedicated to machine learning (ML) algorithms and its applications. This paper addresses this gap by illustrating how (discrete-time) Lyapunov stability theory can serve as a powerful tool to aid, or even lead, in the analysis (and potential design) of optimization algorithms that are not necessarily gradient-based. The particular ML problem that this paper focuses on is that of parameter estimation in an incomplete-data Bayesian framework via the popular optimization algorithm known as maximum a posteriori expectation-maximization (MAP-EM). Following first principles from dynamical systems stability theory, conditions for convergence of MAP-EM are developed. Furthermore, if additional assumptions are met, we show that fast convergence (linear or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Blind Source Separation Techniques · Bayesian Methods and Mixture Models
