Markovian Foundations for Quasi-Stochastic Approximation with Applications to Extremum Seeking Control
Caio Kalil Lauand, Sean Meyn

TL;DR
This paper develops a Markovian framework for quasi-stochastic approximation, providing new theoretical insights, error bounds, and stability results applicable to optimization and extremum seeking control, especially under non-Lipschitz conditions.
Contribution
It introduces a novel Markovian foundation for QSA, derives an exact ODE representation, and extends stability and error analysis to non-Lipschitz algorithms in extremum seeking.
Findings
Error bound of order O(α) reduced to O(α^2) with filters
New stability results for non-Lipschitz extremum seeking algorithms
Potential for error bounds better than O(α) with Markovian noise
Abstract
This paper concerns quasi-stochastic approximation (QSA) to solve root finding problems commonly found in applications to optimization and reinforcement learning. The general constant gain algorithm may be expressed as the time-inhomogeneous ODE , with state process evolving on . Theory is based on an almost periodic vector field, so that in particular the time average of defines the time-homogeneous mean vector field with . Under smoothness assumptions on the functions involved, the following exact representation is obtained: \[\frac{d}{dt}\Theta_t=\alpha[\bar{f}(\Theta_t)-\alpha\bar\Upsilon_t+\alpha^2\mathcal{W}_t^0+\alpha\frac{d}{dt}\mathcal{W}_t^1+\frac{d^2}{dt^2}\mathcal{W}_t^2]\] along with formulae for the smooth signals $\{\bar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExtremum Seeking Control Systems · Stochastic processes and financial applications
