Markovian Foundations for Quasi-Stochastic Approximation with   Applications to Extremum Seeking Control

Caio Kalil Lauand; Sean Meyn

arXiv:2207.06371·math.OC·April 2, 2024·5 cites

Markovian Foundations for Quasi-Stochastic Approximation with Applications to Extremum Seeking Control

Caio Kalil Lauand, Sean Meyn

PDF

Open Access

TL;DR

This paper develops a Markovian framework for quasi-stochastic approximation, providing new theoretical insights, error bounds, and stability results applicable to optimization and extremum seeking control, especially under non-Lipschitz conditions.

Contribution

It introduces a novel Markovian foundation for QSA, derives an exact ODE representation, and extends stability and error analysis to non-Lipschitz algorithms in extremum seeking.

Findings

01

Error bound of order O(α) reduced to O(α^2) with filters

02

New stability results for non-Lipschitz extremum seeking algorithms

03

Potential for error bounds better than O(α) with Markovian noise

Abstract

This paper concerns quasi-stochastic approximation (QSA) to solve root finding problems commonly found in applications to optimization and reinforcement learning. The general constant gain algorithm may be expressed as the time-inhomogeneous ODE $\frac{d}{d t} Θ_{t} = α f_{t} (Θ_{t})$ , with state process $Θ$ evolving on $R^{d}$ . Theory is based on an almost periodic vector field, so that in particular the time average of $f_{t} (θ)$ defines the time-homogeneous mean vector field $\overset{ˉ}{f} : R^{d} \to R^{d}$ with $\overset{ˉ}{f} (θ^{*}) = 0$ . Under smoothness assumptions on the functions involved, the following exact representation is obtained: \[\frac{d}{dt}\Theta_t=\alpha[\bar{f}(\Theta_t)-\alpha\bar\Upsilon_t+\alpha^2\mathcal{W}_t^0+\alpha\frac{d}{dt}\mathcal{W}_t^1+\frac{d^2}{dt^2}\mathcal{W}_t^2]\] along with formulae for the smooth signals $\{\bar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExtremum Seeking Control Systems · Stochastic processes and financial applications