On the stability of some controlled Markov chains and its applications   to stochastic approximation with Markovian dynamic

Christophe Andrieu; Vladislav B. Tadi\'c; Matti Vihola

arXiv:1205.4181·math.ST·February 2, 2015

On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic

Christophe Andrieu, Vladislav B. Tadi\'c, Matti Vihola

PDF

TL;DR

This paper presents a practical Lyapunov-based method to establish the stability of controlled Markov chains, including those with time-scale separation, and applies it to prove convergence in stochastic approximation algorithms.

Contribution

It introduces a novel approach combining Lyapunov functions for joint processes to prove stability and convergence, even with adaptive step sizes and time-scale separation.

Findings

01

Applicable to a broad class of controlled Markov chains

02

Ensures convergence of stochastic approximation algorithms

03

Validates methods in computational statistics

Abstract

We develop a practical approach to establish the stability, that is, the recurrence in a given set, of a large class of controlled Markov chains. These processes arise in various areas of applied science and encompass important numerical methods. We show in particular how individual Lyapunov functions and associated drift conditions for the parametrized family of Markov transition probabilities and the parameter update can be combined to form Lyapunov functions for the joint process, leading to the proof of the desired stability property. Of particular interest is the fact that the approach applies even in situations where the two components of the process present a time-scale separation, which is a crucial feature of practical situations. We then move on to show how such a recurrence property can be used in the context of stochastic approximation in order to prove the convergence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.