Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift
Xinyu Liu, Zixuan Xie, Shangtong Zhang

TL;DR
This paper derives almost sure convergence rates for stochastic approximation algorithms with Markovian noise, including reinforcement learning methods like Q-learning, using a novel Lyapunov drift approach.
Contribution
It introduces a new Lyapunov drift technique combining Poisson-equation correction and Moreau-envelope smoothing for analyzing convergence rates.
Findings
Achieves convergence rate close to o(n^{1 - 2ta}) for power-law learning rates
Obtains near-optimal convergence rate close to o(n^{-1}) for harmonic learning rates
Provides a theoretical framework applicable to reinforcement learning algorithms with Markovian noise
Abstract
Establishing almost sure convergence rates for stochastic approximation and reinforcement learning under Markovian noise is a fundamental theoretical challenge. We make progress towards this challenge for a class of stochastic approximation algorithms whose expected updates are contractive, a setting that arises in many reinforcement learning algorithms such as -learning and linear temporal difference learning. Specifically, for a power-law learning rate with , we obtain an almost sure convergence rate arbitrarily close to . For a harmonic learning rate , we obtain an almost sure convergence rate arbitrarily close to , which we argue is a strong result because it is close to the optimal rate given by the law of the iterated logarithm (for a special case of i.i.d. noise). Key to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
