Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning

Guannan Qu; Adam Wierman

arXiv:2002.00260·math.OC·February 6, 2020·24 cites

TL;DR

This paper gives a finite-time convergence analysis, on a single trajectory, for asynchronous stochastic approximation (SA) schemes, with asynchronous $Q$-learning as a special case. The resulting $Q$-learning bound matches the sharpest available bound for synchronous $Q$-learning and improves on previously known bounds for the asynchronous case.

Contribution

The paper studies a general asynchronous SA scheme driven by an operator that is contractive in a weighted infinity norm, proves a finite-time convergence bound for it on a single trajectory, and specializes that bound to asynchronous $Q$-learning.
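
For orientation, a scheme of this kind is commonly written as below. This is a sketch in assumed notation: the symbols $x$, $F$, $i_t$, $\alpha_t$, $w(t)$, $v$, and $\gamma$ do not appear on this page and may differ from the paper's exact formulation.

$$
x_i(t+1) =
\begin{cases}
x_i(t) + \alpha_t \bigl( F_i(x(t)) - x_i(t) + w(t) \bigr), & i = i_t, \\
x_i(t), & i \neq i_t,
\end{cases}
$$

$$
\| F(x) - F(y) \|_v \le \gamma \, \| x - y \|_v,
\qquad
\| x \|_v = \max_i \frac{|x_i|}{v_i},
\quad v_i > 0, \ \gamma \in (0, 1).
$$

Only the component $i_t$ visited at step $t$ is updated, which is what makes the scheme asynchronous. In $Q$-learning the components are state-action pairs and $F$ is the Bellman optimality operator, which is a contraction in the unweighted infinity norm ($v_i = 1$ for all $i$).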

Findings

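01. A finite-time convergence bound for asynchronous SA schemes on a single trajectory.

02. The bound for asynchronous $Q$-learning matches the sharpest available bound for synchronous $Q$-learning.

03. The bound improves on previously known bounds for asynchronous $Q$-learning.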

Abstract

We consider a general asynchronous Stochastic Approximation (SA) scheme featuring a weighted infinity-norm contractive operator, and prove a bound on its finite-time convergence rate on a single trajectory. Additionally, we specialize the result to asynchronous $Q$-learning. The resulting bound matches the sharpest available bound for synchronous $Q$-learning, and improves over previous known bounds for asynchronous $Q$-learning.
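
To make "asynchronous $Q$-learning on a single trajectory" concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's experimental setup: the random MDP, the uniform behaviour policy, the helper names (`make_mdp`, `optimal_q`, `async_q_learning`), and the step size `h / (t + t0)` with its default values are all hypothetical choices made for this example.

```python
import numpy as np


def make_mdp(n_states=4, n_actions=2, seed=0):
    """Build a small random MDP (hypothetical, for illustration only)."""
    rng = np.random.default_rng(seed)
    # P[s, a, s'] is the probability of moving from s to s' under action a.
    P = rng.random((n_states, n_actions, n_states))
    P /= P.sum(axis=2, keepdims=True)
    # R[s, a] is a deterministic reward in [0, 1).
    R = rng.random((n_states, n_actions))
    return P, R


def optimal_q(P, R, gamma, iters=2000):
    """Reference Q* computed by synchronous value iteration on the known model."""
    Q = np.zeros_like(R)
    for _ in range(iters):
        # Bellman optimality operator applied to every (s, a) at once.
        Q = R + gamma * (P @ Q.max(axis=1))
    return Q


def async_q_learning(P, R, gamma, steps, h=20.0, t0=100.0, seed=1):
    """Q-learning along one trajectory: one (s, a) entry is updated per step."""
    rng = np.random.default_rng(seed)
    n_states, n_actions = R.shape
    Q = np.zeros((n_states, n_actions))
    s = 0
    for t in range(steps):
        a = rng.integers(n_actions)               # assumed uniform behaviour policy
        s_next = rng.choice(n_states, p=P[s, a])  # sample the next state
        alpha = h / (t + t0)                      # assumed decaying step size
        target = R[s, a] + gamma * Q[s_next].max()
        Q[s, a] += alpha * (target - Q[s, a])     # all other entries stay unchanged
        s = s_next
    return Q


if __name__ == "__main__":
    gamma = 0.5
    P, R = make_mdp()
    q_star = optimal_q(P, R, gamma)
    q_hat = async_q_learning(P, R, gamma, steps=20000)
    print("infinity-norm error:", np.max(np.abs(q_hat - q_star)))
```

The contrast with `optimal_q` is the point of the sketch: the synchronous iteration refreshes every state-action pair using the known model, while `async_q_learning` touches only the pair visited by the trajectory and uses a sampled next state, so its error depends on how often each pair is visited.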

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Privacy-Preserving Technologies in Data