The Role of Coverage in Online Reinforcement Learning

Tengyang Xie; Dylan J. Foster; Yu Bai; Nan Jiang; Sham M. Kakade

arXiv:2210.04157·cs.LG·October 11, 2022·1 cites

The Role of Coverage in Online Reinforcement Learning

Tengyang Xie, Dylan J. Foster, Yu Bai, Nan Jiang, Sham M. Kakade

PDF

Open Access 1 Video

TL;DR

This paper reveals that good coverage conditions, especially coverability, are crucial for enabling sample-efficient online reinforcement learning, linking offline data properties to online exploration success.

Contribution

It establishes a novel connection between coverage conditions and online RL sample efficiency, introducing the sequential extrapolation coefficient as a new complexity measure.

Findings

01

Coverability enables sample-efficient online RL even without prior knowledge of the data distribution.

02

Weaker coverage notions are insufficient for online RL, despite their sufficiency for offline RL.

03

Existing complexity measures like Bellman rank do not fully capture coverability, leading to the proposal of a new measure.

Abstract

Coverage conditions -- which assert that the data logging distribution adequately covers the state space -- play a fundamental role in determining the sample complexity of offline reinforcement learning. While such conditions might seem irrelevant to online reinforcement learning at first glance, we establish a new connection by showing -- somewhat surprisingly -- that the mere existence of a data distribution with good coverage can enable sample-efficient online RL. Concretely, we show that coverability -- that is, existence of a data distribution that satisfies a ubiquitous coverage condition called concentrability -- can be viewed as a structural property of the underlying MDP, and can be exploited by standard algorithms for sample-efficient exploration, even when the agent does not know said distribution. We complement this result by proving that several weaker notions of coverage,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The Role of Coverage in Online Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Auction Theory and Applications · Game Theory and Applications