Replicability in Reinforcement Learning

Amin Karbasi; Grigoris Velegkas; Lin F. Yang; Felix Zhou

arXiv:2305.19562·cs.LG·October 31, 2023·1 cites

Replicability in Reinforcement Learning

Amin Karbasi, Grigoris Velegkas, Lin F. Yang, Felix Zhou

PDF

Open Access 1 Video

TL;DR

This paper introduces the concept of replicability in reinforcement learning, providing algorithms and bounds for policy estimation that ensure consistent outputs across independent runs, and explores relaxed and approximate notions of replicability.

Contribution

It formalizes replicability in RL, develops efficient algorithms with theoretical guarantees, and introduces relaxed and approximate versions with improved sample complexities.

Findings

01

Efficient $ ho$-replicable algorithm with specific sample complexity.

02

Lower bounds for deterministic algorithms on replicability.

03

A TV indistinguishable algorithm with reduced sample complexity.

Abstract

We initiate the mathematical study of replicability as an algorithmic property in the context of reinforcement learning (RL). We focus on the fundamental setting of discounted tabular MDPs with access to a generative model. Inspired by Impagliazzo et al. [2022], we say that an RL algorithm is replicable if, with high probability, it outputs the exact same policy after two executions on i.i.d. samples drawn from the generator when its internal randomness is the same. We first provide an efficient $ρ$ -replicable algorithm for $(ε, δ)$ -optimal policy estimation with sample and time complexity $O (\frac{N ^{3} \cdot l o g ( 1/ δ )}{( 1 - γ ) ^{5} \cdot ε ^{2} \cdot ρ ^{2}})$ , where $N$ is the number of state-action pairs. Next, for the subclass of deterministic algorithms, we provide a lower bound of order…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Replicability in Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Auction Theory and Applications

MethodsFocus