Necessary and Sufficient Oracles: Toward a Computational Taxonomy For   Reinforcement Learning

Dhruv Rohatgi; Dylan J. Foster

arXiv:2502.08632·cs.LG·February 13, 2025

Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning

Dhruv Rohatgi, Dylan J. Foster

PDF

Open Access

TL;DR

This paper develops a computational taxonomy for reinforcement learning by analyzing the impact of different supervised learning oracles on RL complexity, identifying minimal and near-minimal oracles in various models.

Contribution

It introduces a formal framework to compare supervised learning oracles in RL, establishing minimality results and the benefits of resets in certain settings.

Findings

01

Two-context regression is minimal in Block MDPs with episodic access.

02

One-context regression is near-minimal with reset access, showing reset benefits.

03

Cryptographic evidence shows the minimal oracle is insufficient for Low-Rank MDPs.

Abstract

Algorithms for reinforcement learning (RL) in large state spaces crucially rely on supervised learning subroutines to estimate objects such as value functions or transition probabilities. Since only the simplest supervised learning problems can be solved provably and efficiently, practical performance of an RL algorithm depends on which of these supervised learning "oracles" it assumes access to (and how they are implemented). But which oracles are better or worse? Is there a minimal oracle? In this work, we clarify the impact of the choice of supervised learning oracle on the computational complexity of RL, as quantified by the oracle strength. First, for the task of reward-free exploration in Block MDPs in the standard episodic access model -- a ubiquitous setting for RL with function approximation -- we identify two-context regression as a minimal oracle, i.e. an oracle that is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Computability, Logic, AI Algorithms

MethodsFocus