Comments on the Du-Kakade-Wang-Yang Lower Bounds

Benjamin Van Roy; Shi Dong

arXiv:1911.07910·cs.LG·November 20, 2019·26 cites

Comments on the Du-Kakade-Wang-Yang Lower Bounds

Benjamin Van Roy, Shi Dong

PDF

Open Access

TL;DR

This paper compares and reconciles different theoretical results on the sample complexity of reinforcement learning, focusing on lower bounds and the role of the eluder dimension in problem tractability.

Contribution

It clarifies the relationship between recent lower bounds and the eluder dimension framework, providing a unified understanding of problem complexity in reinforcement learning.

Findings

01

Reconciles interpretations of lower bounds and eluder dimension.

02

Highlights conditions under which RL problems are tractable or intractable.

03

Provides insights into the theoretical landscape of sample complexity in RL.

Abstract

Du, Kakade, Wang, and Yang recently established intriguing lower bounds on sample complexity, which suggest that reinforcement learning with a misspecified representation is intractable. Another line of work, which centers around a statistic called the eluder dimension, establishes tractability of problems similar to those considered in the Du-Kakade-Wang-Yang paper. We compare these results and reconcile interpretations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications