Is a Good Representation Sufficient for Sample Efficient Reinforcement   Learning?

Simon S. Du; Sham M. Kakade; Ruosong Wang; Lin F. Yang

arXiv:1910.03016·cs.LG·March 2, 2020·32 cites

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?

Simon S. Du, Sham M. Kakade, Ruosong Wang, Lin F. Yang

PDF

Open Access

TL;DR

This paper investigates the limitations of good representations in achieving sample-efficient reinforcement learning, revealing that representation quality alone is insufficient unless it surpasses certain thresholds, with implications for various learning paradigms.

Contribution

The work establishes sharp thresholds for representation quality necessary for sample-efficient RL and demonstrates fundamental limitations and exponential separations between different learning methods.

Findings

01

Good representations are not sufficient for sample-efficient RL without passing certain quality thresholds.

02

There are fundamental lower bounds on the dimensionality of representations for effective RL.

03

Exponential sample complexity gaps exist between perfect and suboptimal representations across learning paradigms.

Abstract

Modern deep learning methods provide effective means to learn good representations. However, is a good representation itself sufficient for sample efficient reinforcement learning? This question has largely been studied only with respect to (worst-case) approximation error, in the more classical approximate dynamic programming literature. With regards to the statistical viewpoint, this question is largely unexplored, and the extant body of literature mainly focuses on conditions which permit sample efficient reinforcement learning with little understanding of what are necessary conditions for efficient reinforcement learning. This work shows that, from the statistical viewpoint, the situation is far subtler than suggested by the more traditional approximation viewpoint, where the requirements on the representation that suffice for sample efficient RL are even more stringent. Our main…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Machine Learning and Algorithms