Reinforcement Learning, Bit by Bit

Xiuyuan Lu; Benjamin Van Roy; Vikranth Dwaracherla; Morteza Ibrahimi,; Ian Osband; Zheng Wen

arXiv:2103.04047·cs.LG·May 9, 2023·5 cites

Reinforcement Learning, Bit by Bit

Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi,, Ian Osband, Zheng Wen

PDF

Open Access

TL;DR

This paper explores principles of data efficiency in reinforcement learning, focusing on information acquisition and retention, and demonstrates simple agents that improve data efficiency through theoretical insights and computational results.

Contribution

It introduces a conceptual framework and regret analysis for understanding data-efficient reinforcement learning, along with simple agent designs illustrating these ideas.

Findings

01

Principled guidance on what information to seek and retain

02

Simple agents demonstrating improved data efficiency

03

Computational results validating the concepts

Abstract

Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We discuss concepts and regret analysis that together offer principled guidance. This line of thinking sheds light on questions of what information to seek, how to seek that information, and what information to retain. To illustrate concepts, we design simple agents that build on them and present computational results that highlight data efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Data Stream Mining Techniques