Learning Dynamic Abstract Representations for Sample-Efficient   Reinforcement Learning

Mehdi Dadvar; Rashmeet Kaur Nayyar; Siddharth Srivastava

arXiv:2210.01955·cs.LG·December 9, 2022

Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning

Mehdi Dadvar, Rashmeet Kaur Nayyar, Siddharth Srivastava

PDF

Open Access

TL;DR

This paper introduces a domain-independent method for dynamically learning state abstractions during reinforcement learning, significantly improving sample efficiency and performance across various problems.

Contribution

It proposes a novel top-down, dynamic abstraction method based on Q-value dispersion, reducing the need for manual abstraction design in RL.

Findings

01

Automatically learns problem-specific abstractions

02

Achieves higher sample efficiency

03

Outperforms existing approaches in multiple domains

Abstract

In many real-world problems, the learning agent needs to learn a problem's abstractions and solution simultaneously. However, most such abstractions need to be designed and refined by hand for different problems and domains of application. This paper presents a novel top-down approach for constructing state abstractions while carrying out reinforcement learning. Starting with state variables and a simulator, it presents a novel domain-independent approach for dynamically computing an abstraction based on the dispersion of Q-values in abstract states as the agent continues acting and learning. Extensive empirical evaluation on multiple domains and problems shows that this approach automatically learns abstractions that are finely-tuned to the problem, yield powerful sample efficiency, and result in the RL agent significantly outperforming existing approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Evolutionary Algorithms and Applications