RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

Zelei Cheng; Xian Wu; Jiahao Yu; Sabrina Yang; Gang Wang; Xinyu Xing

arXiv:2405.03064 · cs.LG · June 7, 2024

TL;DR

This paper introduces RICE, a novel reinforcement learning refinement method that uses explanation techniques to create a better initial state distribution, helping agents overcome training bottlenecks and improve performance in complex tasks.

Contribution

RICE is a new refining scheme that integrates explanation methods to construct a mixed initial state distribution, providing theoretical guarantees and improved performance over existing methods.

Findings

01. RICE outperforms existing schemes in various RL environments.

02. The method effectively helps agents escape training bottlenecks.

03. Theoretical analysis shows tighter sub-optimality bounds.

Abstract

Deep reinforcement learning (DRL) is playing an increasingly important role in real-world applications. However, obtaining an optimally performing DRL agent for complex tasks, especially those with sparse rewards, remains a significant challenge. The training of a DRL agent can often become trapped in a bottleneck and make no further progress. In this paper, we propose RICE, an innovative refining scheme for reinforcement learning that incorporates explanation methods to break through these training bottlenecks. The high-level idea of RICE is to construct a new initial state distribution that combines the default initial states with critical states identified through explanation methods, thereby encouraging the agent to explore from the mixed initial states. Through careful design, we can theoretically guarantee that our refining scheme has a tighter sub-optimality bound. We evaluate RICE in…
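
The mixed initial state distribution described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function name `sample_mixed_initial_state`, the mixing probability `mix_prob`, and the `default_reset` callable are all hypothetical names chosen for illustration, and the explanation step that identifies the critical states is assumed to have already been run.

```python
import random
from typing import Callable, Sequence, TypeVar

State = TypeVar("State")


def sample_mixed_initial_state(
    default_reset: Callable[[], State],
    critical_states: Sequence[State],
    mix_prob: float,
    rng: random.Random,
) -> State:
    """Draw one initial state from a two-component mixture.

    With probability `mix_prob` (a hypothetical parameter), return a
    critical state chosen uniformly from `critical_states`; otherwise
    return a state from the environment's default initial distribution.
    """
    if not 0.0 <= mix_prob <= 1.0:
        raise ValueError("mix_prob must be in [0, 1]")
    # Fall back to the default distribution when no critical states exist.
    if critical_states and rng.random() < mix_prob:
        return rng.choice(critical_states)
    return default_reset()


# Illustrative usage with placeholder string states (assumed, not from the paper).
rng = random.Random(0)
starts = [
    sample_mixed_initial_state(lambda: "default", ["critical_a", "critical_b"], 0.5, rng)
    for _ in range(10)
]
```

Setting `mix_prob` to 0 recovers ordinary training from the default initial states, while larger values make the agent restart more often from the states the explanation method flagged as critical.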

Code & Models

Repositories

chengzelei/rice · PyTorch · Official

Taxonomy

Topics: Reinforcement Learning in Robotics