Improve the efficiency of deep reinforcement learning through semantic   exploration guided by natural language

Zhourui Guo; Meng Yao; Yang Yu; Qiyue Yin

arXiv:2309.11753·cs.AI·September 22, 2023

Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language

Zhourui Guo, Meng Yao, Yang Yu, Qiyue Yin

PDF

Open Access

TL;DR

This paper introduces a retrieval-based semantic exploration method guided by natural language to improve the efficiency of deep reinforcement learning, especially in sparse-reward tasks, by selectively querying an oracle using relevant past interactions.

Contribution

It proposes a novel approach for efficient oracle interaction in RL using neural encoding and retrieval of relevant questions from a large corpus, reducing interaction costs.

Findings

01

Significantly reduces the number of interactions needed for learning.

02

Improves performance in sparse-reward object manipulation tasks.

03

Outperforms baseline methods without retrieval-based guidance.

Abstract

Reinforcement learning is a powerful technique for learning from trial and error, but it often requires a large number of interactions to achieve good performance. In some domains, such as sparse-reward tasks, an oracle that can provide useful feedback or guidance to the agent during the learning process is really of great importance. However, querying the oracle too frequently may be costly or impractical, and the oracle may not always have a clear answer for every situation. Therefore, we propose a novel method for interacting with the oracle in a selective and efficient way, using a retrieval-based approach. We assume that the interaction can be modeled as a sequence of templated questions and answers, and that there is a large corpus of previous interactions available. We use a neural network to encode the current state of the agent and the oracle, and retrieve the most relevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Software Engineering Research · Machine Learning and Data Classification