ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

Harris Chan; Yuhuai Wu; Jamie Kiros; Sanja Fidler; Jimmy Ba

arXiv:1902.04546·cs.LG·February 13, 2019·21 cites

Open Access

TL;DR

ACTRCE introduces a natural language goal representation to enhance reinforcement learning, enabling agents to generalize to unseen instructions and outperform previous methods like HER in complex 3D navigation tasks.

Contribution

The paper proposes ACTRCE, a novel extension of HER that uses natural language as goal representation, improving generalization and applicability in challenging RL environments.

Findings

1. ACTRCE solves challenging 3D navigation tasks on which HER with a non-language goal representation failed to learn.

2. Language goal representations enable the agent to generalize to unseen instructions.

3. Hindsight advice significantly improves learning efficiency.

Abstract

Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience into a successful one by relabeling the goals. Despite its effectiveness, HER has limited applicability because it lacks a compact and universal goal representation. We present Augmenting experienCe via TeacheR's adviCE (ACTRCE), an efficient reinforcement learning technique that extends the HER framework using natural language as the goal representation. We first analyze the differences among goal representations, and show that ACTRCE can efficiently solve difficult reinforcement learning problems in challenging 3D navigation tasks, whereas HER with a non-language goal representation failed to learn. We also show that with language goal representations, the agent can generalize to unseen instructions, and…
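
The relabeling idea described in the abstract can be illustrated with a minimal sketch. It assumes a toy setting that is not taken from the paper: states are plain strings, and a hypothetical `toy_teacher` function returns a natural-language description of what the agent actually reached. The names `Transition`, `toy_teacher`, and `relabel_with_advice`, and the 0/1 reward scheme, are illustrative assumptions rather than the authors' implementation.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Transition:
    state: str       # toy stand-in for an observation (assumption)
    action: str
    next_state: str
    goal: str        # natural-language instruction
    reward: float


def toy_teacher(state: str) -> Optional[str]:
    """Hypothetical teacher: describe the reached state in language, or None."""
    prefix = "at "
    if state.startswith(prefix):
        return "go to the " + state[len(prefix):]
    return None


def relabel_with_advice(
    episode: List[Transition],
    teacher: Callable[[str], Optional[str]],
) -> Optional[List[Transition]]:
    """Copy an episode, swapping its goal for the teacher's advice.

    The advice describes the final state, so the relabeled episode counts
    as a success: reward 1.0 on the last step, 0.0 elsewhere (assumed scheme).
    Returns None when the teacher has nothing to say about the final state.
    """
    if not episode:
        return None
    advice = teacher(episode[-1].next_state)
    if advice is None:
        return None
    last = len(episode) - 1
    return [
        replace(t, goal=advice, reward=1.0 if i == last else 0.0)
        for i, t in enumerate(episode)
    ]


# A failed episode: the agent was told to reach the red torch but ended
# at the blue pillar, so every original reward is zero.
episode = [
    Transition("start", "forward", "corridor", "go to the red torch", 0.0),
    Transition("corridor", "left", "at blue pillar", "go to the red torch", 0.0),
]
relabeled = relabel_with_advice(episode, toy_teacher)
```

In this sketch both the original and the relabeled episode would be stored in the replay buffer, so the failed attempt still yields a rewarded example for the instruction the agent happened to satisfy.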

Taxonomy

Topics: Reinforcement Learning in Robotics · Artificial Intelligence in Games · Multimodal Machine Learning Applications

Methods: Experience Replay