ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning
Harris Chan, Yuhuai Wu, Jamie Kiros, Sanja Fidler, Jimmy Ba

TL;DR
ACTRCE introduces a natural language goal representation to enhance reinforcement learning, enabling agents to generalize to unseen instructions and outperform previous methods like HER in complex 3D navigation tasks.
Contribution
The paper proposes ACTRCE, a novel extension of HER that uses natural language as goal representation, improving generalization and applicability in challenging RL environments.
Findings
ACTRCE outperforms HER in 3D navigation tasks.
Language goal representations enable generalization to unseen instructions.
Hindsight advice significantly improves learning efficiency.
Abstract
Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience to a successful one by relabeling the goals. Despite its effectiveness, HER has limited applicability because it lacks a compact and universal goal representation. We present Augmenting experienCe via TeacheR's adviCE (ACTRCE), an efficient reinforcement learning technique that extends the HER framework using natural language as the goal representation. We first analyze the differences among goal representation, and show that ACTRCE can efficiently solve difficult reinforcement learning problems in challenging 3D navigation tasks, whereas HER with non-language goal representation failed to learn. We also show that with language goal representations, the agent can generalize to unseen instructions, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Multimodal Machine Learning Applications
MethodsExperience Replay
