ETHER: Aligning Emergent Communication for Hindsight Experience Replay
Kevin Denamgana\"i, Daniel Hernandez, Ozan Vardal, Sondess Missaoui,, James Alfred Walker

TL;DR
This paper introduces ETHER, an approach that uses emergent communication and a referential game to improve language grounding and feedback in reinforcement learning, enhancing Hindsight Experience Replay without relying on oracle functions.
Contribution
The paper proposes ETHER, which aligns emergent language with natural language and leverages unsupervised referential games to improve HER's applicability and performance.
Findings
Emergent language aligns with natural language in BabyAI.
Using all trajectories improves RL performance.
EC as an auxiliary task enhances HER's data efficiency.
Abstract
Natural language instruction following is paramount to enable collaboration between artificial agents and human beings. Natural language-conditioned reinforcement learning (RL) agents have shown how natural languages' properties, such as compositionality, can provide a strong inductive bias to learn complex policies. Previous architectures like HIGhER combine the benefit of language-conditioning with Hindsight Experience Replay (HER) to deal with sparse rewards environments. Yet, like HER, HIGhER relies on an oracle predicate function to provide a feedback signal highlighting which linguistic description is valid for which state. This reliance on an oracle limits its application. Additionally, HIGhER only leverages the linguistic information contained in successful RL trajectories, thus hurting its final performance and data-efficiency. Without early successful trajectories, HIGhER is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Speech and dialogue systems · Language, Metaphor, and Cognition
MethodsDense Connections · Convolution · Q-Learning · Deep Q-Network · Experience Replay · ALIGN
