ETHER: Aligning Emergent Communication for Hindsight Experience Replay

Kevin Denamganaï; Daniel Hernandez; Ozan Vardal; Sondess Missaoui; James Alfred Walker

arXiv:2307.15494 · cs.CL · December 19, 2023

Open Access

TL;DR

This paper introduces ETHER, an approach that uses emergent communication and a referential game to improve language grounding and feedback in reinforcement learning, enhancing Hindsight Experience Replay without relying on oracle functions.

Contribution

The paper proposes ETHER, which aligns emergent language with natural language and leverages unsupervised referential games to improve HER's applicability and performance.

Findings

01 Emergent language aligns with natural language in BabyAI.

02 Using all trajectories, not only successful ones, improves RL performance.

03 Emergent communication (EC) as an auxiliary task enhances HER's data efficiency.

Abstract

Natural language instruction following is paramount to enable collaboration between artificial agents and human beings. Natural language-conditioned reinforcement learning (RL) agents have shown how natural languages' properties, such as compositionality, can provide a strong inductive bias to learn complex policies. Previous architectures like HIGhER combine the benefit of language-conditioning with Hindsight Experience Replay (HER) to deal with sparse-reward environments. Yet, like HER, HIGhER relies on an oracle predicate function to provide a feedback signal highlighting which linguistic description is valid for which state. This reliance on an oracle limits its application. Additionally, HIGhER only leverages the linguistic information contained in successful RL trajectories, thus hurting its final performance and data-efficiency. Without early successful trajectories, HIGhER is…
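To make the oracle dependence described in the abstract concrete, here is a minimal sketch of hindsight relabelling with a language goal. It is not the paper's implementation: the `Transition` type, the `relabel_with_hindsight` helper, and the `toy_oracle` predicate are all hypothetical names invented for illustration, and states and goals are reduced to plain strings. The sketch shows only the generic HER-style step: a trajectory that failed its original instruction is relabelled with a description that the predicate function declares valid for the final state.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass
class Transition:
    state: str    # toy stand-in for an observation (here: the object held)
    goal: str     # linguistic instruction the agent was conditioned on
    reward: float


def relabel_with_hindsight(
    trajectory: Sequence[Transition],
    candidate_goals: Sequence[str],
    predicate: Callable[[str, str], bool],
) -> Optional[List[Transition]]:
    """Relabel a trajectory with the first goal the predicate accepts
    for its final state; return None if no candidate goal is valid."""
    final_state = trajectory[-1].state
    last = len(trajectory) - 1
    for goal in candidate_goals:
        if predicate(final_state, goal):
            # Sparse reward: only the final step of the relabelled
            # trajectory is rewarded.
            return [
                Transition(t.state, goal, 1.0 if i == last else 0.0)
                for i, t in enumerate(trajectory)
            ]
    return None


def toy_oracle(state: str, goal: str) -> bool:
    # Hypothetical oracle predicate: the goal is valid when it names
    # the object described by the state.
    return goal == f"pick up the {state}"


# A failed episode: the instruction asked for the blue key, but the
# agent ended up holding the red ball.
failed = [
    Transition("nothing", "pick up the blue key", 0.0),
    Transition("red ball", "pick up the blue key", 0.0),
]
goals = ["pick up the blue key", "pick up the red ball"]
relabelled = relabel_with_hindsight(failed, goals, toy_oracle)
```

In this toy setting the predicate is hand-written, which is exactly the kind of oracle the abstract identifies as a limitation; the TL;DR above indicates that ETHER aims to avoid relying on such an oracle.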

Taxonomy

Topics: Language and cultural evolution · Speech and dialogue systems · Language, Metaphor, and Cognition

Methods: Dense Connections · Convolution · Q-Learning · Deep Q-Network · Experience Replay · ALIGN