Learning to Request Guidance in Emergent Communication

Benjamin Kolb; Leon Lang; Henning Bartsch; Arwin Gansekoele; Raymond; Koopmanschap; Leonardo Romor; David Speck; Mathijs Mul; Elia Bruni

arXiv:1912.05525·cs.AI·December 12, 2019·1 cites

Learning to Request Guidance in Emergent Communication

Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond, Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni

PDF

Open Access

TL;DR

This paper introduces a bidirectional communication framework where an agent can request guidance from a pre-trained guide during learning, leading to adaptive help-seeking behavior based on uncertainty, which improves learning efficiency.

Contribution

It extends previous unidirectional emergent communication models by enabling the agent to ask for help, with a gating mechanism that reduces guidance requests over time.

Findings

01

Guidance requests decrease as training progresses.

02

Requests are made primarily in high-uncertainty situations.

03

The gating mechanism effectively controls guidance flow.

Abstract

Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent. The guide achieves this by providing the agent with discrete messages in an emerged language about how to solve the task. We extend this one-directional communication by a one-bit communication channel from the learner back to the guide: It is able to ask the guide for help, and we limit the guidance by penalizing the learner for these requests. During training, the agent learns to control this gate based on its current observation. We find that the amount of requested guidance decreases over time and guidance is requested in situations of high uncertainty. We investigate the agent's performance in cases of open and closed gates and discuss potential motives for the observed gating behavior.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Language and cultural evolution · Evolutionary Algorithms and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings