Emergence of Communication in an Interactive World with Consistent   Speakers

Ben Bogin; Mor Geva; Jonathan Berant

arXiv:1809.00549·cs.CL·March 26, 2019·30 cites

Emergence of Communication in an Interactive World with Consistent Speakers

Ben Bogin, Mor Geva, Jonathan Berant

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new environment and algorithm for training agents to develop communication from raw pixels, improving stability and context-independence over traditional policy gradient methods.

Contribution

It presents a novel environment and a representation-based training algorithm that enhances the emergence of consistent, context-independent communication in agents.

Findings

01

Our algorithm outperforms policy gradient in stability and performance.

02

The method increases context-independence in emergent communication.

03

Empirical results demonstrate improved communication consistency.

Abstract

Training agents to communicate with one another given task-based supervision only has attracted considerable attention recently, due to the growing interest in developing models for human-agent interaction. Prior work on the topic focused on simple environments, where training using policy gradient was feasible despite the non-stationarity of the agents during training. In this paper, we present a more challenging environment for testing the emergence of communication from raw pixels, where training using policy gradient fails. We propose a new model and training algorithm, that utilizes the structure of a learned representation space to produce more consistent speakers at the initial phases of training, which stabilizes learning. We empirically show that our algorithm substantially improves performance compared to policy gradient. We also propose a new alignment-based metric for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

benbogin/emergence-communication-cco
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications