gComm: An environment for investigating generalization in Grounded   Language Acquisition

Rishi Hazra; Sonu Dixit

arXiv:2105.03943·cs.CL·May 21, 2021

gComm: An environment for investigating generalization in Grounded Language Acquisition

Rishi Hazra, Sonu Dixit

PDF

Open Access 1 Repo

TL;DR

gComm offers a versatile platform for studying how agents develop and generalize communication skills in grounded language tasks within a challenging 2D environment.

Contribution

It introduces a new environment with multi-modal agents and tools for analyzing communication strategies and their generalization in grounded language acquisition.

Findings

01

Supports research on communication development in agents

02

Facilitates evaluation of generalization in language use

03

Provides a realistic, challenging setting for grounded language tasks

Abstract

gComm is a step towards developing a robust platform to foster research in grounded language acquisition in a more challenging and realistic setting. It comprises a 2-d grid environment with a set of agents (a stationary speaker and a mobile listener connected via a communication channel) exposed to a continuous array of tasks in a partially observable setting. The key to solving these tasks lies in agents developing linguistic abilities and utilizing them for efficiently exploring the environment. The speaker and listener have access to information provided in different modalities, i.e. the speaker's input is a natural language instruction that contains the target and task specifications and the listener's input is its grid-view. Each must rely on the other to complete the assigned task, however, the only way they can achieve the same, is to develop and use some form of communication.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SonuDixit/gComm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Speech and dialogue systems