gComm: An environment for investigating generalization in Grounded Language Acquisition
Rishi Hazra, Sonu Dixit

TL;DR
gComm offers a versatile platform for studying how agents develop and generalize communication skills in grounded language tasks within a challenging 2D environment.
Contribution
It introduces a new environment with multi-modal agents and tools for analyzing communication strategies and their generalization in grounded language acquisition.
Findings
Supports research on communication development in agents
Facilitates evaluation of generalization in language use
Provides a more challenging and realistic setting for grounded language tasks: a partially observable 2-d grid
Abstract
gComm is a step towards developing a robust platform to foster research in grounded language acquisition in a more challenging and realistic setting. It comprises a 2-d grid environment with a set of agents (a stationary speaker and a mobile listener connected via a communication channel) exposed to a continuous array of tasks in a partially observable setting. The key to solving these tasks lies in agents developing linguistic abilities and utilizing them to explore the environment efficiently. The speaker and listener have access to information provided in different modalities: the speaker's input is a natural language instruction that contains the target and task specifications, and the listener's input is its grid-view. Each must rely on the other to complete the assigned task; however, the only way they can do so is to develop and use some form of communication.…
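The speaker-listener split described in the abstract can be illustrated with a toy sketch. This is not gComm's API: the names `speaker`, `listener`, `Grid`, `COLORS`, and `SHAPES` are hypothetical, the fixed lookup table stands in for what would be a learned communication protocol, and only the target (not the task) part of the instruction is encoded. It shows only the information flow: the speaker reads text and never sees the grid, while the listener sees a local grid-view and never sees the text.

```python
from dataclasses import dataclass

# Hypothetical vocabulary for illustration; not gComm's instruction grammar.
COLORS = ["red", "blue"]
SHAPES = ["circle", "square"]


def speaker(instruction):
    """Stationary speaker: turns a text instruction into a discrete message.

    A fixed lookup is used here; in a learning setup this mapping
    would be trained rather than hand-written.
    """
    words = instruction.lower().split()
    color = next(i for i, c in enumerate(COLORS) if c in words)
    shape = next(i for i, s in enumerate(SHAPES) if s in words)
    return (color, shape)


@dataclass
class Grid:
    size: int
    objects: dict  # (row, col) -> (color_index, shape_index)

    def view(self, pos, radius=1):
        """Partial observation: objects within `radius` cells of `pos`."""
        r, c = pos
        return {
            p: o for p, o in self.objects.items()
            if abs(p[0] - r) <= radius and abs(p[1] - c) <= radius
        }


def listener(grid, message):
    """Mobile listener: sweeps the grid in a snake pattern until an object
    matching the message appears in its local view.

    Returns (target_position, cells_left_behind), or (None, total_cells)
    if nothing matches.
    """
    steps = 0
    for r in range(grid.size):
        cols = range(grid.size) if r % 2 == 0 else range(grid.size - 1, -1, -1)
        for c in cols:
            for p, o in grid.view((r, c)).items():
                if o == message:
                    return p, steps
            steps += 1
    return None, steps
```

For example, with a blue square placed at `(1, 2)` on a 4x4 grid, `listener(grid, speaker("walk to the blue square"))` locates it from the message alone, without any access to the instruction text.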
Taxonomy
Topics: Multimodal Machine Learning Applications · Topic Modeling · Speech and dialogue systems
