Interpretation of Emergent Communication in Heterogeneous Collaborative   Embodied Agents

Shivansh Patel; Saim Wani; Unnat Jain; Alexander Schwing; Svetlana; Lazebnik; Manolis Savva; Angel X. Chang

arXiv:2110.05769·cs.CV·October 13, 2021

Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents

Shivansh Patel, Saim Wani, Unnat Jain, Alexander Schwing, Svetlana, Lazebnik, Manolis Savva, Angel X. Chang

PDF

TL;DR

This paper introduces the CoMON task to analyze how emergent communication between heterogeneous embodied AI agents can be grounded in perception and spatial structure, revealing interpretability in their communication patterns.

Contribution

It proposes a new collaborative navigation task and analyzes emergent communication, demonstrating grounding in perception and spatial environment structure.

Findings

01

Emergent communication is grounded in agent observations.

02

Communication patterns reflect spatial and egocentric perspectives.

03

Heterogeneous agents develop interpretable communication strategies.

Abstract

Communication between embodied AI agents has received increasing attention in recent years. Despite its use, it is still unclear whether the learned communication is interpretable and grounded in perception. To study the grounding of emergent forms of communication, we first introduce the collaborative multi-object navigation task CoMON. In this task, an oracle agent has detailed environment information in the form of a map. It communicates with a navigator agent that perceives the environment visually and is tasked to find a sequence of goals. To succeed at the task, effective communication is essential. CoMON hence serves as a basis to study different communication mechanisms between heterogeneous agents, that is, agents with different capabilities and roles. We study two common communication mechanisms and analyze their communication patterns through an egocentric and spatial lens.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.