Learning Multi-Object Positional Relationships via Emergent Communication
Yicheng Feng, Boshi An, and Zongqing Lu

TL;DR
This paper explores how emergent communication enables agents to understand and generalize multi-object positional relationships, demonstrating the potential for improved generalization and transfer learning in AI communication tasks.
Contribution
The study introduces a focus on emergent communication about multi-object positional relationships and shows how input variation affects generalization and transfer learning capabilities.
Findings
Language generalizes well to new multi-step tasks.
Pre-trained referential communication improves transfer performance.
Emergent communication effectively encodes positional relationships.
Abstract
The study of emergent communication has been dedicated to interactive artificial intelligence. While existing work focuses on communication about single objects or complex image scenes, we argue that communicating relationships between multiple objects is important in more realistic tasks, but understudied. In this paper, we try to fill this gap and focus on emergent communication about positional relationships between two objects. We train agents in the referential game where observations contain two objects, and find that generalization is the major problem when the positional relationship is involved. The key factor affecting the generalization ability of the emergent language is the input variation between Speaker and Listener, which is realized by a random image generator in our work. Further, we find that the learned language can generalize well in a new multi-step MDP task where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
