Learning to Infer Belief Embedded Communication
Guo Ye, Han Liu, Biswa Sengupta

TL;DR
This paper presents IEC, a novel multi-agent communication algorithm that enables agents to learn implicit grammar and intentions through co-evolution, significantly improving collaboration efficiency in various environments.
Contribution
Introduction of IEC, a co-evolving perception and language generation model that enhances multi-agent communication and learning speed compared to existing algorithms.
Findings
IEC learns 50% faster than MADDPG.
Disabling belief inference reduces performance by 38%.
Disabling communication module reduces performance by 60%.
Abstract
In multi-agent collaboration problems with communication, an agent's ability to encode their intention and interpret other agents' strategies is critical for planning their future actions. This paper introduces a novel algorithm called Intention Embedded Communication (IEC) to mimic an agent's language learning ability. IEC contains a perception module for decoding other agents' intentions in response to their past actions. It also includes a language generation module for learning implicit grammar during communication with two or more agents. Such grammar, by construction, should be compact for efficient communication. Both modules undergo conjoint evolution - similar to an infant's babbling that enables it to learn a language of choice by trial and error. We utilised three multi-agent environments, namely predator/prey, traffic junction and level-based foraging and illustrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Adam · Batch Normalization · Dense Connections · Experience Replay · Convolution · Weight Decay · MADDPG
