Communicating Unexpectedness for Out-of-Distribution Multi-Agent   Reinforcement Learning

Min Whoo Lee; Kibeom Kim; Soo Wung Shin; Minsu Lee; Byoung-Tak Zhang

arXiv:2501.01140·cs.MA·January 3, 2025

Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

Min Whoo Lee, Kibeom Kim, Soo Wung Shin, Minsu Lee, Byoung-Tak Zhang

PDF

Open Access

TL;DR

This paper introduces Unexpected Encoding Scheme, a decentralized multi-agent reinforcement learning method where agents communicate surprising environmental aspects to improve adaptation to unforeseen situations, enhancing robustness in dynamic environments.

Contribution

The paper presents a novel communication approach for multi-agent RL that encodes unexpected environmental changes, enabling better adaptation to out-of-distribution scenarios.

Findings

01

Supports robust adaptation to dynamic environments

02

Improves out-of-distribution generalization in multi-agent settings

03

Demonstrates effectiveness in multi-robot warehouse tasks

Abstract

Applying multi-agent reinforcement learning methods to realistic settings is challenging as it may require the agents to quickly adapt to unexpected situations that are rarely or never encountered in training. Recent methods for generalization to such out-of-distribution settings are limited to more specific, restricted instances of distribution shifts. To tackle adaptation to distribution shifts, we propose Unexpected Encoding Scheme, a novel decentralized multi-agent reinforcement learning algorithm where agents communicate "unexpectedness," the aspects of the environment that are surprising. In addition to a message yielded by the original reward-driven communication, each agent predicts the next observation based on previous experience, measures the discrepancy between the prediction and the actually encountered observation, and encodes this discrepancy as a message. Experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications