Adaptive Value Decomposition: Coordinating a Varying Number of Agents in Urban Systems
Yexin Li, Jinjin Guo, Haoyu Zhang, Yuhan Zhao, Yiwen Sun, Zihao Jiao

TL;DR
This paper introduces Adaptive Value Decomposition, a novel MARL framework designed to handle dynamic agent populations and asynchronous actions in urban systems, improving coordination and diversity.
Contribution
It proposes a new adaptive MARL method that manages varying agent numbers and mitigates policy-induced action homogenization, enhancing urban system coordination.
Findings
AVD outperforms existing methods in bike-sharing redistribution tasks.
The framework effectively adapts to changing agent populations.
It maintains behavioral diversity and coordination quality.
Abstract
Multi-agent reinforcement learning (MARL) provides a promising paradigm for coordinating multi-agent systems (MAS). However, most existing methods rely on restrictive assumptions, such as a fixed number of agents and fully synchronous action execution. These assumptions are often violated in urban systems, where the number of active agents varies over time, and actions may have heterogeneous durations, resulting in a semi-MARL setting. Moreover, while sharing policy parameters among agents is commonly adopted to improve learning efficiency, it can lead to highly homogeneous actions when a subset of agents make decisions concurrently under similar observations, potentially degrading coordination quality. To address these challenges, we propose Adaptive Value Decomposition (AVD), a cooperative MARL framework that adapts to a dynamically changing agent population. AVD further incorporates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations · Urban Transport and Accessibility
