Evaluating Generalization and Transfer Capacity of Multi-Agent Reinforcement Learning Across Variable Number of Agents
Bengisu Guresti, Nazim Kemal Ure

TL;DR
This paper investigates how multi-agent reinforcement learning models trained with a fixed number of agents generalize and transfer to different agent counts, showing training with fewer agents can sometimes outperform training with more.
Contribution
It introduces an analysis of the transfer and generalization capabilities of MARL models across varying agent numbers, highlighting the potential for more scalable training strategies.
Findings
Training with fewer agents can yield comparable or better performance.
Optimal training agent count may differ from target deployment count.
Transfer across many agents can be more efficient than scaling up during training.
Abstract
Multi-agent Reinforcement Learning (MARL) problems often require cooperation among agents in order to solve a task. Centralization and decentralization are two approaches used for cooperation in MARL. While fully decentralized methods are prone to converge to suboptimal solutions due to partial observability and nonstationarity, the methods involving centralization suffer from scalability limitations and lazy agent problem. Centralized training decentralized execution paradigm brings out the best of these two approaches; however, centralized training still has an upper limit of scalability not only for acquired coordination performance but also for model size and training time. In this work, we adopt the centralized training with decentralized execution paradigm and investigate the generalization and transfer capacity of the trained models across variable number of agents. This capacity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Mosquito-borne diseases and control
