Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization
Zechen Hu, Daigo Shishika, Xuesu Xiao, and Xuan Wang

TL;DR
This paper presents Bi-CL, a bi-level reinforcement learning framework for multi-robot coordination that improves learning efficiency and scalability through a novel optimization structure and alignment mechanism.
Contribution
It introduces a bi-level optimization approach within reinforcement learning for multi-robot systems, addressing information mismatch and enhancing training efficiency.
Findings
Bi-CL learns more efficiently than traditional methods.
Achieves comparable coordination performance to existing baselines.
Effective in route-based and graph-based multi-robot scenarios.
Abstract
In multi-robot systems, achieving coordinated missions remains a significant challenge due to the coupled nature of coordination behaviors and the lack of global information for individual robots. To mitigate these challenges, this paper introduces a novel approach, Bi-level Coordination Learning (Bi-CL), that leverages a bi-level optimization structure within a centralized training and decentralized execution paradigm. Our bi-level reformulation decomposes the original problem into a reinforcement learning level with reduced action space, and an imitation learning level that gains demonstrations from a global optimizer. Both levels contribute to improved learning efficiency and scalability. We note that robots' incomplete information leads to mismatches between the two levels of learning models. To address this, Bi-CL further integrates an alignment penalty mechanism, aiming to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Traffic control and management
