Bi-CL: A Reinforcement Learning Framework for Robots Coordination   Through Bi-level Optimization

Zechen Hu; Daigo Shishika; Xuesu Xiao; and Xuan Wang

arXiv:2404.14649·cs.RO·August 22, 2024

Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization

Zechen Hu, Daigo Shishika, Xuesu Xiao, and Xuan Wang

PDF

Open Access

TL;DR

This paper presents Bi-CL, a bi-level reinforcement learning framework for multi-robot coordination that improves learning efficiency and scalability through a novel optimization structure and alignment mechanism.

Contribution

It introduces a bi-level optimization approach within reinforcement learning for multi-robot systems, addressing information mismatch and enhancing training efficiency.

Findings

01

Bi-CL learns more efficiently than traditional methods.

02

Achieves comparable coordination performance to existing baselines.

03

Effective in route-based and graph-based multi-robot scenarios.

Abstract

In multi-robot systems, achieving coordinated missions remains a significant challenge due to the coupled nature of coordination behaviors and the lack of global information for individual robots. To mitigate these challenges, this paper introduces a novel approach, Bi-level Coordination Learning (Bi-CL), that leverages a bi-level optimization structure within a centralized training and decentralized execution paradigm. Our bi-level reformulation decomposes the original problem into a reinforcement learning level with reduced action space, and an imitation learning level that gains demonstrations from a global optimizer. Both levels contribute to improved learning efficiency and scalability. We note that robots' incomplete information leads to mismatches between the two levels of learning models. To address this, Bi-CL further integrates an alignment penalty mechanism, aiming to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Traffic control and management