Cooperative Control of Mobile Robots with Stackelberg Learning

Joewie J. Koh; Guohui Ding; Christoffer Heckman; Lijun Chen,; Alessandro Roncone

arXiv:2008.00679·cs.RO·May 7, 2021

Cooperative Control of Mobile Robots with Stackelberg Learning

Joewie J. Koh, Guohui Ding, Christoffer Heckman, Lijun Chen,, Alessandro Roncone

PDF

TL;DR

This paper introduces SLiCC, a novel deep reinforcement learning method for multi-robot cooperation that models decision-making as Stackelberg bimatrix games to improve collective utility.

Contribution

SLiCC is the first approach to integrate Stackelberg game theory with deep reinforcement learning for cooperative multi-robot control.

Findings

01

SLiCC outperforms centralized multi-agent Q-learning in a bi-robot transportation task.

02

SLiCC effectively models asymmetric agent preferences and capabilities.

03

The method achieves higher combined utility in cooperative scenarios.

Abstract

Multi-robot cooperation requires agents to make decisions that are consistent with the shared goal without disregarding action-specific preferences that might arise from asymmetry in capabilities and individual objectives. To accomplish this goal, we propose a method named SLiCC: Stackelberg Learning in Cooperative Control. SLiCC models the problem as a partially observable stochastic game composed of Stackelberg bimatrix games, and uses deep reinforcement learning to obtain the payoff matrices associated with these games. Appropriate cooperative actions are then selected with the derived Stackelberg equilibria. Using a bi-robot cooperative object transportation problem, we validate the performance of SLiCC against centralized multi-agent Q-learning and demonstrate that SLiCC achieves better combined utility.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning