Leveraging Untrustworthy Commands for Multi-Robot Coordination in   Unpredictable Environments: A Bandit Submodular Maximization Approach

Zirui Xu; Xiaofeng Lin; Vasileios Tzoumas

arXiv:2309.16161·eess.SY·October 13, 2023

Leveraging Untrustworthy Commands for Multi-Robot Coordination in Unpredictable Environments: A Bandit Submodular Maximization Approach

Zirui Xu, Xiaofeng Lin, Vasileios Tzoumas

PDF

Open Access

TL;DR

This paper introduces MetaBSG, a robust algorithm for multi-robot coordination that effectively leverages untrustworthy external commands by learning when to follow them or a submodular optimization algorithm, ensuring performance guarantees in unpredictable environments.

Contribution

The paper proposes MetaBSG, a novel meta-algorithm that adaptively combines external commands with a submodular maximization approach, providing performance guarantees even with unreliable inputs.

Findings

01

MetaBSG outperforms baseline methods in simulated multi-target tracking.

02

The algorithm effectively balances following commands and independent optimization.

03

Performance guarantees hold even when external commands are arbitrarily bad.

Abstract

We study the problem of multi-agent coordination in unpredictable and partially-observable environments with untrustworthy external commands. The commands are actions suggested to the robots, and are untrustworthy in that their performance guarantees, if any, are unknown. Such commands may be generated by human operators or machine learning algorithms and, although untrustworthy, can often increase the robots' performance in complex multi-robot tasks. We are motivated by complex multi-robot tasks such as target tracking, environmental mapping, and area monitoring. Such tasks are often modeled as submodular maximization problems due to the information overlap among the robots. We provide an algorithm, Meta Bandit Sequential Greedy (MetaBSG), which enjoys performance guarantees even when the external commands are arbitrarily bad. MetaBSG leverages a meta-algorithm to learn whether the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Reinforcement Learning in Robotics