Stochastic Prize-Collecting Games: Strategic Planning in Multi-Robot Systems
Malintha Fernando, Petter \"Ogren, Silun Zhang

TL;DR
This paper introduces Stochastic Prize-Collecting Games (SPCG), a multi-robot planning framework that accounts for self-interest, stochasticity, and energy constraints, with algorithms and empirical results demonstrating near-optimal solutions.
Contribution
It extends the Team Orienteering Problem to competitive multi-robot settings with stochastic rewards, providing theoretical analysis and novel algorithms for strategic planning.
Findings
Unique pure Nash equilibrium aligns with optimal TOP routing.
Algorithms scale efficiently to large teams and handle imbalanced rewards.
Policies achieve 87-95% of optimal solutions in experiments.
Abstract
The Team Orienteering Problem (TOP) generalizes many real-world multi-robot scheduling and routing tasks that occur in autonomous mobility, aerial logistics, and surveillance applications. While many flavors of the TOP exist for planning in multi-robot systems, they assume that all the robots cooperate toward a single objective; thus, they do not extend to settings where the robots compete in reward-scarce environments. We propose Stochastic Prize-Collecting Games (SPCG) as an extension of the TOP to plan in the presence of self-interested robots operating on a graph, under energy constraints and stochastic transitions. A theoretical study on complete and star graphs establishes that there is a unique pure Nash equilibrium in SPCGs that coincides with the optimal routing solution of an equivalent TOP given a rank-based conflict resolution rule. This work proposes two algorithms: Ordinal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
