TL;DR
This paper introduces a scalable decision-theoretic planning method for open, multiagent systems where agents cannot communicate and the number of agents varies, enabling effective reasoning about others' presence and actions.
Contribution
The authors propose a novel scalable approach that models a few agents to predict many, extending Monte Carlo tree search for open multiagent environments with theoretical performance guarantees.
Findings
Effective in wildfire suppression simulations
Outperforms baseline methods
Provides theoretical bounds on extrapolation error
Abstract
In open agent systems, the set of agents that are cooperating or competing changes over time and in ways that are nontrivial to predict. For example, if collaborative robots were tasked with fighting wildfires, they may run out of suppressants and be temporarily unavailable to assist their peers. We consider the problem of planning in these contexts with the additional challenges that the agents are unable to communicate with each other and that there are many of them. Because an agent's optimal action depends on the actions of others, each agent must not only predict the actions of its peers, but, before that, reason whether they are even present to perform an action. Addressing openness thus requires agents to model each other's presence, which becomes computationally intractable with high numbers of agents. We present a novel, principled, and scalable method in this context that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
