Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs
Alex DeWeese, Guannan Qu

TL;DR
This paper introduces the Extended Cutoff Policy Class, a novel approach for locally interdependent multi-agent MDPs, enabling near-optimal decision-making under partial observability and overcoming limitations of previous policies.
Contribution
It presents the first non-trivial class of policies that are exponentially close to optimal, capable of remembering beyond local visibility, and addresses performance issues like Penalty Jittering.
Findings
Policies are exponentially close to optimal with respect to visibility.
The new policies outperform previous solutions in small and fixed visibility scenarios.
The approach guarantees fully observable joint optimal behavior under certain conditions.
Abstract
Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) are known to be NEXP-Complete and intractable to solve. However, for problems such as cooperative navigation, obstacle avoidance, and formation control, basic assumptions can be made about local visibility and local dependencies. The work DeWeese and Qu 2024 formalized these assumptions in the construction of the Locally Interdependent Multi-Agent MDP. In this setting, it establishes three closed-form policies that are tractable to compute in various situations and are exponentially close to optimal with respect to visibility. However, it is also shown that these solutions can have poor performance when the visibility is small and fixed, often getting stuck during simulations due to the so called "Penalty Jittering" phenomenon. In this work, we establish the Extended Cutoff Policy Class which is, to the best of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Autonomous Vehicle Technology and Safety
