Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs

Alex DeWeese; Guannan Qu

arXiv:2506.04215·cs.MA·June 5, 2025

Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs

Alex DeWeese, Guannan Qu

PDF

Open Access

TL;DR

This paper introduces the Extended Cutoff Policy Class, a novel approach for locally interdependent multi-agent MDPs, enabling near-optimal decision-making under partial observability and overcoming limitations of previous policies.

Contribution

It presents the first non-trivial class of policies that are exponentially close to optimal, capable of remembering beyond local visibility, and addresses performance issues like Penalty Jittering.

Findings

01

Policies are exponentially close to optimal with respect to visibility.

02

The new policies outperform previous solutions in small and fixed visibility scenarios.

03

The approach guarantees fully observable joint optimal behavior under certain conditions.

Abstract

Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) are known to be NEXP-Complete and intractable to solve. However, for problems such as cooperative navigation, obstacle avoidance, and formation control, basic assumptions can be made about local visibility and local dependencies. The work DeWeese and Qu 2024 formalized these assumptions in the construction of the Locally Interdependent Multi-Agent MDP. In this setting, it establishes three closed-form policies that are tractable to compute in various situations and are exponentially close to optimal with respect to visibility. However, it is also shown that these solutions can have poor performance when the visibility is small and fixed, often getting stuck during simulations due to the so called "Penalty Jittering" phenomenon. In this work, we establish the Extended Cutoff Policy Class which is, to the best of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Autonomous Vehicle Technology and Safety