Communication-Based Decomposition Mechanisms for Decentralized MDPs

Claudia V. Goldman; Shlomo Zilberstein

arXiv:1111.0065·cs.AI·November 2, 2011

TL;DR

This paper introduces a communication-based decomposition framework for decentralized MDPs, enabling efficient planning in multi-agent systems with costly communication, and provides algorithms with empirical validation.

Contribution

The paper develops the Dec-SMDP-Com framework, in which a decentralized MDP is decomposed into multiple single-agent problems and agents operate separately between communications, and proposes algorithms for optimal and goal-oriented communication strategies.

Findings

01. Polynomial-time algorithm for goal-oriented agent behaviors

02. Heuristic search converges to optimal decomposition

03. Empirical results demonstrate effective approximate solutions

Abstract

Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing, multi-robot coordination and information gathering scenarios can be formalized using this framework. However, finding the optimal solution in the general case is hard, limiting the applicability of recently developed algorithms. This paper provides a practical approach for solving decentralized control problems when communication among the decision makers is possible, but costly. We develop the notion of communication-based mechanism that allows us to decompose a decentralized MDP into multiple single-agent problems. In this framework, referred to as decentralized semi-Markov decision process with direct communication (Dec-SMDP-Com), agents operate separately between communications. We show that finding an…
