Policy Synthesis for Factored MDPs with Graph Temporal Logic   Specifications

Murat Cubuktepe; Zhe Xu; Ufuk Topcu

arXiv:2001.09066·cs.MA·January 27, 2020·6 cites

Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications

Murat Cubuktepe, Zhe Xu, Ufuk Topcu

PDF

Open Access

TL;DR

This paper presents a distributed algorithm for synthesizing policies in multi-agent systems with spatial-temporal tasks, leveraging factored MDPs and graph temporal logic to enable scalable solutions.

Contribution

It introduces a novel distributed synthesis method that decomposes complex multi-agent policy problems into smaller, agent-specific problems with linear runtime in the number of agents.

Findings

01

Algorithm scales to hundreds of agents

02

Runs in linear time relative to total agents

03

Exponential complexity only in local neighborhood size

Abstract

We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Gene Regulatory Network Analysis · Reinforcement Learning in Robotics