Mixed Integer Linear Programming For Exact Finite-Horizon Planning In Decentralized Pomdps
Raghav Aras (INRIA Lorraine - LORIA), Alain Dutech (INRIA Lorraine -, LORIA), Fran\c{c}ois Charpillet (INRIA Lorraine - LORIA)

TL;DR
This paper introduces a novel MILP-based approach for exact finite-horizon planning in decentralized POMDPs, leveraging sequence-form policy representation to efficiently compute optimal joint-policies.
Contribution
It presents a new mathematical programming method that uses sequence-form policies and MILP formulation to solve Dec-Pomdp planning more efficiently than existing algorithms.
Findings
MILP approach solves benchmark problems faster
Sequence-form representation reduces problem complexity
Optimal policies are obtained directly from MILP solutions
Abstract
We consider the problem of finding an n-agent joint-policy for the optimal finite-horizon control of a decentralized Pomdp (Dec-Pomdp). This is a problem of very high complexity (NEXP-hard in n >= 2). In this paper, we propose a new mathematical programming approach for the problem. Our approach is based on two ideas: First, we represent each agent's policy in the sequence-form and not in the tree-form, thereby obtaining a very compact representation of the set of joint-policies. Second, using this compact representation, we solve this problem as an instance of combinatorial optimization for which we formulate a mixed integer linear program (MILP). The optimal solution of the MILP directly yields an optimal joint-policy for the Dec-Pomdp. Computational experience shows that formulating and solving the MILP requires significantly less time to solve benchmark Dec-Pomdp problems than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Control Systems Optimization · Optimization and Search Problems
