Multi-agent Multi-target Path Planning in Markov Decision Processes
Farhad Nawaz, Melkior Ornik

TL;DR
This paper introduces efficient algorithms for multi-agent, multi-target path planning in Markov decision processes, balancing optimality and computational complexity, with proven performance in various scenarios.
Contribution
It presents a polynomial-time suboptimal algorithm for multi-target path planning in MDPs and a target partitioning method with optimality guarantees for clustered targets.
Findings
Algorithms outperform heuristics in speed and optimality.
Proven optimality for certain MDP classes.
Effective in gridworld and ocean-inspired environments.
Abstract
Missions for autonomous systems often require agents to visit multiple targets in complex operating conditions. This work considers the problem of visiting a set of targets in minimum time by a team of non-communicating agents in a Markov decision process (MDP). The single-agent problem is at least NP-complete by reducing it to a Hamiltonian path problem. We first discuss an optimal algorithm based on Bellman's optimality equation that is exponential in the number of target states. Then, we trade-off optimality for time complexity by presenting a suboptimal algorithm that is polynomial at each time step. We prove that the proposed algorithm generates optimal policies for certain classes of MDPs. Extending our procedure to the multi-agent case, we propose a target partitioning algorithm that approximately minimizes the expected time to visit the targets. We prove that our algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Optimization and Search Problems
