Parametrized Multi-Agent Routing via Deep Attention Models
Salar Basiri, Dhananjay Tiwari, Srinivasa M. Salapaka

TL;DR
This paper introduces a deep learning framework for multi-agent routing and facility location problems, achieving significant speedups and near-optimal solutions for complex NP-hard tasks.
Contribution
It presents a novel neural policy model, the Shortest Path Network, that efficiently approximates solutions for parametrized multi-agent routing problems, outperforming traditional methods.
Findings
Up to 100× faster policy inference and gradient computation.
Over 10× lower cost than metaheuristics.
Matches Gurobi's optimal cost with 1500× speedup.
Abstract
We propose a scalable deep learning framework for parametrized sequential decision-making (ParaSDM), where multiple agents jointly optimize discrete action policies and shared continuous parameters. A key subclass of this setting arises in Facility-Location and Path Optimization (FLPO), where multi-agent systems must simultaneously determine optimal routes and facility locations, aiming to minimize the cumulative transportation cost within the network. FLPO problems are NP-hard due to their mixed discrete-continuous structure and highly non-convex objective. To address this, we integrate the Maximum Entropy Principle (MEP) with a neural policy model called the Shortest Path Network (SPN)-a permutation-invariant encoder-decoder that approximates the MEP solution while enabling efficient gradient-based optimization over shared parameters. The SPN achieves up to 100 speedup in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
