Parametrized Multi-Agent Routing via Deep Attention Models

Salar Basiri; Dhananjay Tiwari; Srinivasa M. Salapaka

arXiv:2507.22338·cs.LG·July 31, 2025

Parametrized Multi-Agent Routing via Deep Attention Models

Salar Basiri, Dhananjay Tiwari, Srinivasa M. Salapaka

PDF

TL;DR

This paper introduces a deep learning framework for multi-agent routing and facility location problems, achieving significant speedups and near-optimal solutions for complex NP-hard tasks.

Contribution

It presents a novel neural policy model, the Shortest Path Network, that efficiently approximates solutions for parametrized multi-agent routing problems, outperforming traditional methods.

Findings

01

Up to 100× faster policy inference and gradient computation.

02

Over 10× lower cost than metaheuristics.

03

Matches Gurobi's optimal cost with 1500× speedup.

Abstract

We propose a scalable deep learning framework for parametrized sequential decision-making (ParaSDM), where multiple agents jointly optimize discrete action policies and shared continuous parameters. A key subclass of this setting arises in Facility-Location and Path Optimization (FLPO), where multi-agent systems must simultaneously determine optimal routes and facility locations, aiming to minimize the cumulative transportation cost within the network. FLPO problems are NP-hard due to their mixed discrete-continuous structure and highly non-convex objective. To address this, we integrate the Maximum Entropy Principle (MEP) with a neural policy model called the Shortest Path Network (SPN)-a permutation-invariant encoder-decoder that approximates the MEP solution while enabling efficient gradient-based optimization over shared parameters. The SPN achieves up to 100 $\times$ speedup in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.