DAN: Decentralized Attention-based Neural Network for the MinMax Multiple Traveling Salesman Problem
Yuhong Cao, Zhanhong Sun, Guillaume Sartoretti

TL;DR
This paper introduces DAN, a decentralized attention-based neural network using Transformer architecture and multi-agent reinforcement learning to efficiently solve large-scale MinMax mTSP instances, outperforming existing methods in solution quality and speed.
Contribution
DAN is a novel decentralized neural approach that scales to large mTSP instances, leveraging Transformer architecture and multi-agent RL for improved collaboration and efficiency.
Findings
DAN matches or outperforms state-of-the-art solvers on large-scale mTSP instances.
DAN maintains low planning times while solving instances with up to 1000 cities.
DAN exhibits enhanced agent collaboration compared to baseline methods.
Abstract
The multiple traveling salesman problem (mTSP) is a well-known NP-hard problem with numerous real-world applications. In particular, this work addresses MinMax mTSP, where the objective is to minimize the max tour length among all agents. Many robotic deployments require recomputing potentially large mTSP instances frequently, making the natural trade-off between computing time and solution quality of great importance. However, exact and heuristic algorithms become inefficient as the number of cities increases, due to their computational complexity. Encouraged by the recent developments in deep reinforcement learning (dRL), this work approaches the mTSP as a cooperative task and introduces DAN, a decentralized attention-based neural method that aims at tackling this key trade-off. In DAN, agents learn fully decentralized policies to collaboratively construct a tour, by predicting each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations · Vehicle Routing Optimization Methods
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Layer Normalization · Softmax · Label Smoothing · Byte Pair Encoding
