Solving the Min-Max Multiple Traveling Salesmen Problem via Learning-Based Path Generation and Optimal Splitting
Wen Wang, Xiangchen Wu, Liang Wang, Hao Hu, Xianping Tao, Linghao Zhang

TL;DR
This paper introduces a novel reinforcement learning framework called Generate-and-Split (GaS) for efficiently solving the NP-hard Min-Max Multiple Traveling Salesmen Problem by jointly optimizing path generation and splitting.
Contribution
The paper proposes a joint training framework combining RL with an optimal splitting algorithm, improving solution quality and scalability for the m3-TSP.
Findings
GaS outperforms existing learning-based methods in solution quality.
The approach demonstrates high transferability across different problem instances.
The joint training enhances the consistency and effectiveness of the solution process.
Abstract
This study addresses the Min-Max Multiple Traveling Salesmen Problem (-TSP), which aims to coordinate tours for multiple salesmen such that the length of the longest tour is minimized. Due to its NP-hard nature, exact solvers become impractical under the assumption that . As a result, learning-based approaches have gained traction for their ability to rapidly generate high-quality approximate solutions. Among these, two-stage methods combine learning-based components with classical solvers, simplifying the learning objective. However, this decoupling often disrupts consistent optimization, potentially degrading solution quality. To address this issue, we propose a novel two-stage framework named \textbf{Generate-and-Split} (GaS), which integrates reinforcement learning (RL) with an optimal splitting algorithm in a joint training process. The splitting algorithm offers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
