Reinforced Lin-Kernighan-Helsgaun Algorithms for the Traveling Salesman Problems
Jiongzhi Zheng, Kun He, Jianrong Zhou, Yan Jin, Chu-Min Li

TL;DR
This paper introduces reinforcement learning-enhanced versions of the Lin-Kernighan-Helsgaun algorithms, significantly improving their performance on various TSP variants and benchmarks through adaptive decision-making.
Contribution
The authors propose VSR-LKH and VSR-LKH-3 algorithms that integrate reinforcement learning into LKH and LKH-3, enabling adaptive search strategies for complex TSP variants.
Findings
VSR-LKH outperforms traditional LKH on 236 benchmarks.
VSR-LKH-3 surpasses existing heuristics for TSP with time windows and colored TSP.
The methods demonstrate significant improvements in solution quality and efficiency.
Abstract
TSP is a classical NP-hard combinatorial optimization problem with many practical variants. LKH is one of the state-of-the-art local search algorithms for the TSP. LKH-3 is a powerful extension of LKH that can solve many TSP variants. Both LKH and LKH-3 associate a candidate set to each city to improve the efficiency, and have two different methods, -measure and POPMUSIC, to decide the candidate sets. In this work, we first propose a Variable Strategy Reinforced LKH (VSR-LKH) algorithm, which incorporates three reinforcement learning methods (Q-learning, Sarsa, Monte Carlo) with LKH, for the TSP. We further propose a new algorithm called VSR-LKH-3 that combines the variable strategy reinforcement learning method with LKH-3 for typical TSP variants, including the TSP with time windows (TSPTW) and Colored TSP (CTSP). The proposed algorithms replace the inflexible traversal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Constraint Satisfaction and Optimization · Auction Theory and Applications
MethodsSarsa
