Multi-armed Bandit and Backbone boost Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problems
Long Wang, Jiongzhi Zheng, Zhengda Xiong, Kun He

TL;DR
This paper enhances the LKH heuristic for TSP by integrating dynamic backbone information and a multi-armed bandit model to adaptively select evaluation metrics, leading to significant performance improvements across various problem variants.
Contribution
It introduces a novel dynamic backbone extraction method and a multi-armed bandit framework to improve the guidance of local search algorithms for TSP and VRP variants.
Findings
Significantly improves LKH performance on TSP and VRP variants.
Demonstrates strong generalization across multiple problem types.
Outperforms existing methods in solution quality and efficiency.
Abstract
The Lin-Kernighan-Helsguan (LKH) heuristic is a classic local search algorithm for the Traveling Salesman Problem (TSP). LKH introduces an -value to replace the traditional distance metric for evaluating the edge quality, which leads to a significant improvement. However, we observe that the -value does not make full use of the historical information during the search, and single guiding information often makes LKH hard to escape from some local optima. To address the above issues, we propose a novel way to extract backbone information during the TSP local search process, which is dynamic and can be updated once a local optimal solution is found. We further propose to combine backbone information, -value, and distance to evaluate the edge quality so as to guide the search. Moreover, we abstract their different combinations to arms in a multi-armed bandit (MAB)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research
