A First Guess is Rarely the Final Answer: Learning to Search in the Traveling Salesperson Problem
Andoni Irazusta Garmendia

TL;DR
This paper introduces NICO-TSP, a neural 2-opt improvement framework for the Traveling Salesperson Problem, which learns to enhance solutions efficiently and outperforms prior methods in generalization and step efficiency.
Contribution
NICO-TSP is a novel 2-opt improvement method that aligns with local search mechanics, combining imitation learning and reinforcement learning for better TSP solution refinement.
Findings
NICO-TSP outperforms prior learned and heuristic search methods in improvement quality.
It generalizes more reliably to larger, out-of-distribution instances.
NICO-TSP serves as both a competitive local search replacement and a powerful refinement module.
Abstract
Most neural solvers for the Traveling Salesperson Problem (TSP) are trained to output a single solution, even though practitioners rarely stop there: at test time, they routinely spend extra compute on sampling or post-hoc search. This raises a natural question: can the search procedure itself be learned? Neural improvement methods take this perspective by learning a policy that applies local modifications to a candidate solution, accumulating gains over an improvement trajectory. Yet learned improvement for TSP remains comparatively immature, with existing methods still falling short of robust, scalable performance. We argue that a key reason is design mismatch: many approaches reuse state representations, architectural choices, and training recipes inherited from single-solution methods, rather than being built around the mechanics of local search. This mismatch motivates NICO-TSP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
