Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem

Michael Li; Eric Bae; Christopher Haberland; Natasha Jaques

arXiv:2508.08718·cs.LG·August 13, 2025

Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem

Michael Li, Eric Bae, Christopher Haberland, Natasha Jaques

PDF

Open Access

TL;DR

This paper introduces COGS, a generative sampling approach for training neural networks to solve the Traveling Salesman Problem, improving their robustness and generalization to realistic and challenging distributions.

Contribution

The paper proposes COGS, a novel generative sampling method for training neural TSP solvers, and introduces TSPLib50, a dataset for testing real-world distribution generalization.

Findings

01

COGS enhances neural TSP solver robustness to distribution shifts.

02

Neural models trained with COGS perform better on worst-case scenarios.

03

TSPLib50 effectively evaluates real-world generalization of TSP solvers.

Abstract

The Traveling Salesman Problem (TSP) is a classic NP-hard combinatorial optimization task with numerous practical applications. Classic heuristic solvers can attain near-optimal performance for small problem instances, but become computationally intractable for larger problems. Real-world logistics problems such as dynamically re-routing last-mile deliveries demand a solver with fast inference time, which has led researchers to investigate specialized neural network solvers. However, neural networks struggle to generalize beyond the synthetic data they were trained on. In particular, we show that there exist TSP distributions that are realistic in practice, which also consistently lead to poor worst-case performance for existing neural approaches. To address this issue of distribution robustness, we present Combinatorial Optimization with Generative Sampling (COGS), where training data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Advanced Multi-Objective Optimization Algorithms · Metaheuristic Optimization Algorithms Research