Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem
Michael Li, Eric Bae, Christopher Haberland, Natasha Jaques

TL;DR
This paper introduces COGS, a generative sampling approach for training neural networks to solve the Traveling Salesman Problem, improving their robustness and generalization to realistic and challenging distributions.
Contribution
The paper proposes COGS, a novel generative sampling method for training neural TSP solvers, and introduces TSPLib50, a dataset for testing real-world distribution generalization.
Findings
COGS enhances neural TSP solver robustness to distribution shifts.
Neural models trained with COGS perform better on worst-case scenarios.
TSPLib50 effectively evaluates real-world generalization of TSP solvers.
Abstract
The Traveling Salesman Problem (TSP) is a classic NP-hard combinatorial optimization task with numerous practical applications. Classic heuristic solvers can attain near-optimal performance for small problem instances, but become computationally intractable for larger problems. Real-world logistics problems such as dynamically re-routing last-mile deliveries demand a solver with fast inference time, which has led researchers to investigate specialized neural network solvers. However, neural networks struggle to generalize beyond the synthetic data they were trained on. In particular, we show that there exist TSP distributions that are realistic in practice, which also consistently lead to poor worst-case performance for existing neural approaches. To address this issue of distribution robustness, we present Combinatorial Optimization with Generative Sampling (COGS), where training data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Advanced Multi-Objective Optimization Algorithms · Metaheuristic Optimization Algorithms Research
