Improving Generalization of Deep Reinforcement Learning-based TSP   Solvers

Wenbin Ouyang; Yisen Wang; Shaochen Han; Zhejian Jin; Paul Weng

arXiv:2110.02843·cs.LG·October 7, 2021

Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Wenbin Ouyang, Yisen Wang, Shaochen Han, Zhejian Jin, Paul Weng

PDF

Open Access

TL;DR

This paper introduces MAGIC, a novel deep reinforcement learning approach with a specialized architecture and training techniques, significantly improving the generalization and efficiency of TSP solvers on larger instances.

Contribution

MAGIC combines a new neural architecture and training method, including curriculum learning and local search, to enhance DRL-based TSP solvers' performance and generalization.

Findings

01

MAGIC outperforms existing DRL-based TSP solvers on random instances.

02

MAGIC generalizes better to larger TSP instances.

03

MAGIC is competitive with traditional heuristics in performance and speed.

Abstract

Recent work applying deep reinforcement learning (DRL) to solve traveling salesman problems (TSP) has shown that DRL-based solvers can be fast and competitive with TSP heuristics for small instances, but do not generalize well to larger instances. In this work, we propose a novel approach named MAGIC that includes a deep learning architecture and a DRL training method. Our architecture, which integrates a multilayer perceptron, a graph neural network, and an attention model, defines a stochastic policy that sequentially generates a TSP solution. Our training method includes several innovations: (1) we interleave DRL policy gradient updates with local search (using a new local search technique), (2) we use a novel simple baseline, and (3) we apply curriculum learning. Finally, we empirically demonstrate that MAGIC is superior to other DRL-based methods on random TSP instances, both in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Metaheuristic Optimization Algorithms Research · Reinforcement Learning in Robotics