Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep   Reinforcement Learning

Paulo R. de O. da Costa; Jason Rhuggenaath; Yingqian Zhang; Alp Akcay

arXiv:2004.01608·cs.LG·September 15, 2020·71 cites

Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning

Paulo R. de O. da Costa, Jason Rhuggenaath, Yingqian Zhang, Alp Akcay

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep reinforcement learning approach to learn 2-opt local search heuristics for the Traveling Salesman Problem, improving solutions efficiently over initial guesses and surpassing previous deep learning methods.

Contribution

It presents a novel policy gradient method with a pointing attention mechanism to learn 2-opt heuristics, extending to more general k-opt moves.

Findings

01

Learned policies improve solutions over random initial solutions.

02

Approach approaches near-optimal solutions faster than prior deep learning methods.

03

Policy can be extended to more complex k-opt moves.

Abstract

Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learning construction heuristics. Such approaches find TSP solutions of good quality but require additional procedures such as beam search and sampling to improve solutions and achieve state-of-the-art performance. However, few studies have focused on improvement heuristics, where a given solution is improved until reaching a near-optimal one. In this work, we propose to learn a local search heuristic based on 2-opt operators via deep reinforcement learning. We propose a policy gradient algorithm to learn a stochastic policy that selects 2-opt operations given a current solution. Moreover, we introduce a policy neural network that leverages a pointing attention mechanism, which unlike previous works, can be easily extended to more general k-opt moves. Our results show that the learned policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

paulorocosta/learning-2opt-drl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Metaheuristic Optimization Algorithms Research · Transportation and Mobility Innovations