A Reinforcement Learning Approach to the Orienteering Problem with Time   Windows

Ricardo Gama; Hugo L. Fernandes

arXiv:2011.03647·cs.LG·July 1, 2021

A Reinforcement Learning Approach to the Orienteering Problem with Time Windows

Ricardo Gama, Hugo L. Fernandes

PDF

2 Repos

TL;DR

This paper introduces a reinforcement learning-based Pointer Network approach to solve the Orienteering Problem with Time Windows, demonstrating improved performance and generalization over heuristics in benchmark tests.

Contribution

It proposes a modified Pointer Network architecture trained with reinforcement learning specifically for OPTW, addressing dynamic time constraints and generalizing across different tourist scenarios.

Findings

01

Outperforms common heuristics on benchmark OPTW instances.

02

Generalizes well across different tourists and regions.

03

Computes solutions efficiently in realistic timeframes.

Abstract

The Orienteering Problem with Time Windows (OPTW) is a combinatorial optimization problem where the goal is to maximize the total score collected from different visited locations. The application of neural network models to combinatorial optimization has recently shown promising results in dealing with similar problems, like the Travelling Salesman Problem. A neural network allows learning solutions using reinforcement learning or supervised learning, depending on the available data. After the learning stage, it can be generalized and quickly fine-tuned to further improve performance and personalization. The advantages are evident since, for real-world applications, solution quality, personalization, and execution times are all important factors that should be taken into account. This study explores the use of Pointer Network models trained using reinforcement learning to solve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Softmax · Long Short-Term Memory · Tanh Activation · [LivE@PeRson]How do I talk to a real person at Expedia? · Pointer Network