TL;DR
This paper introduces a reinforcement learning-based Pointer Network approach to solve the Orienteering Problem with Time Windows, demonstrating improved performance and generalization over heuristics in benchmark tests.
Contribution
It proposes a modified Pointer Network architecture trained with reinforcement learning specifically for OPTW, addressing dynamic time constraints and generalizing across different tourist scenarios.
Findings
Outperforms common heuristics on benchmark OPTW instances.
Generalizes well across different tourists and regions.
Computes solutions efficiently in realistic timeframes.
Abstract
The Orienteering Problem with Time Windows (OPTW) is a combinatorial optimization problem where the goal is to maximize the total score collected from different visited locations. The application of neural network models to combinatorial optimization has recently shown promising results in dealing with similar problems, like the Travelling Salesman Problem. A neural network allows learning solutions using reinforcement learning or supervised learning, depending on the available data. After the learning stage, it can be generalized and quickly fine-tuned to further improve performance and personalization. The advantages are evident since, for real-world applications, solution quality, personalization, and execution times are all important factors that should be taken into account. This study explores the use of Pointer Network models trained using reinforcement learning to solve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Softmax · Long Short-Term Memory · Tanh Activation · [LivE@PeRson]How do I talk to a real person at Expedia? · Pointer Network
