Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP
Leo Ardon

TL;DR
This paper explores the application of Reinforcement Learning to the Capacitated Vehicle Routing Problem, demonstrating its advantages in speed, versatility, and ability to generalize to unseen instances, despite not always finding the optimal solution.
Contribution
The paper formalizes CVRP within the RL framework and compares RL approaches with traditional methods, highlighting their speed and adaptability for complex problems.
Findings
RL approaches are faster than traditional solvers.
RL models can generalize to unseen problem instances.
RL offers a versatile framework for complex combinatorial problems.
Abstract
In this paper, we evaluate the use of Reinforcement Learning (RL) to solve a classic combinatorial optimization problem: the Capacitated Vehicle Routing Problem (CVRP). We formalize this problem in the RL framework and compare two of the most promising RL approaches with traditional solving techniques on a set of benchmark instances. We measure the different approaches with the quality of the solution returned and the time required to return it. We found that despite not returning the best solution, the RL approach has many advantages over traditional solvers. First, the versatility of the framework allows the resolution of more complex combinatorial problems. Moreover, instead of trying to solve a specific instance of the problem, the RL algorithm learns the skills required to solve the problem. The trained policy can then quasi instantly provide a solution to an unseen problem without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods
