Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT
Harshad Khadilkar

TL;DR
This paper introduces a hybrid method combining reinforcement learning, rollouts, and MAX-SAT to efficiently solve the capacitated vehicle routing problem with timing constraints, outperforming existing learning methods and metaheuristics.
Contribution
It presents a novel hybrid approach that balances solution quality and computation time, adaptable to various problem scales without retraining.
Findings
Produces solutions closer to optimal than existing learning methods
Achieves shorter computation times than metaheuristics
Generalizable to other combinatorial problems
Abstract
The vehicle routing problem is a well known class of NP-hard combinatorial optimisation problems in literature. Traditional solution methods involve either carefully designed heuristics, or time-consuming metaheuristics. Recent work in reinforcement learning has been a promising alternative approach, but has found it difficult to compete with traditional methods in terms of solution quality. This paper proposes a hybrid approach that combines reinforcement learning, policy rollouts, and a satisfiability solver to enable a tunable tradeoff between computation times and solution quality. Results on a popular public data set show that the algorithm is able to produce solutions closer to optimal levels than existing learning based approaches, and with shorter computation times than meta-heuristics. The approach requires minimal design effort and is able to solve unseen problems of arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Transportation and Mobility Innovations · Maritime Ports and Logistics
