Learning to Deliver: a Foundation Model for the Montreal Capacitated Vehicle Routing Problem
Samuel J. K. Chin, Matthias Winkenbach, Akash Srivastava

TL;DR
This paper introduces FM-MCVRP, a deep learning model based on a Transformer architecture, that effectively approximates solutions to the Montreal Capacitated Vehicle Routing Problem, outperforming traditional heuristics and generalizing well to larger instances.
Contribution
We propose a novel Transformer-based foundation model for MCVRP that leverages NLP techniques, trained on sub-optimal solutions, and demonstrates superior performance and generalization capabilities.
Findings
FM-MCVRP outperforms training data solutions.
Model solutions are within 2% of benchmarks for 400-customer problems.
The model generalizes to larger problem instances and various parameters.
Abstract
In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio et al. (2021), is defined on a fixed and finite graph, which is analogous to a city. Each MCVRP instance is essentially the sub-graph connecting a randomly sampled subset of the nodes in the fixed graph, which represent a set of potential addresses in a real-world delivery problem on a given day. Our work exploits this problem structure to frame the MCVRP as an analogous Natural Language Processing (NLP) task. Specifically, we leverage a Transformer architecture embedded in a Large Language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Softmax · Dense Connections · Label Smoothing
