Cross-Problem Learning for Solving Vehicle Routing Problems
Zhuoyi Lin, Yaoxin Wu, Bangjian Zhou, Zhiguang Cao, Wen Song, Yingqian, Zhang, Senthilnath Jayavelu

TL;DR
This paper introduces a cross-problem learning approach that pre-trains a Transformer model on the TSP and fine-tunes it for various vehicle routing problem variants, improving efficiency and performance.
Contribution
It proposes a modular neural architecture with pre-training and fine-tuning strategies for different VRP variants, enhancing transferability and efficiency.
Findings
Full fine-tuning outperforms training from scratch.
Adapter-based fine-tuning is parameter-efficient with comparable results.
Method improves cross-distribution application and versatility.
Abstract
Existing neural heuristics often train a deep architecture from scratch for each specific vehicle routing problem (VRP), ignoring the transferable knowledge across different VRP variants. This paper proposes the cross-problem learning to assist heuristics training for different downstream VRP variants. Particularly, we modularize neural architectures for complex VRPs into 1) the backbone Transformer for tackling the travelling salesman problem (TSP), and 2) the additional lightweight modules for processing problem-specific features in complex VRPs. Accordingly, we propose to pre-train the backbone Transformer for TSP, and then apply it in the process of fine-tuning the Transformer models for each target VRP variant. On the one hand, we fully fine-tune the trained backbone Transformer and problem-specific modules simultaneously. On the other hand, we only fine-tune small adapter networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods
MethodsDropout · Adam · Attention Is All You Need · Position-Wise Feed-Forward Layer · Layer Normalization · Linear Layer · Multi-Head Attention · Byte Pair Encoding · Absolute Position Encodings · Dense Connections
