Learning Generalizable Models for Vehicle Routing Problems via Knowledge Distillation
Jieyi Bi, Yining Ma, Jiahai Wang, Zhiguang Cao, Jinbiao Chen, Yuan, Sun, Yeow Meng Chee

TL;DR
This paper introduces an Adaptive Multi-Distribution Knowledge Distillation scheme to improve the cross-distribution generalization of neural models for vehicle routing problems, achieving competitive results with less computation.
Contribution
It proposes a novel knowledge distillation method that leverages multiple teachers and an adaptive strategy to enhance model generalization across diverse distributions.
Findings
Achieves competitive performance on unseen distributions
Outperforms baseline neural methods in generalization
Consumes less computational resources during inference
Abstract
Recent neural methods for vehicle routing problems always train and test the deep models on the same instance distribution (i.e., uniform). To tackle the consequent cross-distribution generalization concerns, we bring the knowledge distillation to this field and propose an Adaptive Multi-Distribution Knowledge Distillation (AMDKD) scheme for learning more generalizable deep models. Particularly, our AMDKD leverages various knowledge from multiple teachers trained on exemplar distributions to yield a light-weight yet generalist student model. Meanwhile, we equip AMDKD with an adaptive strategy that allows the student to concentrate on difficult distributions, so as to absorb hard-to-master knowledge more effectively. Extensive experimental results show that, compared with the baseline neural methods, our AMDKD is able to achieve competitive results on both unseen in-distribution and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsVehicle License Plate Recognition · Vehicle Routing Optimization Methods
MethodsTest · Knowledge Distillation
