Learning to Iteratively Solve Routing Problems with Dual-Aspect Collaborative Transformer
Yining Ma, Jingwen Li, Zhiguang Cao, Wen Song, Le Zhang, Zhenghua, Chen, Jing Tang

TL;DR
This paper introduces a Dual-Aspect Collaborative Transformer (DACT) that separately learns node and positional embeddings with a cyclic encoding, improving VRP solution quality and generalization over existing Transformer models.
Contribution
The paper proposes a novel DACT architecture with separate embeddings and cyclic positional encoding, enhancing VRP solving capabilities and generalization.
Findings
DACT outperforms existing Transformer-based models on TSP and CVRP.
DACT exhibits superior generalization across different problem sizes.
The cyclic positional encoding effectively captures VRP solution symmetry.
Abstract
Recently, Transformer has become a prevailing deep architecture for solving vehicle routing problems (VRPs). However, it is less effective in learning improvement models for VRP because its positional encoding (PE) method is not suitable in representing VRP solutions. This paper presents a novel Dual-Aspect Collaborative Transformer (DACT) to learn embeddings for the node and positional features separately, instead of fusing them together as done in existing ones, so as to avoid potential noises and incompatible correlations. Moreover, the positional features are embedded through a novel cyclic positional encoding (CPE) method to allow Transformer to effectively capture the circularity and symmetry of VRP solutions (i.e., cyclic sequences). We train DACT using Proximal Policy Optimization and design a curriculum learning strategy for better sample efficiency. We apply DACT to solve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsVehicle Routing Optimization Methods · Metaheuristic Optimization Algorithms Research · Optimization and Packing Problems
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Dropout · Dense Connections · Label Smoothing · Softmax · Residual Connection
