CycleFormer : TSP Solver Based on Language Modeling
Jieun Yook, Junpyo Seo, Joon Huh, Han Joon Byun, and Byung-ro Moon

TL;DR
CycleFormer is a novel transformer-based model specifically designed for the Traveling Salesman Problem, incorporating TSP-specific features like dynamic node sets and cyclic positional encoding, significantly improving solution quality over previous models.
Contribution
The paper introduces CycleFormer, a transformer model tailored for TSP that fully integrates TSP characteristics, outperforming existing models in solution optimality.
Findings
Outperforms state-of-the-art transformer models for TSP.
Reduces optimality gap by approximately 2.8 times on TSP-500.
Effectively incorporates TSP-specific positional encodings.
Abstract
We propose a new transformer model for the Traveling Salesman Problem (TSP) called CycleFormer. We identified distinctive characteristics that need to be considered when applying a conventional transformer model to TSP and aimed to fully incorporate these elements into the TSP-specific transformer. Unlike the token sets in typical language models, which are limited and static, the token (node) set in TSP is unlimited and dynamic. To exploit this fact to the fullest, we equated the encoder output with the decoder linear layer and directly connected the context vector of the encoder to the decoder encoding. Additionally, we added a positional encoding to the encoder tokens that reflects the two-dimensional nature of TSP, and devised a circular positional encoding for the decoder tokens that considers the cyclic properties of a tour. By incorporating these ideas, CycleFormer outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
MethodsSparse Evolutionary Training · Linear Layer
