UniCO: Towards a Unified Model for Combinatorial Optimization Problems
Zefang Zong, Xiaochen Wei, Guozhen Zhang, Chen Gao, Huandong Wang, Yong Li

TL;DR
UniCO introduces a transformer-based unified model for diverse combinatorial optimization problems, framing them as MDPs and employing self-supervised learning to enable generalization and few-shot performance across multiple problem types.
Contribution
The paper presents UniCO, a novel unified transformer model for various CO problems, utilizing a new CO-prefix design and a two-stage self-supervised training approach for broad applicability.
Findings
Successfully solves 10 different CO problems
Demonstrates strong generalization to unseen problems
Achieves few-shot and zero-shot performance
Abstract
Combinatorial Optimization (CO) encompasses a wide range of problems that arise in many real-world scenarios. While significant progress has been made in developing learning-based methods for specialized CO problems, a unified model with a single architecture and parameter set for diverse CO problems remains elusive. Such a model would offer substantial advantages in terms of efficiency and convenience. In this paper, we introduce UniCO, a unified model for solving various CO problems. Inspired by the success of next-token prediction, we frame each problem-solving process as a Markov Decision Process (MDP), tokenize the corresponding sequential trajectory data, and train the model using a transformer backbone. To reduce token length in the trajectory data, we propose a CO-prefix design that aggregates static problem features. To address the heterogeneity of state and action tokens…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Constraint Satisfaction and Optimization
MethodsFocus · Sparse Evolutionary Training
