UniCO: Towards a Unified Model for Combinatorial Optimization Problems

Zefang Zong; Xiaochen Wei; Guozhen Zhang; Chen Gao; Huandong Wang; Yong Li

arXiv:2505.06290·cs.LG·May 13, 2025

UniCO: Towards a Unified Model for Combinatorial Optimization Problems

Zefang Zong, Xiaochen Wei, Guozhen Zhang, Chen Gao, Huandong Wang, Yong Li

PDF

Open Access

TL;DR

UniCO introduces a transformer-based unified model for diverse combinatorial optimization problems, framing them as MDPs and employing self-supervised learning to enable generalization and few-shot performance across multiple problem types.

Contribution

The paper presents UniCO, a novel unified transformer model for various CO problems, utilizing a new CO-prefix design and a two-stage self-supervised training approach for broad applicability.

Findings

01

Successfully solves 10 different CO problems

02

Demonstrates strong generalization to unseen problems

03

Achieves few-shot and zero-shot performance

Abstract

Combinatorial Optimization (CO) encompasses a wide range of problems that arise in many real-world scenarios. While significant progress has been made in developing learning-based methods for specialized CO problems, a unified model with a single architecture and parameter set for diverse CO problems remains elusive. Such a model would offer substantial advantages in terms of efficiency and convenience. In this paper, we introduce UniCO, a unified model for solving various CO problems. Inspired by the success of next-token prediction, we frame each problem-solving process as a Markov Decision Process (MDP), tokenize the corresponding sequential trajectory data, and train the model using a transformer backbone. To reduce token length in the trajectory data, we propose a CO-prefix design that aggregates static problem features. To address the heterogeneity of state and action tokens…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Constraint Satisfaction and Optimization

MethodsFocus · Sparse Evolutionary Training