DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems
Hang Zhao, Kexiong Yu, Yuhang Huang, Renjiao Yi, Chenyang Zhu, Kai Xu

TL;DR
DISCO introduces an efficient diffusion-based solver for large-scale combinatorial optimization problems, improving solution quality and inference speed by constraining sampling space and accelerating denoising, outperforming existing methods on TSP and MIS benchmarks.
Contribution
The paper presents a novel diffusion solver that enhances solution quality and inference speed for large-scale CO problems by constraining sampling space and employing an analytically solvable denoising process.
Findings
DISCO achieves up to 5.28x faster inference than other diffusion methods.
It generalizes well to unseen problem scales, surpassing models trained specifically for those scales.
Strong performance on large TSP and Maximal Independent Set benchmarks.
Abstract
Combinatorial Optimization (CO) problems are fundamentally important in numerous real-world applications across diverse industries, characterized by entailing enormous solution space and demanding time-sensitive response. Despite recent advancements in neural solvers, their limited expressiveness struggles to capture the multi-modal nature of CO landscapes. While some research has shifted towards diffusion models, these models still sample solutions indiscriminately from the entire NP-complete solution space with time-consuming denoising processes, which limit their practicality for large problem scales. We propose DISCO, an efficient DIffusion Solver for large-scale Combinatorial Optimization problems that excels in both solution quality and inference speed. DISCO's efficacy is twofold: First, it enhances solution quality by constraining the sampling space to a more meaningful domain…
Peer Reviews
Decision·Submitted to ICLR 2025
1. DISCO significantly reduces inference times by introducing an analytically solvable denoising process and constraining the sampling space. 2. The paper is well-written.
1. The multi-modal graph search is presented as a crucial component of DISCO, with the paper asserting that its multi-modal output helps prevent sub-optimal solutions. However, the experiments do not thoroughly analyze the contribution of this module. It would be beneficial for the authors to include performance metrics with and without this module and detail the computational overhead incurred when it is enabled. 2. The model's approach involves initially splitting the graph to find solutions,
1. DISCO achieves state-of-the-art results on large-scale TSP-10000 instances and challenging MIS benchmarks, demonstrating superior performance both in terms of solution quality and speed. 2. The proposed divide-and-conquer strategy effectively generalizes DISCO to solve large-scale problem instances, highlighting the method’s scalability and versatility. 3. Beyond the previous results, the paper extends experiment results on aspects of 1) [which I think is the most significant challenge for
The graph search might lead to exponential growth in trial variance.The scalability might still remain an issue. The authors have claimed the issue in limitation and leave it as future research oppurtunity.
1. The idea of using a feasible solution as the residue constrain to guide the generation process sounds novel and reasonable. 2. Experimental results show that DISCO is both efficient and effective in solving large scale TSP problems.
1. The novelty is limited. Solution residue is from [1] and multi-model is from [2]. Can you summarize the novelty of the proposed method? 2. It is unclear why the residue term can lead to better solutions. Figure 1 is just an explanation of the intuition. More supporting analysis and evidence are needed. [1] Liu, Jiawei, et al. "Residual denoising diffusion models." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024. [2] Fu, Zhang-Hua, Kai-Bin Qiu, and Hong
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Packing Problems
MethodsSparse Evolutionary Training · Diffusion
