Collab-Solver: Collaborative Solving Policy Learning for Mixed-Integer Linear Programming
Siyuan Li, Yifan Yu, Zhihao Zhang, Mengjing Chen, Fangzhou Zhu, Tao Zhong, Peng Liu, Jianye Hao

TL;DR
Collab-Solver introduces a multi-agent policy learning framework for MILP that enables collaborative optimization of solver modules, significantly improving performance and generalization by modeling their interaction as a Stackelberg game.
Contribution
It proposes a novel multi-agent policy learning framework for MILP that models module collaboration as a Stackelberg game, with a two-phase learning paradigm for stability.
Findings
Significant performance improvements on synthetic and real-world datasets.
Policies demonstrate strong generalization across different MILP instances.
Collaborative policies outperform isolated module policies.
Abstract
Mixed-integer linear programming (MILP) has been a fundamental problem in combinatorial optimization. Conventional MILP solving mainly relies on carefully designed heuristics embedded in the branch-and-bound framework. Driven by the strong capabilities of neural networks, recent research is exploring the value of machine learning alongside conventional MILP solving. Although learning-based MILP methods have shown great promise, existing works typically learn policies for individual modules in MILP solvers in isolation, without considering their interdependence, which limits both solving efficiency and solution quality. To address this limitation, we propose Collab-Solver, a novel multi-agent-based policy learning framework for MILP that enables collaborative policy optimization for multiple modules. Specifically, we formulate the collaboration between cut selection and branching in MILP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
