Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning
Zeren Huang, Wenhao Chen, Weinan Zhang, Chuhan Shi, Furui Liu,, Hui-Ling Zhen, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang

TL;DR
This paper introduces a novel offline reinforcement learning approach for variable selection in mixed-integer programming, emphasizing long-term utility to improve solver efficiency and robustness over existing heuristics and learning methods.
Contribution
It formulates learning to branch as an offline RL problem with a ranking-based reward scheme, enabling long-sighted decision making in MIP solvers.
Findings
Outperforms heuristics and state-of-the-art learning models on benchmarks.
More efficient and robust across various MIP instances.
Better generalization to large-scale problems.
Abstract
Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers. With MIP branching data collected during the previous solution process, learning to branch methods have recently become superior over heuristics. As branch-and-bound is naturally a sequential decision making task, one should learn to optimize the utility of the whole MIP solving process instead of being myopic on each step. In this work, we formulate learning to branch as an offline reinforcement learning (RL) problem, and propose a long-sighted hybrid search scheme to construct the offline MIP dataset, which values the long-term utilities of branching decisions. During the policy training phase, we deploy a ranking-based reward assignment scheme to distinguish the promising samples from the long-term or short-term view, and train the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Routing Optimization Methods · Metaheuristic Optimization Algorithms Research · Reinforcement Learning in Robotics
