Theoretical Challenges in Learning for Branch-and-Cut
Hongyu Cheng, Amitabh Basu

TL;DR
This paper explores the theoretical limitations of using local score-based machine learning methods to guide branch-and-cut algorithms in mixed-integer linear programming, revealing potential exponential inefficiencies.
Contribution
It identifies fundamental sources of exponential growth in search trees caused by local score misalignments and small perturbations, challenging existing learning approaches.
Findings
Local score-based methods can produce exponentially larger trees than optimal.
Small perturbations in data can lead to exponential increases in tree size.
Existing policies do not guarantee small or efficient search trees.
Abstract
Machine learning is increasingly used to guide branch-and-cut (B&C) for mixed-integer linear programming by learning score-based policies for selecting branching variables and cutting planes. Many approaches train on local signals from lookahead heuristics such as strong branching, and linear programming (LP) bound improvement for cut selection. Training and evaluation of the learned models often focus on local score accuracy. We show that such local score-based methods can lead to search trees exponentially larger than optimal tree sizes, by identifying two sources of this gap. The first is that these widely used expert signals can be misaligned with overall tree size. LP bound improvement can select a root cut set that yields an exponentially larger strong branching tree than selecting cuts by a simple proxy score, and strong branching itself can be exponentially suboptimal (Dey et…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Machine Learning and Data Classification
