TL;DR
GenBench-MILP is a comprehensive benchmark suite that evaluates the quality of MILP instance generators across multiple dimensions, including validity, similarity, hardness, and utility, using solver-internal features for a more nuanced assessment.
Contribution
This paper introduces GenBench-MILP, a novel benchmark suite that standardizes and deepens the evaluation of MILP instance generation methods through multifaceted metrics and solver-internal analysis.
Findings
Instances with high structural similarity can have vastly different solver behaviors.
Solver-internal features reveal nuances missed by static structural metrics.
GenBench-MILP enables more rigorous comparison of MILP instance generators.
Abstract
The proliferation of machine learning-based methods for Mixed-Integer Linear Programming (MILP) instance generation has surged, driven by the need for diverse training datasets. However, a critical question remains: Are these generated instances truly useful and realistic? Current evaluation protocols often rely on superficial structural metrics or simple solvability checks, which frequently fail to capture the true computational complexity of real-world problems. To bridge this gap, we introduce GenBench-MILP, a comprehensive benchmark suite designed for the standardized and objective evaluation of MILP generators. Our framework assesses instance quality across four key dimensions: mathematical validity, structural similarity, computational hardness, and utility in downstream tasks. A distinctive innovation of GenBench-MILP is the analysis of solver-internal features -- including root…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
