Are Your Generated Instances Truly Useful? GenBench-MILP: A Benchmark Suite for MILP Instance Generation

Yidong Luo; Chenguang Wang; Dong Li; Tianshu Yu

arXiv:2505.24779·cs.LG·February 6, 2026

Are Your Generated Instances Truly Useful? GenBench-MILP: A Benchmark Suite for MILP Instance Generation

Yidong Luo, Chenguang Wang, Dong Li, Tianshu Yu

PDF

1 Repo

TL;DR

GenBench-MILP is a comprehensive benchmark suite that evaluates the quality of MILP instance generators across multiple dimensions, including validity, similarity, hardness, and utility, using solver-internal features for a more nuanced assessment.

Contribution

This paper introduces GenBench-MILP, a novel benchmark suite that standardizes and deepens the evaluation of MILP instance generation methods through multifaceted metrics and solver-internal analysis.

Findings

01

Instances with high structural similarity can have vastly different solver behaviors.

02

Solver-internal features reveal nuances missed by static structural metrics.

03

GenBench-MILP enables more rigorous comparison of MILP instance generators.

Abstract

The proliferation of machine learning-based methods for Mixed-Integer Linear Programming (MILP) instance generation has surged, driven by the need for diverse training datasets. However, a critical question remains: Are these generated instances truly useful and realistic? Current evaluation protocols often rely on superficial structural metrics or simple solvability checks, which frequently fail to capture the true computational complexity of real-world problems. To bridge this gap, we introduce GenBench-MILP, a comprehensive benchmark suite designed for the standardized and objective evaluation of MILP generators. Our framework assesses instance quality across four key dimensions: mathematical validity, structural similarity, computational hardness, and utility in downstream tasks. A distinctive innovation of GenBench-MILP is the analysis of solver-internal features -- including root…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anonymous-neurips-submission-2025/eva-milp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.