Towards Foundation Models for Mixed Integer Linear Programming
Sirui Li, Janardhan Kulkarni, Ishai Menache, Cathy Wu, Beibin Li

TL;DR
This paper proposes a foundation model approach for MILP by training on a diverse, synthetically generated dataset, enabling better generalization across problem classes and improving performance on unseen benchmarks.
Contribution
It introduces MILP-Evolve, a novel LLM-based evolutionary framework for generating diverse MILP problems, and demonstrates the effectiveness of a foundation model approach for MILP generalization.
Findings
Models trained on MILP-Evolve data outperform on unseen problems
Significant improvements on MIPLIB benchmarks
Demonstrates potential of foundation models for MILP applications
Abstract
Mixed Integer Linear Programming (MILP) is essential for modeling complex decision-making problems but faces challenges in computational tractability and requires expert formulation. Current deep learning approaches for MILP focus on specific problem classes and do not generalize to unseen classes. To address this shortcoming, we take a foundation model training approach, where we train a single deep learning model on a diverse set of MILP problems to generalize across problem classes. As existing datasets for MILP lack diversity and volume, we introduce MILP-Evolve, a novel LLM-based evolutionary framework that is capable of generating a large set of diverse MILP classes with an unlimited amount of instances. We study our methodology on three key learning tasks that capture diverse aspects of MILP: (1) integrality gap prediction, (2) learning to branch, and (3) a new task of aligning…
Peer Reviews
Decision·ICLR 2025 Poster
1. Employing LLM to generate diversified MILP instances is novel and helpful for training a foundation model for MILP. The entire MILP space is too huge, so datasets created by humans can only cover a part of it. So, leveraging the power of LLM is a good direction. 2. The authors' commitment to open-source the entire framework is valuable for the entire community.
1. The dataset test seems to be not that "unseen." You mentioned that you collected MILP problems from eight classes. But you randomly split them after the augmentation. Then, the trained model still learned from all these eight classes. So it would be great if you only use six classes for training, 1 for validation, and 1 for testing. Then this can further show the power of your method.
- **Diversity of MILP Generation**: The MILP-Evolve framework introduces an innovative approach to generating diverse MILP problem classes, which has the potential to enhance generalization in ML-based MILP solvers. - **Empirical Performance**: The paper demonstrates strong performance improvements on the integrality gap prediction and learning to branch tasks, providing evidence that the proposed approach can generalize to unseen MILP classes. - **Extensive Experimental Work**: The paper presen
1. **Practical Application of Language-MILP Contrastive Learning**: The Language-MILP Contrastive Learning task is positioned as a way to assist non-experts in understanding and formulating MILPs. However, the generated natural language descriptions tend to emphasize technical details (e.g., linear constraints, integer variables), and it is not entirely clear how this helps users grasp the real-world significance of MILP problems. It would be helpful if the authors could provide more clarificati
1. The proposed data generation method is validated on three MILP-related learning tasks. 2. The paper is well-structured and presented clearly.
1. The methodological contribution is somewhat limited, primarily offering a data augmentation approach that employs LLMs to generate diverse MILP instances. 2. There is a mismatch between the content and title of the paper. The title “Towards Foundation Models for Mixed Integer Linear Programming” suggests a broader scope, while the paper mainly discusses a data generation method for MILP.
Code & Models
Videos
Taxonomy
TopicsOptimization and Mathematical Programming · Multi-Criteria Decision Making · Advanced Optimization Algorithms Research
MethodsSparse Evolutionary Training · Focus
