Towards Foundation Models for Mixed Integer Linear Programming

Sirui Li; Janardhan Kulkarni; Ishai Menache; Cathy Wu; Beibin Li

arXiv:2410.08288·cs.LG·February 24, 2025·3 cites

Towards Foundation Models for Mixed Integer Linear Programming

Sirui Li, Janardhan Kulkarni, Ishai Menache, Cathy Wu, Beibin Li

PDF

Open Access 1 Datasets 1 Video 3 Reviews

TL;DR

This paper proposes a foundation model approach for MILP by training on a diverse, synthetically generated dataset, enabling better generalization across problem classes and improving performance on unseen benchmarks.

Contribution

It introduces MILP-Evolve, a novel LLM-based evolutionary framework for generating diverse MILP problems, and demonstrates the effectiveness of a foundation model approach for MILP generalization.

Findings

01

Models trained on MILP-Evolve data outperform on unseen problems

02

Significant improvements on MIPLIB benchmarks

03

Demonstrates potential of foundation models for MILP applications

Abstract

Mixed Integer Linear Programming (MILP) is essential for modeling complex decision-making problems but faces challenges in computational tractability and requires expert formulation. Current deep learning approaches for MILP focus on specific problem classes and do not generalize to unseen classes. To address this shortcoming, we take a foundation model training approach, where we train a single deep learning model on a diverse set of MILP problems to generalize across problem classes. As existing datasets for MILP lack diversity and volume, we introduce MILP-Evolve, a novel LLM-based evolutionary framework that is capable of generating a large set of diverse MILP classes with an unlimited amount of instances. We study our methodology on three key learning tasks that capture diverse aspects of MILP: (1) integrality gap prediction, (2) learning to branch, and (3) a new task of aligning…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 3

Strengths

1. Employing LLM to generate diversified MILP instances is novel and helpful for training a foundation model for MILP. The entire MILP space is too huge, so datasets created by humans can only cover a part of it. So, leveraging the power of LLM is a good direction. 2. The authors' commitment to open-source the entire framework is valuable for the entire community.

Weaknesses

1. The dataset test seems to be not that "unseen." You mentioned that you collected MILP problems from eight classes. But you randomly split them after the augmentation. Then, the trained model still learned from all these eight classes. So it would be great if you only use six classes for training, 1 for validation, and 1 for testing. Then this can further show the power of your method.

Reviewer 02Rating 6Confidence 4

Strengths

- **Diversity of MILP Generation**: The MILP-Evolve framework introduces an innovative approach to generating diverse MILP problem classes, which has the potential to enhance generalization in ML-based MILP solvers. - **Empirical Performance**: The paper demonstrates strong performance improvements on the integrality gap prediction and learning to branch tasks, providing evidence that the proposed approach can generalize to unseen MILP classes. - **Extensive Experimental Work**: The paper presen

Weaknesses

1. **Practical Application of Language-MILP Contrastive Learning**: The Language-MILP Contrastive Learning task is positioned as a way to assist non-experts in understanding and formulating MILPs. However, the generated natural language descriptions tend to emphasize technical details (e.g., linear constraints, integer variables), and it is not entirely clear how this helps users grasp the real-world significance of MILP problems. It would be helpful if the authors could provide more clarificati

Reviewer 03Rating 6Confidence 4

Strengths

1. The proposed data generation method is validated on three MILP-related learning tasks. 2. The paper is well-structured and presented clearly.

Weaknesses

1. The methodological contribution is somewhat limited, primarily offering a data augmentation approach that employs LLMs to generate diverse MILP instances. 2. There is a mismatch between the content and title of the paper. The title “Towards Foundation Models for Mixed Integer Linear Programming” suggests a broader scope, while the paper mainly discusses a data generation method for MILP.

Code & Models

Datasets

microsoft/MILP-Evolve
dataset· 411 dl
411 dl

Videos

Towards Foundation Models for Mixed Integer Linear Programming· slideslive

Taxonomy

TopicsOptimization and Mathematical Programming · Multi-Criteria Decision Making · Advanced Optimization Algorithms Research

MethodsSparse Evolutionary Training · Focus