A General Neural Backbone for Mixed-Integer Linear Optimization via Dual Attention
Peixin Huang, Yaoxin Wu, Yining Ma, Cathy Wu, Wen Song, Wei Zhang

TL;DR
This paper introduces a dual-attention neural architecture that enhances the representation power of MILP solvers by enabling global information exchange, leading to improved performance across multiple optimization tasks.
Contribution
It proposes a novel dual-attention mechanism that surpasses graph neural networks in representing MILP instances, improving solver efficiency and accuracy.
Findings
Consistent performance improvements over state-of-the-art baselines
Effective global information exchange enhances representation learning
Versatile application across various MILP tasks
Abstract
Mixed-integer linear programming (MILP), a widely used modeling framework for combinatorial optimization, are central to many scientific and engineering applications, yet remains computationally challenging at scale. Recent advances in deep learning address this challenge by representing MILP instances as variable-constraint bipartite graphs and applying graph neural networks (GNNs) to extract latent structural patterns and enhance solver efficiency. However, this architecture is inherently limited by the local-oriented mechanism, leading to restricted representation power and hindering neural approaches for MILP. Here we present an attention-driven neural architecture that learns expressive representations beyond the pure graph view. A dual-attention mechanism is designed to perform parallel self- and cross-attention over variables and constraints, enabling global information exchange…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · Advanced Multi-Objective Optimization Algorithms · Advanced Graph Neural Networks
