TL;DR
SAGE is a framework that enhances optimization modeling with reasoning LLMs by explicitly incorporating modeling strategies, leading to more correct, diverse, and efficient formulations across various benchmarks.
Contribution
It introduces a strategy-aware approach with a multi-strategy dataset and training method, significantly improving correctness, diversity, and efficiency of optimization models.
Findings
Pass@1 accuracy improved from 72.7% to 80.3%.
Discovered more correct formulations and increased diversity at pass@16 by 19-29%.
Produced more compact constraint systems with 14.2% fewer constraints.
Abstract
Large language models (LLMs) can generate syntactically valid optimization programs, yet often struggle to reliably choose an effective modeling strategy, leading to incorrect formulations and inefficient solver behavior. We propose SAGE, a strategy-aware framework that makes Modeling Strategy explicit in both data construction and post-training. SAGE builds a solver-verified multi-strategy dataset and trains a student model with supervised fine-tuning followed by Segment-Weighted GRPO using a composite reward over format compliance, correctness, and solver efficiency. Across eight benchmarks spanning synthetic and real-world settings, SAGE improves average pass@1 from 72.7 to 80.3 over the strongest open-source baseline. With multiple generations, SAGE discovers more distinct correct formulations and improves component-level diversity at pass@16 by 19-29%. At the largest scale, SAGE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
