ORLM: A Customizable Framework in Training Large Models for Automated Optimization Modeling
Chenyu Huang, Zhengyang Tang, Shixi Hu, Ruoqing Jiang, Xin Zheng, Dongdong Ge, Benyou Wang, Zizhuo Wang

TL;DR
This paper introduces ORLM, an open-source framework for training large language models tailored to optimization modeling, addressing data scarcity and privacy issues, and demonstrating competitive performance on industry-relevant benchmarks.
Contribution
It presents a semi-automated data synthesis framework, OR-Instruct, for customizable training of open-source LLMs in optimization modeling, and introduces IndustryOR, a new industrial benchmark.
Findings
ORLM models outperform existing benchmarks in optimization tasks
Synthesized data significantly improves LLM capabilities in OR modeling
Scaling law and reinforcement learning can further enhance ORLM performance
Abstract
Optimization modeling plays a critical role in the application of Operations Research (OR) tools to address real-world problems, yet they pose challenges and require extensive expertise from OR experts. With the advent of large language models (LLMs), new opportunities have emerged to streamline and automate such task. However, current research predominantly relies on closed-source LLMs such as GPT-4, along with extensive prompt engineering techniques. This reliance stems from the scarcity of high-quality training datasets for optimization modeling, resulting in elevated costs, prolonged processing times, and privacy concerns. To address these challenges, our work is the first to propose a viable path for training open-source LLMs that are capable of optimization modeling and developing solver codes, eventually leading to a superior ability for automating optimization modeling and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗CardinalOperations/ORLM-LLaMA-3-8Bmodel· 258 dl· ♡ 12258 dl♡ 12
- 🤗RichardErkhov/CardinalOperations_-_ORLM-LLaMA-3-8B-ggufmodel· 498 dl498 dl
- 🤗RichardErkhov/CardinalOperations_-_ORLM-LLaMA-3-8B-8bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/CardinalOperations_-_ORLM-LLaMA-3-8B-awqmodel· 1 dl1 dl
- 🤗ant-opt/LLMOPT-Qwen2.5-14Bmodel· 115 dl· ♡ 9115 dl♡ 9
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Data Processing Techniques
MethodsAttention Is All You Need · Adam · Residual Connection · Byte Pair Encoding · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Label Smoothing · Softmax
