Large Language Model enabled Mathematical Modeling
Guoyun Zhang

TL;DR
This paper explores how Large Language Models, especially DeepSeek-R1, can be integrated into mathematical modeling for operations research, focusing on reducing hallucinations and improving formulation accuracy in real-world decision-making scenarios.
Contribution
It systematically evaluates DeepSeek-R1's effectiveness in OR benchmarks and introduces strategies to mitigate hallucinations and enhance model reliability.
Findings
DeepSeek-R1 performs well on OR benchmarks.
Mitigation strategies reduce hallucination rates.
Enhanced formulation accuracy in applied OR scenarios.
Abstract
The integration of Large Language Models (LLMs) with optimization modeling offers a promising avenue for advancing decision-making in operations research (OR). Traditional optimization methods,such as linear programming, mixed integer programming, and simulation depend heavily on domain expertise to translate real-world problems into solvable mathematical models. While solvers like Gurobi and COPT are powerful, expert input remains essential for defining objectives, constraints, and variables. This research investigates the potential of LLMs, specifically the DeepSeek-R1 model, to bridge this formulation gap using natural language understanding and code generation. Although prior models like GPT-4, Claude, and Bard have shown strong performance in NLP and reasoning tasks, their high token costs and tendency toward hallucinations limit real-world applicability in supply chain contexts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Explainable Artificial Intelligence (XAI)
