CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code
Tasnim Ahmed, Salimur Choudhury

TL;DR
CHORUS is a retrieval-augmented framework that enhances large language models' ability to generate accurate Gurobi-based linear programming code from natural language, significantly improving performance and resource efficiency.
Contribution
The paper introduces CHORUS, a hierarchical retrieval and structured reasoning framework that substantially improves LP code generation accuracy in open-source LLMs.
Findings
CHORUS outperforms baseline models and conventional RAG in LP code generation.
Open-source LLMs with CHORUS match or surpass GPT-3.5 and GPT-4 performance.
Ablation studies confirm the importance of hierarchical chunking and expert prompting.
Abstract
Linear Programming (LP) problems aim to find the optimal solution to an objective under constraints. These problems typically require domain knowledge, mathematical skills, and programming ability, presenting significant challenges for non-experts. This study explores the efficiency of Large Language Models (LLMs) in generating solver-specific LP code. We propose CHORUS, a retrieval-augmented generation (RAG) framework for synthesizing Gurobi-based LP code from natural language problem statements. CHORUS incorporates a hierarchical tree-like chunking strategy for theoretical contents and generates additional metadata based on code examples from documentation to facilitate self-contained, semantically coherent retrieval. Two-stage retrieval approach of CHORUS followed by cross-encoder reranking further ensures contextual relevance. Finally, expertly crafted prompt and structured parser…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Real-time simulation and control systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout · Softmax · Residual Connection · WordPiece
