Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost

Simon Dennis; Rivaan Patil; Kevin Shabahang; Hao Guo

arXiv:2605.22502·cs.AI·May 22, 2026

Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost

Simon Dennis, Rivaan Patil, Kevin Shabahang, Hao Guo

PDF

TL;DR

This paper demonstrates that compiling agent workflows into small fine-tuned language models achieves near-frontier quality at significantly reduced costs, addressing adoption barriers in agent orchestration.

Contribution

It empirically evaluates the benefits of compiling workflows into model weights across multiple domains, overcoming barriers to adoption compared to orchestration frameworks.

Findings

01

Compiled models achieve near-frontier quality.

02

Compilation reduces costs by two orders of magnitude.

03

Addresses developer adoption barriers empirically.

Abstract

Agent orchestration frameworks have proliferated, collectively exceeding 290,000 GitHub stars across LangGraph, CrewAI, Google ADK, OpenAI Agents SDK, Semantic Kernel, Strands, and LlamaIndex. All follow the same pattern: an external orchestrator above the LLM, injecting instructions and routing decisions every turn. Recent work has shown this architecture is dominated for procedural tasks by simply providing the procedure in a frontier model's system prompt [Dennis et al., 2026a], at the cost of consuming the context window, requiring a frontier model for every conversation, and exposing proprietary procedures to third-party providers. Compiling the procedure into the weights of a small fine-tuned model -- creating a subterranean agent -- should resolve all of these concerns, and prior work (SimpleTOD, FireAct, SynTOD, WorkflowLLM, Agent Lumos) has shown the technique works. Yet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.