A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents
Vasundra Srinivasan

TL;DR
This paper introduces a methodology for selecting and composing runtime architecture patterns for production LLM agents, emphasizing the stochastic-deterministic boundary as a core primitive and addressing reliability concerns.
Contribution
It presents a novel five-step methodology for pattern selection, a diagnostic procedure for failure analysis, and a new failure mode called replay divergence.
Findings
Identified six runtime patterns for LLM agents across different types.
Mapped production failures to specific pattern weaknesses.
Demonstrated the methodology on five workloads with a reference implementation.
Abstract
Production LLM agents combine stochastic model outputs with deterministic software systems, yet the boundary between the two is rarely treated as a first-class architectural object. This paper names that boundary the stochastic-deterministic boundary (SDB): a four-part contract among a proposer, verifier, commit step, and reject signal that specifies how an LLM output becomes a system action. We argue that the SDB is the load-bearing primitive of production agent runtimes. Around this primitive, we organize agent runtime design into three concerns: Coordination, State, and Control. We present a catalog of six runtime patterns that compose the SDB differently across conversational, autonomous, and long-horizon agents: hierarchical delegation, scatter-gather plus saga, event-driven sequencing, shared state machine, supervisor plus gate, and human in the loop. For each pattern, we trace…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
