Nalar: An agent serving framework
Marco Laju, Donghyun Son, Saurabh Agarwal, Nitin Kedia, Myungjin Lee, Jayanth Srinivasa, Aditya Akella

TL;DR
Nalar is a comprehensive agent-serving framework that enhances efficiency, scalability, and robustness for complex, multi-step agentic applications by separating workflow specification from execution and providing advanced runtime control.
Contribution
It introduces a novel framework that decouples workflow design from execution, enabling scalable, efficient, and policy-driven serving of heterogeneous agent applications.
Findings
Reduces tail latency by 34-74% across workloads.
Achieves up to 2.9x speedups over baselines.
Supports scaling to 130K futures with low control overhead.
Abstract
LLM-driven agentic applications increasingly automate complex, multi-step tasks, but serving them efficiently remains challenging due to heterogeneous components, dynamic and model-driven control flow, long-running state, and unpredictable latencies. Nalar is a ground-up agent-serving framework that cleanly separates workflow specification from execution while providing the runtime visibility and control needed for robust performance. Nalar preserves full Python expressiveness, using lightweight auto-generated stubs that turn agent and tool invocations into futures carrying dependency and context metadata. A managed state layer decouples logical state from physical placement, enabling safe reuse, migration, and consistent retry behavior. A two-level control architecture combines global policy computation with local event-driven enforcement to support adaptive routing, scheduling, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Multi-Agent Systems and Negotiation · Mobile Agent-Based Network Management
