Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud Platforms
Gohar Irfan Chaudhry, Esha Choukse, Haoran Qiu, \'I\~nigo Goiri, Rodrigo Fonseca, Adam Belay, Ricardo Bianchini

TL;DR
Murakkab is a resource-efficient system for orchestrating agentic workflows in cloud platforms, decoupling workflow design from execution to optimize resource use and meet service objectives.
Contribution
It introduces a declarative abstraction and adaptive runtime that jointly optimize workflow execution across models and hardware, improving efficiency and flexibility.
Findings
Reduces GPU usage by up to 2.8×
Decreases energy consumption by 3.7×
Lowers cost by 4.3× while maintaining SLOs
Abstract
Agentic workflows commonly coordinate multiple models and tools with complex control logic. They are quickly becoming the dominant paradigm for AI applications. However, serving them remains inefficient with today's frameworks. The key problem is that they expose workflows as opaque sequences of model and tool calls that tightly couple agent logic with model and hardware choices. Often, these workflow components are fragmented across different entities, preventing systems from reasoning about trade-offs across accuracy, latency, energy, and cost. This leads to resource waste and degraded service-level objectives (SLOs). We present Murakkab, a resource-efficient serving system for agentic workflows. Murakkab introduces a declarative abstraction that decouples workflow specification from execution configuration. A profile-guided optimizer and adaptive runtime jointly manage the full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
