Efficient and Scalable Agentic AI with Heterogeneous Systems
Zain Asgar, Michelle Nguyen, Sachin Katti

TL;DR
This paper introduces a system for efficiently deploying and orchestrating complex AI agent workloads across heterogeneous hardware, optimizing costs and extending infrastructure lifespan.
Contribution
It presents a novel framework for planning, compiling, and dynamically orchestrating AI agent execution graphs on diverse hardware platforms.
Findings
Heterogeneous infrastructure can reduce total cost of ownership.
Combining older GPUs with newer accelerators can match the performance of latest-generation homogeneous setups.
The system enables end-to-end SLA-compliant deployment of agentic workloads.
Abstract
AI agents are emerging as a dominant workload in a wide range of applications, promising to be the vehicle that delivers the benefits of AI to enterprises and consumers. Unlike conventional software or static inference, agentic workloads are dynamic and structurally complex. Often these agents are directed graphs of compute and IO operations that span multi-modal data input and conversion, data processing and context gathering (e.g., vector DB lookups), multiple LLM inferences, tool calls, etc. To scale AI agent usage, we need efficient and scalable deployment and agent-serving infrastructure. To tackle this challenge, we present a system design for dynamic orchestration of AI agent workloads on heterogeneous compute infrastructure spanning CPUs and accelerators, both from different vendors and across different performance tiers within a single vendor. The…
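To make the "directed graph of compute and IO operations" idea concrete, here is a minimal sketch in Python. It is not the paper's planner or orchestrator: the node names, operation kinds, device names, and relative costs are all hypothetical, and the placement rule (pick the cheapest device that supports the operation kind) is a deliberately naive stand-in that ignores SLAs, latency, and data movement.

```python
from graphlib import TopologicalSorter

# Hypothetical agent graph: node -> (operation kind, set of upstream nodes).
AGENT_GRAPH = {
    "parse_input": ("cpu", set()),
    "vector_lookup": ("io", {"parse_input"}),
    "llm_plan": ("llm", {"parse_input", "vector_lookup"}),
    "tool_call": ("io", {"llm_plan"}),
    "llm_answer": ("llm", {"llm_plan", "tool_call"}),
}

# Hypothetical device pool: device -> (supported kinds, relative cost per op).
DEVICES = {
    "cpu_node": ({"cpu", "io"}, 1.0),
    "older_gpu": ({"llm"}, 4.0),
    "newer_accelerator": ({"llm", "cpu"}, 9.0),
}


def place(graph, devices):
    """Return (execution order, node -> device) for a simple greedy placement."""
    # Topological order guarantees every node runs after its dependencies.
    deps = {node: upstream for node, (_, upstream) in graph.items()}
    order = list(TopologicalSorter(deps).static_order())

    placement = {}
    for node in order:
        kind = graph[node][0]
        # Illustrative assumption: choose the cheapest device supporting this kind.
        candidates = [
            (cost, name)
            for name, (kinds, cost) in devices.items()
            if kind in kinds
        ]
        if not candidates:
            raise ValueError(f"no device supports operation kind {kind!r}")
        placement[node] = min(candidates)[1]
    return order, placement


if __name__ == "__main__":
    order, placement = place(AGENT_GRAPH, DEVICES)
    for node in order:
        print(f"{node} -> {placement[node]}")
```

In this toy setup the LLM nodes land on the cheaper older GPU while parsing and IO stay on the CPU node. A real orchestrator would also have to weigh latency targets and transfer costs between devices.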
Taxonomy
Topics: Graph Theory and Algorithms · Distributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques
