GraphFlow: A Graph-Based Workflow Management for Efficient LLM-Agent Serving
Ao Li, Shangpeng Yang, Fahao Chen, Tianheng Xu, Peng Li, Zhou Su

TL;DR
GraphFlow introduces a graph-based workflow management system for LLM agents that enhances efficiency and generalization by dynamically constructing workflows and optimizing memory usage.
Contribution
The paper presents wGraph, a unified graph representation for workflows, and introduces GraphFlow, a system that improves LLM-agent serving through adaptive workflow generation and efficient state management.
Findings
Outperforms state-of-the-art methods on five benchmarks by ~4.95 percentage points.
Achieves ~4× reduction in memory footprint during agent serving.
Demonstrates improved generalization to unseen tasks.
Abstract
Large Language Model (LLM)-based agents demonstrate strong reasoning and execution capabilities on complex tasks when guided by structured instructions, commonly referred to as workflows. However, existing workflow-assisted agent serving systems typically rely on predefined templates and shallow matching mechanisms, which limit their ability to capture deep semantic relationships and generalize to previously unseen tasks. To address these limitations, we propose a new workflow management paradigm that represents workflows using a unified graph, termed wGraph, where each node corresponds to an atomic operation. wGraph serves as a shared substrate from which task-specific workflows are dynamically instantiated. Building on wGraph primitives, we introduce GraphFlow, a system that efficiently integrates workflows into agent serving through two key designs. First, adaptive workflow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
