GraphFlow: A Graph-Based Workflow Management for Efficient LLM-Agent Serving

Ao Li; Shangpeng Yang; Fahao Chen; Tianheng Xu; Peng Li; Zhou Su

arXiv:2605.22566·cs.LG·May 22, 2026

GraphFlow: A Graph-Based Workflow Management for Efficient LLM-Agent Serving

Ao Li, Shangpeng Yang, Fahao Chen, Tianheng Xu, Peng Li, Zhou Su

PDF

TL;DR

GraphFlow introduces a graph-based workflow management system for LLM agents that enhances efficiency and generalization by dynamically constructing workflows and optimizing memory usage.

Contribution

The paper presents wGraph, a unified graph representation for workflows, and introduces GraphFlow, a system that improves LLM-agent serving through adaptive workflow generation and efficient state management.

Findings

01

Outperforms state-of-the-art methods on five benchmarks by ~4.95 percentage points.

02

Achieves ~4× reduction in memory footprint during agent serving.

03

Demonstrates improved generalization to unseen tasks.

Abstract

Large Language Model (LLM)-based agents demonstrate strong reasoning and execution capabilities on complex tasks when guided by structured instructions, commonly referred to as workflows. However, existing workflow-assisted agent serving systems typically rely on predefined templates and shallow matching mechanisms, which limit their ability to capture deep semantic relationships and generalize to previously unseen tasks. To address these limitations, we propose a new workflow management paradigm that represents workflows using a unified graph, termed wGraph, where each node corresponds to an atomic operation. wGraph serves as a shared substrate from which task-specific workflows are dynamically instantiated. Building on wGraph primitives, we introduce GraphFlow, a system that efficiently integrates workflows into agent serving through two key designs. First, adaptive workflow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.