TL;DR
FlowSteer introduces an end-to-end agentic workflow design paradigm using a novel executable canvas and reinforcement learning, enabling more effective and repairable complex task workflows.
Contribution
It proposes a new paradigm where a single agent designs workflows via reinforced progressive editing on an executable canvas, improving over prior human-dependent methods.
Findings
FlowSteer outperforms baselines on twelve datasets.
The framework supports diverse operator libraries and LLM backends.
Reinforced editing improves workflow construction accuracy.
Abstract
In recent years, agentic workflows have been widely applied to solve complex human tasks. However, existing workflow construction still faces key challenges, including human-dependent workflow construction, the lack of graph-level execution feedback, and the inability to repair errors in-loop during long-horizon construction. To address these challenges, we propose FlowSteer, a new paradigm of Agent Designing Agentic Workflows - a single agent itself end-to-end designs the workflow that a downstream executor runs. To support this paradigm, we introduce the Workflow Canvas, a novel executable graph-state environment that returns syntax-checked execution feedback for every atomic edit. Built on the canvas, we further propose Reinforced Progressive Canvas Editing, in which a lightweight policy agent issues one atomic edit per turn conditioned on real canvas feedback, and is trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
