Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction
Mingze Kong, Zikun Qu, Zhongquan Zhou, Pengyu Liang, Xiang Li, Zhiwei Shang, Zhi Hong, Kaiyu Huang, Zhiyong Wang, Zhongxiang Dai

TL;DR
Workflow-R1 introduces a multi-turn, natural language-based framework for workflow construction, utilizing a novel Group Sub-sequence Policy Optimization method to improve multi-turn reasoning in agentic workflows.
Contribution
It reformulates workflow synthesis as a sequential decision-making process and proposes GSsPO, a structure-aware RL algorithm tailored for multi-turn agentic reasoning tasks.
Findings
Outperforms baseline methods on multiple QA benchmarks
Validates GSsPO as a general solution for sequential reasoning
Establishes Workflow-R1 as a new paradigm for workflow optimization
Abstract
The rapid evolution of agentic workflows has demonstrated strong performance of LLM-based agents in addressing complex reasoning tasks. However, existing workflow optimization methods typically formulate workflow synthesis as a static, one-shot code-centric generation problem. This paradigm imposes excessive constraints on the model's coding capabilities and restricts the flexibility required for dynamic problem-solving. In this paper, we present Workflow-R1, a framework that reformulates workflow construction as a multi-turn, natural language-based sequential decision-making process. To resolve the optimization granularity mismatch inherent in such multi-turn interactions, we introduce Group Sub-sequence Policy Optimization (GSsPO). While explicitly tailored to align with the interleaved Think-Action dynamics of agentic reasoning, GSsPO fundamentally functions as a structure-aware RL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Business Process Modeling and Analysis · Scientific Computing and Data Management
