Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks
Gianni Molinari, Fabio Ciravegna

TL;DR
RP-ReAct introduces a multi-agent framework that separates strategic planning from execution, improving reliability, efficiency, and robustness in complex enterprise tasks involving multiple tools and data sources.
Contribution
The paper presents RP-ReAct, a novel multi-agent system that decouples planning from execution, incorporating context management to enhance performance in complex enterprise environments.
Findings
Outperforms state-of-the-art baselines on the ToolQA benchmark.
Demonstrates improved generalization across multiple domains.
Shows increased robustness and stability across different model scales.
Abstract
Despite recent advances, autonomous agents often struggle to solve complex tasks in enterprise domains that require coordinating multiple tools and processing diverse data sources. This struggle is driven by two main limitations. First, single-agent architectures enforce a monolithic plan-execute loop, which directly causes trajectory instability. Second, the requirement to use local open-weight models for data privacy introduces smaller context windows leading to the rapid consumption of context from large tool outputs. To solve this problem we introduce RP-ReAct (Reasoner Planner-ReAct), a novel multi-agent approach that fundamentally decouples strategic planning from low-level execution to achieve superior reliability and efficiency. RP-ReAct consists of a Reasoner Planner Agent (RPA), responsible for planning each sub-step, continuously analysing the execution results using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAI-based Problem Solving and Planning · Multi-Agent Systems and Negotiation · Semantic Web and Ontologies
