PoAct: Policy and Action Dual-Control Agent for Generalized Applications
Guozhi Yuan, Youfeng Liu, Jingli Yang, Wei Jia, Kai Lin, Yansong Gao,, Shan He, Zilin Ding, Haitao Li

TL;DR
PoAct is a novel dual-control agent framework that dynamically switches reasoning policies and modifies action spaces, significantly improving complex reasoning and code action quality in LLM-driven agents.
Contribution
This paper introduces PoAct, a dual-control agent that enhances reasoning accuracy and code actions by managing complex action spaces and policy switching, addressing limitations of existing frameworks.
Findings
20% improvement on LegalAgentBench
Reduces token consumption in complex tasks
Demonstrates scalability with GPT-4o and GLM-4 models
Abstract
Based on their superior comprehension and reasoning capabilities, Large Language Model (LLM) driven agent frameworks have achieved significant success in numerous complex reasoning tasks. ReAct-like agents can solve various intricate problems step-by-step through progressive planning and tool calls, iteratively optimizing new steps based on environmental feedback. However, as the planning capabilities of LLMs improve, the actions invoked by tool calls in ReAct-like frameworks often misalign with complex planning and challenging data organization. Code Action addresses these issues while also introducing the challenges of a more complex action space and more difficult action organization. To leverage Code Action and tackle the challenges of its complexity, this paper proposes Policy and Action Dual-Control Agent (PoAct) for generalized applications. The aim is to achieve higher-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
