PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Guozhi Yuan; Youfeng Liu; Jingli Yang; Wei Jia; Kai Lin; Yansong Gao,; Shan He; Zilin Ding; Haitao Li

arXiv:2501.07054·cs.AI·January 14, 2025

PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Guozhi Yuan, Youfeng Liu, Jingli Yang, Wei Jia, Kai Lin, Yansong Gao,, Shan He, Zilin Ding, Haitao Li

PDF

TL;DR

PoAct is a novel dual-control agent framework that dynamically switches reasoning policies and modifies action spaces, significantly improving complex reasoning and code action quality in LLM-driven agents.

Contribution

This paper introduces PoAct, a dual-control agent that enhances reasoning accuracy and code actions by managing complex action spaces and policy switching, addressing limitations of existing frameworks.

Findings

01

20% improvement on LegalAgentBench

02

Reduces token consumption in complex tasks

03

Demonstrates scalability with GPT-4o and GLM-4 models

Abstract

Based on their superior comprehension and reasoning capabilities, Large Language Model (LLM) driven agent frameworks have achieved significant success in numerous complex reasoning tasks. ReAct-like agents can solve various intricate problems step-by-step through progressive planning and tool calls, iteratively optimizing new steps based on environmental feedback. However, as the planning capabilities of LLMs improve, the actions invoked by tool calls in ReAct-like frameworks often misalign with complex planning and challenging data organization. Code Action addresses these issues while also introducing the challenges of a more complex action space and more difficult action organization. To leverage Code Action and tackle the challenges of its complexity, this paper proposes Policy and Action Dual-Control Agent (PoAct) for generalized applications. The aim is to achieve higher-quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.