CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

Xinming Hou; Mingming Yang; Wenxiang Jiao; Xing Wang; Zhaopeng Tu,; Wayne Xin Zhao

arXiv:2406.13381·cs.CL·June 21, 2024·1 cites

CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

Xinming Hou, Mingming Yang, Wenxiang Jiao, Xing Wang, Zhaopeng Tu,, Wayne Xin Zhao

PDF

Open Access 2 Repos

TL;DR

The paper introduces CoAct, a hierarchical framework for LLMs that mimics human-like global and local planning to improve performance on complex, long-horizon web tasks, especially when facing failures.

Contribution

CoAct is the first framework to incorporate hierarchical global-local planning and collaboration in LLM systems for complex task execution.

Findings

01

Outperforms baseline methods on WebArena benchmark

02

Re-arranges process trajectory effectively when facing failures

03

Achieves superior performance on long-horizon web tasks

Abstract

Existing LLMs exhibit remarkable performance on various NLP tasks, but still struggle with complex real-world tasks, even equipped with advanced strategies like CoT and ReAct. In this work, we propose the CoAct framework, which transfers the hierarchical planning and collaboration patterns in human society to LLM systems. Specifically, our CoAct framework involves two agents: (1) A global planning agent, to comprehend the problem scope, formulate macro-level plans and provide detailed sub-task descriptions to local execution agents, which serves as the initial rendition of a global plan. (2) A local execution agent, to operate within the multi-tier task execution structure, focusing on detailed execution and implementation of specific tasks within the global plan. Experimental results on the WebArena benchmark show that CoAct can re-arrange the process trajectory when facing failures,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies