CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration
Xinming Hou, Mingming Yang, Wenxiang Jiao, Xing Wang, Zhaopeng Tu,, Wayne Xin Zhao

TL;DR
The paper introduces CoAct, a hierarchical framework for LLMs that mimics human-like global and local planning to improve performance on complex, long-horizon web tasks, especially when facing failures.
Contribution
CoAct is the first framework to incorporate hierarchical global-local planning and collaboration in LLM systems for complex task execution.
Findings
Outperforms baseline methods on WebArena benchmark
Re-arranges process trajectory effectively when facing failures
Achieves superior performance on long-horizon web tasks
Abstract
Existing LLMs exhibit remarkable performance on various NLP tasks, but still struggle with complex real-world tasks, even equipped with advanced strategies like CoT and ReAct. In this work, we propose the CoAct framework, which transfers the hierarchical planning and collaboration patterns in human society to LLM systems. Specifically, our CoAct framework involves two agents: (1) A global planning agent, to comprehend the problem scope, formulate macro-level plans and provide detailed sub-task descriptions to local execution agents, which serves as the initial rendition of a global plan. (2) A local execution agent, to operate within the multi-tier task execution structure, focusing on detailed execution and implementation of specific tasks within the global plan. Experimental results on the WebArena benchmark show that CoAct can re-arrange the process trajectory when facing failures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies
