Cooperative Strategic Planning Enhances Reasoning Capabilities in Large   Language Models

Danqing Wang; Zhuorui Ye; Fei Fang; Lei Li

arXiv:2410.20007·cs.AI·October 29, 2024

Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models

Danqing Wang, Zhuorui Ye, Fei Fang, Lei Li

PDF

Open Access

TL;DR

This paper introduces CoPlanner, a cooperative multi-agent framework with distinct planning and reasoning agents, significantly improving reasoning performance in large language models on complex tasks.

Contribution

The paper presents a novel multi-agent reasoning framework that separates planning and reasoning, training the planning agent via reinforcement learning for enhanced cooperation.

Findings

01

Outperforms previous methods by 9.94% on LogiQA

02

Achieves 3.09% improvement on BBH

03

Demonstrates effective cooperation boosts multi-step reasoning

Abstract

Enhancing the reasoning capabilities of large language models (LLMs) is crucial for enabling them to tackle complex, multi-step problems. Multi-agent frameworks have shown great potential in enhancing LLMs' reasoning capabilities. However, the lack of effective cooperation between LLM agents hinders their performance, especially for multi-step reasoning tasks. This paper proposes a novel cooperative multi-agent reasoning framework (CoPlanner) by separating reasoning steps and assigning distinct duties to different agents. CoPlanner consists of two LLM agents: a planning agent and a reasoning agent. The planning agent provides high-level strategic hints, while the reasoning agent follows these hints and infers answers. By training the planning agent's policy through the interactive reasoning process via Proximal Policy Optimization (PPO), the LLaMA-3-8B-based CoPlanner outperforms the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling