S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency
Yuting Zeng, Weizhe Huang, Lei Jiang, Tongxuan Liu, Xitai Jin, Chen, Tianying Tiana, Jing Li, Xiaohua Xu

TL;DR
This paper introduces S$^2$-MAD, a sparsification strategy that significantly reduces token costs in multi-agent debate systems, maintaining high performance while improving efficiency for complex reasoning tasks in large language models.
Contribution
The paper proposes a novel sparsification method to cut token costs in multi-agent debate, enabling scalable and efficient reasoning in large language models.
Findings
Token costs reduced by up to 94.5% with minimal performance loss.
The approach maintains debate effectiveness across multiple datasets and models.
Efficiency improvements facilitate scalable multi-agent reasoning systems.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities across various natural language processing (NLP) scenarios, but they still face challenges when handling complex arithmetic and logical reasoning tasks. While Chain-Of-Thought (CoT) reasoning, self-consistency (SC) and self-correction strategies have attempted to guide models in sequential, multi-step reasoning, Multi-agent Debate (MAD) has emerged as a viable approach for enhancing the reasoning capabilities of LLMs. By increasing both the number of agents and the frequency of debates, the performance of LLMs improves significantly. However, this strategy results in a significant increase in token costs, presenting a barrier to scalability. To address this challenge, we introduce a novel sparsification strategy designed to reduce token costs within MAD. This approach minimizes ineffective exchanges of information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Complex Systems and Decision Making
