MASTER: A Multi-Agent System with LLM Specialized MCTS

Bingzheng Gan; Yufan Zhao; Tianyi Zhang; Jing Huang; Yusu Li; Shu Xian; Teo; Changwang Zhang; Wei Shi

arXiv:2501.14304·cs.AI·February 5, 2025

MASTER: A Multi-Agent System with LLM Specialized MCTS

Bingzheng Gan, Yufan Zhao, Tianyi Zhang, Jing Huang, Yusu Li, Shu Xian, Teo, Changwang Zhang, Wei Shi

PDF

Open Access 1 Video

TL;DR

MASTER introduces a multi-agent system leveraging specialized LLM-based MCTS to improve strategic planning and efficiency in complex tasks, achieving state-of-the-art results in question answering benchmarks.

Contribution

The paper proposes a novel multi-agent framework with specialized LLM-based MCTS that dynamically adjusts agent recruitment and communication, addressing sampling inefficiencies in reward estimation.

Findings

01

Achieves 76% accuracy on HotpotQA

02

Achieves 80% accuracy on WebShop

03

Sets new state-of-the-art performance

Abstract

Large Language Models (LLM) are increasingly being explored for problem-solving tasks. However, their strategic planning capability is often viewed with skepticism. Recent studies have incorporated the Monte Carlo Tree Search (MCTS) algorithm to augment the planning capacity of LLM. Despite its potential, MCTS relies on extensive sampling simulations to approximate the true reward distribution, which leads to two primary issues. Firstly, MCTS is effective for tasks like the Game of Go, where simulation results can yield objective rewards (e.g., 1 for a win and 0 for a loss). However, for tasks such as question answering, the result of a simulation is the answer to the question, which cannot yield an objective reward without the ground truth. Secondly, obtaining statistically significant reward estimations typically requires a sample size exceeding 30 simulations, resulting in excessive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MASTER: A Multi-Agent System with LLM Specialized MCTS· underline

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies