Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration

Hai Ye; Mingbao Lin; Hwee Tou Ng; Shuicheng Yan

arXiv:2412.17061·cs.CL·May 20, 2025

Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration

Hai Ye, Mingbao Lin, Hwee Tou Ng, Shuicheng Yan

PDF

Open Access 1 Repo

TL;DR

This paper introduces Tree Search-based Orchestrated Agents (TOA), a dynamic multi-agent sampling method that improves data synthesis efficiency and performance by optimizing model collaboration through Monte Carlo Tree Search.

Contribution

It presents a novel multi-step decision-making framework for multi-agent coordination using MCTS, significantly enhancing inference compute scaling and synthesis quality.

Findings

01

Outperforms single-agent sampling in multiple tasks.

02

Achieves state-of-the-art results on WMT and AlpacaEval.

03

Surpasses existing preference learning methods on benchmarks.

Abstract

Scaling laws for inference compute in multi-agent systems remain under-explored compared to single-agent scenarios. This work aims to bridge this gap by investigating the problem of data synthesis through multi-agent sampling, where synthetic responses are generated by sampling from multiple distinct language models. Effective model coordination is crucial for successful multi-agent collaboration. Unlike previous approaches that rely on fixed workflows, we treat model coordination as a multi-step decision-making process, optimizing generation structures dynamically for each input question. We introduce Tree Search-based Orchestrated Agents~(TOA), where the workflow evolves iteratively during the sequential sampling process. To achieve this, we leverage Monte Carlo Tree Search (MCTS), integrating a reward model to provide real-time feedback and accelerate exploration. Our experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oceanypt/TOA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management