A Scalable Multi-LLM Collaboration System with Retrieval-based Selection and Exploration-Exploitation-Driven Enhancement

Shengji Tang; Jianjian Cao; Weihao Lin; Jiale Hong; Bo Zhang; Shuyue Hu; Lei Bai; Tao Chen; Wanli Ouyang; Peng Ye

arXiv:2507.14200 · cs.CL · May 18, 2026

TL;DR

This paper introduces SMCS, a scalable system for multi-LLM collaboration that dynamically selects and enhances LLM responses, outperforming some closed-source models on multiple benchmarks.

Contribution

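The paper presents SMCS, a scalable multi-LLM collaboration system built from two modules: Retrieval-based Prior Selection (RPS), which selects the most suitable LLMs for each input, and Exploration-Exploitation-Driven Posterior Enhancement (EPE), which fosters response diversity and selects high-quality outputs.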

Findings

01

SMCS outperforms GPT-4.1 (+5.36%) and GPT-o3-mini (+5.28%) across multiple tasks.

02

Integrating 15 open-source LLMs improves performance.

03

SMCS exceeds the average of best results with open-source LLMs.

Abstract

Existing multi-LLM collaboration systems often encounter scalability challenges when integrating new LLMs and tasks, leading to suboptimal performance. To address this, we propose SMCS, a Scalable Multi-LLM Collaboration System designed to effectively coordinate multiple open-source LLMs. The system consists of two core components: a Retrieval-based Prior Selection (RPS) module, which dynamically selects the most suitable LLMs for each input, and an Exploration-Exploitation-Driven Posterior Enhancement (EPE) module, which fosters response diversity and selects high-quality outputs through a hybrid scoring mechanism. Experiments on eight mainstream benchmarks validate the effectiveness of our system: by integrating fifteen open-source LLMs, SMCS outperforms prevailing closed-source LLMs, e.g., GPT-4.1 (+5.36%) and GPT-o3-mini (+5.28%), across multiple tasks. Remarkably, it even exceeds the…
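
The two-stage flow described in the abstract (pick models before generation, then pick an answer after generation) can be illustrated with a small sketch. This is not the SMCS implementation: the abstract does not specify how retrieval or the hybrid score is computed, so everything here is an assumption for illustration, including the function names `select_models` and `hybrid_pick`, the support-set record layout, the cosine-similarity retrieval, and the equal weighting of answer agreement and an external quality score.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_models(query_vec, support, top_k_neighbors=2, n_models=2):
    """Prior selection (assumed form): retrieve the support records most
    similar to the query, then rank models by their summed scores on them.

    support: list of (embedding, {model_name: score}) records (hypothetical).
    """
    neighbors = sorted(
        support, key=lambda rec: cosine(query_vec, rec[0]), reverse=True
    )[:top_k_neighbors]
    totals = Counter()
    for _, scores in neighbors:
        for name, score in scores.items():
            totals[name] += score
    # Highest total first; ties broken alphabetically for determinism.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:n_models]]


def hybrid_pick(candidates, quality, weight=0.5):
    """Posterior selection (assumed form): mix answer agreement across
    candidates with a per-answer quality score in [0, 1].

    candidates: answers produced by the selected models (the diverse
    candidate pool stands in for the exploration step).
    quality: {answer: score} from some external scorer (hypothetical).
    """
    counts = Counter(candidates)
    total = len(candidates)

    def score(answer):
        agreement = counts[answer] / total
        return weight * agreement + (1 - weight) * quality.get(answer, 0.0)

    return max(sorted(counts), key=score)


# Toy data: 2-d embeddings and made-up per-model scores.
support = [
    ([1.0, 0.0], {"model_a": 0.9, "model_b": 0.2, "model_c": 0.5}),
    ([0.9, 0.1], {"model_a": 0.8, "model_b": 0.3, "model_c": 0.6}),
    ([0.0, 1.0], {"model_a": 0.1, "model_b": 0.9, "model_c": 0.4}),
]
chosen = select_models([1.0, 0.0], support)
answer = hybrid_pick(["42", "42", "41"], {"42": 0.6, "41": 0.7})
```

In this toy run the query is closest to the first two support records, so the models that scored well on those records are selected, and the answer that two of the three candidates agree on wins despite a slightly lower quality score. The real system may weight, retrieve, and score quite differently.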

Code & Models

Repositories

magent4aci/SMCS
github

Datasets

aisfuture/smcs_data
dataset· 1.1k dl