The Law of Multi-Model Collaboration: Scaling Limits of Model Ensembling for Large Language Models
Dakuan Lu, Jiaqi Zhang, Cheng Yuan, Jiawei Shao, Xuelong Li

TL;DR
This paper introduces a theoretical scaling law for multi-model LLM ensembles, showing they outperform single models and highlighting the importance of model diversity for performance gains.
Contribution
It proposes the Law of Multi-model Collaboration, a new scaling law predicting ensemble performance limits based on total parameter count and model diversity.
Findings
Multi-model ensembles follow a power-law scaling with parameters.
Ensembles outperform single models in performance and lower loss floors.
Heterogeneous model ensembles yield better scaling than homogeneous ones.
Abstract
Recent advances in large language models (LLMs) have been largely driven by scaling laws for individual models, which predict performance improvements as model parameters and data volume increase. However, the capabilities of any single LLM are inherently bounded. One solution originates from intricate interactions among multiple LLMs, rendering their collective performance surpasses that of any constituent model. Despite the rapid proliferation of multi-model integration techniques such as model routing and post-hoc ensembling, a unifying theoretical framework of performance scaling for multi-model collaboration remains absent. In this work, we propose the Law of Multi-model Collaboration, a scaling law that predicts the performance limits of LLM ensembles based on their aggregated parameter budget. To quantify the intrinsic upper bound of multi-model collaboration, we adopt a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Big Data and Digital Economy · Artificial Intelligence in Healthcare and Education
