Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems

Shangbin Feng; Zifeng Wang; Palash Goyal; Yike Wang; Weijia Shi; Huang Xia; Hamid Palangi; Luke Zettlemoyer; Yulia Tsvetkov; Chen-Yu Lee; Tomas Pfister

arXiv:2502.04510·cs.CL·October 24, 2025

Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems

Shangbin Feng, Zifeng Wang, Palash Goyal, Yike Wang, Weijia Shi, Huang Xia, Hamid Palangi, Luke Zettlemoyer, Yulia Tsvetkov, Chen-Yu Lee, Tomas Pfister

PDF

Open Access 1 Video

TL;DR

Heterogeneous Swarms is an algorithm that jointly optimizes roles and weights of multiple language models in a system, represented as DAGs, to improve collaborative performance across various tasks.

Contribution

It introduces a novel joint optimization method for model roles and weights in multi-LLM systems using DAG representations and swarm intelligence techniques.

Findings

01

Outperforms 15 baselines by 18.5% on average across 12 tasks.

02

Discovers heterogeneous model roles and collaborative gains.

03

Benefits from diversity of language models.

Abstract

We propose Heterogeneous Swarms, an algorithm to design multi-LLM systems by jointly optimizing model roles and weights. We represent multi-LLM systems as directed acyclic graphs (DAGs) of LLMs with topological message passing for collaborative generation. Given a pool of LLM experts and a utility function, Heterogeneous Swarms employs two iterative steps: role-step and weight-step. For role-step, we interpret model roles as learning a DAG that specifies the flow of inputs and outputs between LLMs. Starting from a swarm of random continuous adjacency matrices, we decode them into discrete DAGs, call the LLMs in topological order, evaluate on the utility function (e.g. accuracy on a task), and optimize the adjacency matrices with particle swarm optimization based on the utility score. For weight-step, we assess the contribution of individual LLMs in the multi-LLM systems and optimize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems· slideslive

Taxonomy

TopicsSimulation Techniques and Applications