Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration
Yichong Huang, Xiaocheng Feng, Baohang Li, Yang Xiang, Hui Wang, Bing, Qin, Ting Liu

TL;DR
DeePEn introduces a training-free ensemble framework that fuses heterogeneous large language models by mapping their probability distributions into a universal space, improving performance across various benchmarks without additional training.
Contribution
The paper proposes DeePEn, a novel distribution fusion method for heterogeneous LLMs that addresses token misalignment without extra training or reward models.
Findings
DeePEn improves performance on six diverse benchmarks.
Distribution fusion benefits both general LLMs and specialist models.
DeePEn complements existing ensemble methods like voting.
Abstract
Large language models (LLMs) exhibit complementary strengths in various tasks, motivating the research of LLM ensembling. However, existing work focuses on training an extra reward model or fusion model to select or combine all candidate answers, posing a great challenge to the generalization on unseen data distributions. Besides, prior methods use textual responses as communication media, ignoring the valuable information in the internal representations. In this work, we propose a training-free ensemble framework DeePEn, fusing the informative probability distributions yielded by different LLMs at each decoding step. Unfortunately, the vocabulary discrepancy between heterogeneous LLMs directly makes averaging the distributions unfeasible due to the token misalignment. To address this challenge, DeePEn maps the probability distribution of each model from its own probability space to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling
