Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration

Yichong Huang; Xiaocheng Feng; Baohang Li; Yang Xiang; Hui Wang; Bing Qin; Ting Liu

arXiv:2404.12715·cs.CL·May 31, 2024·1 cites

TL;DR

DeePEn introduces a training-free ensemble framework that fuses heterogeneous large language models by mapping their probability distributions into a universal space, improving performance across various benchmarks without additional training.

Contribution

The paper proposes DeePEn, a novel distribution fusion method for heterogeneous LLMs that addresses token misalignment without extra training or reward models.

Findings

1. DeePEn improves performance on six diverse benchmarks.
2. Distribution fusion benefits both general LLMs and specialist models.
3. DeePEn complements existing ensemble methods like voting.

Abstract

Large language models (LLMs) exhibit complementary strengths in various tasks, motivating the research of LLM ensembling. However, existing work focuses on training an extra reward model or fusion model to select or combine all candidate answers, posing a great challenge to the generalization on unseen data distributions. Besides, prior methods use textual responses as communication media, ignoring the valuable information in the internal representations. In this work, we propose a training-free ensemble framework DeePEn, fusing the informative probability distributions yielded by different LLMs at each decoding step. Unfortunately, the vocabulary discrepancy between heterogeneous LLMs directly makes averaging the distributions unfeasible due to the token misalignment. To address this challenge, DeePEn maps the probability distribution of each model from its own probability space to a…
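The fusion idea described above can be illustrated with a toy sketch. Two models with different vocabulary sizes cannot have their next-token distributions averaged element by element, but once each distribution is projected into a common space of fixed dimension, a weighted average is well defined. Everything in this sketch is an assumption made for illustration: the vocabularies, the mapping weights, and the function names `project` and `fuse` are hypothetical, and the way DeePEn actually constructs its universal space is not given in this excerpt. The step that maps the fused vector back to one model's vocabulary for token selection is also left out.

```python
# Toy illustration only: all numbers and names below are hypothetical,
# not taken from the DeePEn paper or its repository.

def project(probs, mapping):
    """Map a distribution over a model's own vocabulary into a shared space.

    mapping[i][k] is the weight of vocabulary token i on shared dimension k.
    """
    dims = len(mapping[0])
    return [sum(p * row[k] for p, row in zip(probs, mapping)) for k in range(dims)]


def fuse(projected, weights=None):
    """Weighted average of equal-length vectors in the shared space."""
    n = len(projected)
    if weights is None:
        weights = [1.0 / n] * n  # uniform weights by default
    dims = len(projected[0])
    return [sum(w * vec[k] for w, vec in zip(weights, projected)) for k in range(dims)]


# Model A has a 3-token vocabulary, model B a 4-token vocabulary, so the two
# distributions have different lengths and cannot be averaged directly.
probs_a = [0.7, 0.2, 0.1]
probs_b = [0.1, 0.6, 0.2, 0.1]

# Hypothetical mappings into a 2-dimensional shared space. Each row sums to 1,
# so a projected distribution still sums to 1.
map_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
map_b = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0], [0.0, 1.0]]

shared_a = project(probs_a, map_a)
shared_b = project(probs_b, map_b)
fused = fuse([shared_a, shared_b])
print(fused)
```

In this toy setup the two projected vectors have the same length regardless of the original vocabulary sizes, which is the property that makes averaging possible. Passing non-uniform `weights` to `fuse` would favor one model over the other.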

Code & Models

Repositories

OrangeInSouth/DeePEn (PyTorch, official)

Videos

Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration (SlidesLive)

Taxonomy

Topics: Topic Modeling