Cool-Fusion: Fuse Large Language Models without Training

Cong Liu; Xiaojun Quan; Yan Pan; Liang Lin; Weigang Wu; Xu Chen

arXiv:2407.19807·cs.CL·June 10, 2025

Cool-Fusion: Fuse Large Language Models without Training

Cong Liu, Xiaojun Quan, Yan Pan, Liang Lin, Weigang Wu, Xu Chen

PDF

Open Access

TL;DR

Cool-Fusion presents a training-free method to fuse multiple large language models by ensemble reranking, significantly improving accuracy on benchmarks like GSM8K without the need for fine-tuning.

Contribution

It introduces Cool-Fusion, a novel approach to fuse heterogeneous LLMs without training, addressing vocabulary discrepancies through text-level ensembling and reranking.

Findings

01

Increases GSM8K accuracy by 17.4%

02

Applicable to models with different vocabularies

03

No training required for fusion

Abstract

We focus on the problem of fusing two or more heterogeneous large language models (LLMs) to leverage their complementary strengths. One of the challenges of model fusion is high computational load, specifically in fine-tuning or aligning vocabularies. To address this, we propose Cool-Fusion, a simple yet effective approach that fuses the knowledge of source LLMs, which does not require training. Unlike ensemble methods, Cool-Fusion is applicable to any set of source LLMs that have different vocabularies. To overcome the vocabulary discrepancies among LLMs, we ensemble LLMs on text level, allowing them to rerank the generated texts by each other with different granularities. Extensive experiments have been conducted across a variety of benchmark datasets. On GSM8K, Cool-Fusion increases accuracy from three strong source LLMs by a significant margin of 17.4\%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsSparse Evolutionary Training · Focus · ALIGN