Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization

Costas Mavromatis; Petros Karypis; George Karypis

arXiv:2404.11531 · cs.CL · April 18, 2024 · 2 cites

TL;DR

This paper introduces PackLLM, a test-time LLM fusion method that optimizes model importance weights based on perplexity to improve task performance, outperforming existing fusion techniques.

Contribution

The paper proposes a novel test-time fusion approach for LLMs using perplexity-based importance weighting, enabling effective integration of arbitrary models during inference.

Findings

01 Perplexity over the input prompt is an effective indicator of each LLM's expertise.

02 PackLLM outperforms baseline fusion methods by 1.89% accuracy points.

03 PackLLM can leverage new LLMs to significantly boost performance.
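
For reference, the findings above rest on prompt perplexity. The standard definition is given below for a prompt of T tokens; the symbols (x_t for tokens, p_k for the k-th LLM's next-token distribution) are notation assumed here for illustration and are not taken from the paper.

```latex
\mathrm{PPL}_k(x_{1:T}) \;=\; \exp\!\left( -\frac{1}{T} \sum_{t=1}^{T} \log p_k\!\left(x_t \mid x_{<t}\right) \right)
```

A lower value means the model assigns higher probability to the prompt, which is the sense in which perplexity can act as a proxy for expertise on that input.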

Abstract

Fusing knowledge from multiple Large Language Models (LLMs) can combine their diverse strengths to achieve improved performance on a given task. However, current fusion approaches either rely on learning-based fusers that do not generalize to new LLMs, or do not take into account how well each LLM understands the input. In this work, we study LLM fusion at test-time, which enables leveraging knowledge from arbitrary user-specified LLMs during inference. We introduce Pack of LLMs (PackLLM), an effective method for test-time fusion that leverages each LLM's expertise, given an input prompt. PackLLM performs model fusion by solving an optimization problem for determining each LLM's importance, so that perplexity over the input prompt is minimized. First, our simple PackLLM-sim variant validates that perplexity is a good indicator for measuring each LLM's expertise. Second, our PackLLM-opt…
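
The idea of weighting each LLM by its prompt perplexity can be illustrated with a minimal sketch. This is not the authors' implementation: the softmax over negative log-perplexity, the `temperature` parameter, the function names, and the toy numbers are all assumptions made for illustration, and the sketch further assumes the models share a vocabulary so that their next-token logits can be summed directly. The abstract describes PackLLM as solving an optimization problem for the weights, which this simple heuristic does not do.

```python
import math


def perplexity(token_logprobs):
    """Perplexity of one model over the prompt, from per-token natural-log probabilities."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def importance_weights(perplexities, temperature=1.0):
    """Assumed heuristic: softmax over negative log-perplexity.

    Lower perplexity yields a larger weight; weights sum to 1.
    """
    scores = [-math.log(p) / temperature for p in perplexities]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_logits(per_model_logits, weights):
    """Weighted sum of next-token logits over a shared vocabulary (assumption)."""
    vocab_size = len(per_model_logits[0])
    return [
        sum(w * logits[i] for w, logits in zip(weights, per_model_logits))
        for i in range(vocab_size)
    ]


# Hypothetical per-token log-probabilities that two models assign to the same prompt.
prompt_logprobs = {
    "model_a": [-0.2, -0.4, -0.3],  # confident on this prompt
    "model_b": [-1.5, -2.0, -1.0],  # less confident
}
ppls = [perplexity(lp) for lp in prompt_logprobs.values()]
weights = importance_weights(ppls)

# Hypothetical next-token logits from each model over a 3-token vocabulary.
next_token_logits = [[2.0, 0.5, -1.0], [0.0, 1.5, 0.5]]
fused = fuse_logits(next_token_logits, weights)
next_token = max(range(len(fused)), key=fused.__getitem__)
```

In this toy setup the model with the lower prompt perplexity receives the larger weight, so the fused prediction follows its preferred token. Because the weights depend only on the prompt, a newly added model can be included at inference time without training a fuser.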

Code & Models

Repositories

cmavro/packllm (PyTorch, official)
