MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Jessica Echterhoff; Fartash Faghri; Raviteja Vemulapalli; Ting-Yao Hu,; Chun-Liang Li; Oncel Tuzel; Hadi Pouransari

arXiv:2407.09435·cs.AI·October 7, 2024

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Jessica Echterhoff, Fartash Faghri, Raviteja Vemulapalli, Ting-Yao Hu,, Chun-Liang Li, Oncel Tuzel, Hadi Pouransari

PDF

Open Access 1 Video

TL;DR

This paper introduces MUSCLE, a strategy for updating large language models that minimizes performance regressions on individual instances, thereby maintaining user trust and model consistency across updates.

Contribution

The paper proposes a novel training method involving a compatibility adapter to reduce instance-level regressions during LLM updates, improving model stability.

Findings

01

Negative flips reduced by up to 40% with the proposed method

02

Model update regressions occur even with identical training procedures

03

Evaluation metrics tailored for generative and discriminative tasks

Abstract

Large Language Models (LLMs) are regularly updated to enhance performance, typically through changes in data or architecture. Within the update process, developers often prioritize improving overall performance metrics, paying less attention to maintaining compatibility with earlier model versions. Instance-level degradation (instance regression) of performance from one model version to the next can interfere with a user's mental model of the capabilities of a particular language model. Users having to adapt their mental model with every update can lead to dissatisfaction, especially when the new model has degraded compared to a prior version for a known use case (model update regression). We find that when pretrained LLM base models are updated, fine-tuned user-facing downstream task adapters experience negative flips -- previously correct instances are now predicted incorrectly. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MUSCLE: A Model Update Strategy for Compatible LLM Evolution· underline

Taxonomy

TopicsDigital Rights Management and Security · Natural Language Processing Techniques

MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training · Adapter · Focus · Balanced Selection · LLaMA