MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Jessica Echterhoff, Fartash Faghri, Raviteja Vemulapalli, Ting-Yao Hu,, Chun-Liang Li, Oncel Tuzel, Hadi Pouransari

TL;DR
This paper introduces MUSCLE, a strategy for updating large language models that minimizes performance regressions on individual instances, thereby maintaining user trust and model consistency across updates.
Contribution
The paper proposes a novel training method involving a compatibility adapter to reduce instance-level regressions during LLM updates, improving model stability.
Findings
Negative flips reduced by up to 40% with the proposed method
Model update regressions occur even with identical training procedures
Evaluation metrics tailored for generative and discriminative tasks
Abstract
Large Language Models (LLMs) are regularly updated to enhance performance, typically through changes in data or architecture. Within the update process, developers often prioritize improving overall performance metrics, paying less attention to maintaining compatibility with earlier model versions. Instance-level degradation (instance regression) of performance from one model version to the next can interfere with a user's mental model of the capabilities of a particular language model. Users having to adapt their mental model with every update can lead to dissatisfaction, especially when the new model has degraded compared to a prior version for a known use case (model update regression). We find that when pretrained LLM base models are updated, fine-tuned user-facing downstream task adapters experience negative flips -- previously correct instances are now predicted incorrectly. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDigital Rights Management and Security · Natural Language Processing Techniques
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training · Adapter · Focus · Balanced Selection · LLaMA
