Countering Catastrophic Forgetting of Large Language Models for Better Instruction Following via Weight-Space Model Merging

Mengxian Lyu; Cheng Peng; Ziyi Chen; Mengyuan Zhang; Jieting Li Lu; Yonghui Wu

arXiv:2604.01538·cs.CL·April 3, 2026

Countering Catastrophic Forgetting of Large Language Models for Better Instruction Following via Weight-Space Model Merging

Mengxian Lyu, Cheng Peng, Ziyi Chen, Mengyuan Zhang, Jieting Li Lu, Yonghui Wu

PDF

TL;DR

This paper introduces a weight-space model merging method to adapt large language models to the medical domain, effectively mitigating catastrophic forgetting and maintaining instruction-following capabilities.

Contribution

The study proposes a novel model merging framework that combines general and clinical LLMs, improving domain adaptation efficiency and performance in clinical tasks.

Findings

01

Merged models outperform baseline fine-tuning on medical benchmarks.

02

Model merging preserves instruction-following ability while adapting to clinical data.

03

Training efficiency is improved, matching fully fine-tuned models with less supervision.

Abstract

Large language models have been adopted in the medical domain for clinical documentation to reduce clinician burden. However, studies have reported that LLMs often "forget" a significant amount of instruction-following ability when fine-tuned using a task-specific medical dataset, a critical challenge in adopting general-purpose LLMs for clinical applications. This study presents a model merging framework to efficiently adapt general-purpose LLMs to the medical domain by countering this forgetting issue. By merging a clinical foundation model (GatorTronLlama) with a general instruct model (Llama-3.1-8B-Instruct) via interpolation-based merge methods, we seek to derive a domain-adapted model with strong performance on clinical tasks while retaining instruction-following ability. Comprehensive evaluation across medical benchmarks and five clinical generation tasks (e.g., radiology and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.