KIF: Knowledge Identification and Fusion for Language Model Continual Learning
Yujie Feng, Xu Chu, Yongxin Xu, Zexin Lu, Bo Liu, Philip S. Yu,, Xiao-Ming Wu

TL;DR
KIF is a novel continual learning framework for language models that enhances knowledge transfer and retention without relying on memory replay, using skill units and a fine-grained knowledge fusion strategy.
Contribution
KIF introduces a new method for language model continual learning that segregates skill units and fuses knowledge based on importance, improving transfer and retention without memory replay.
Findings
KIF outperforms existing methods on CL benchmarks.
It effectively balances knowledge retention and transfer.
Demonstrates strong generalizability across models and methods.
Abstract
Language model continual learning (CL) has recently attracted significant interest for its ability to adapt large language models (LLMs) to dynamic real-world scenarios without retraining. A major challenge in this domain is catastrophic forgetting, where models lose previously acquired knowledge upon learning new tasks. Existing approaches commonly utilize multiple parameter-efficient fine-tuning (PEFT) blocks to acquire task-specific knowledge, yet these methods are inefficient and fail to leverage potential knowledge transfer across tasks. In this paper, we introduce a novel CL framework for language models, named Knowledge Identification and Fusion (KIF), which boosts knowledge transfer without depending on memory replay. KIF initially segregates the model into 'skill units' based on parameter dependencies, allowing for more precise control. Subsequently, it employs a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Interpreting and Communication in Healthcare · Speech and dialogue systems
MethodsBalanced Selection
