Continual Learning in Large Language Models: Methods, Challenges, and Opportunities
Hongyang Chen, Zhongwu Sun, Hongfei Ye, Kunchi Li, Xuemin Lin

TL;DR
This paper surveys methods for continual learning in large language models, discussing challenges, evaluation metrics, and future opportunities to improve knowledge retention and adaptation across tasks.
Contribution
It provides a comprehensive taxonomy, comparative analysis, and framework for understanding continual learning approaches tailored for LLMs, highlighting key distinctions from traditional ML.
Findings
Current methods show promise in specific domains
Fundamental challenges remain in knowledge integration
Evaluation metrics include forgetting rates and transfer efficiency
Abstract
Continual learning (CL) has emerged as a pivotal paradigm to enable large language models (LLMs) to dynamically adapt to evolving knowledge and sequential tasks while mitigating catastrophic forgetting-a critical limitation of the static pre-training paradigm inherent to modern LLMs. This survey presents a comprehensive overview of CL methodologies tailored for LLMs, structured around three core training stages: continual pre-training, continual fine-tuning, and continual alignment.Beyond the canonical taxonomy of rehearsal-, regularization-, and architecture-based methods, we further subdivide each category by its distinct forgetting mitigation mechanisms and conduct a rigorous comparative analysis of the adaptability and critical improvements of traditional CL methods for LLMs. In doing so, we explicitly highlight core distinctions between LLM CL and traditional machine learning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education · Topic Modeling
