Towards Lifelong Learning of Large Language Models: A Survey

Junhao Zheng; Shengjie Qiu; Chengming Shi; Qianli Ma

arXiv:2406.06391·cs.LG·June 11, 2024·1 cites

Towards Lifelong Learning of Large Language Models: A Survey

Junhao Zheng, Shengjie Qiu, Chengming Shi, Qianli Ma

PDF

Open Access 1 Repo

TL;DR

This survey reviews strategies for enabling large language models to learn continuously over time, focusing on internal and external knowledge methods to improve adaptability and prevent forgetting.

Contribution

It introduces a new taxonomy of 12 lifelong learning scenarios and classifies existing techniques, highlighting emerging methods like model expansion and data selection.

Findings

01

Categorized lifelong learning into 12 scenarios.

02

Identified common techniques across scenarios.

03

Highlighted emerging methods such as model expansion.

Abstract

As the applications of large language models (LLMs) expand across diverse fields, the ability of these models to adapt to ongoing changes in data, tasks, and user preferences becomes crucial. Traditional training methods, relying on static datasets, are increasingly inadequate for coping with the dynamic nature of real-world information. Lifelong learning, also known as continual or incremental learning, addresses this challenge by enabling LLMs to learn continuously and adaptively over their operational lifetime, integrating new knowledge while retaining previously learned information and preventing catastrophic forgetting. This survey delves into the sophisticated landscape of lifelong learning, categorizing strategies into two primary groups: Internal Knowledge and External Knowledge. Internal Knowledge includes continual pretraining and continual finetuning, each enhancing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qianlima-lab/awesome-lifelong-learning-methods-for-llm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling