TL;DR
This paper provides a comprehensive survey of continual learning techniques for generative AI models, categorizing methods and analyzing setups to address the challenge of catastrophic forgetting in evolving AI systems.
Contribution
It systematically reviews and categorizes continual learning approaches for various generative models, offering insights into methodologies, benchmarks, and future directions.
Findings
Categorizes approaches into architecture, regularization, and replay-based methods.
Analyzes training objectives and benchmarks for different generative models.
Provides a systematic framework for understanding continual learning in generative AI.
Abstract
The rapid advancement of generative models has empowered modern AI systems to comprehend and produce highly sophisticated content, even achieving human-level performance in specific domains. However, these models are fundamentally constrained by \emph{catastrophic forgetting}, \ie~a persistent challenge where models experience performance degradation on previously learned tasks when adapting to new tasks. To address this practical limitation, numerous approaches have been proposed to enhance the adaptability and scalability of generative AI in real-world applications. In this work, we present a comprehensive survey of continual learning methods for mainstream generative AI models, encompassing large language models, multimodal large language models, vision-language-action models, and diffusion models. Drawing inspiration from the memory mechanisms of the human brain, we systematically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
