Class-Prototype Conditional Diffusion Model with Gradient Projection for   Continual Learning

Khanh Doan; Quyen Tran; Tung Lam Tran; Tuan Nguyen; Dinh Phung; Trung; Le

arXiv:2312.06710·cs.LG·March 25, 2024·1 cites

Class-Prototype Conditional Diffusion Model with Gradient Projection for Continual Learning

Khanh Doan, Quyen Tran, Tung Lam Tran, Tuan Nguyen, Dinh Phung, Trung, Le

PDF

Open Access

TL;DR

This paper introduces GPPDM, a continual learning method using class prototypes and gradient projection in diffusion models to improve image quality and reduce catastrophic forgetting.

Contribution

The paper proposes a novel GPPDM approach that incorporates class prototypes and gradient projection to enhance diffusion model performance in continual learning.

Findings

01

Significantly outperforms existing methods in preserving image quality.

02

Effectively reduces catastrophic forgetting across multiple datasets.

03

Maintains high fidelity of generated images for old tasks.

Abstract

Mitigating catastrophic forgetting is a key hurdle in continual learning. Deep Generative Replay (GR) provides techniques focused on generating samples from prior tasks to enhance the model's memory capabilities using generative AI models ranging from Generative Adversarial Networks (GANs) to the more recent Diffusion Models (DMs). A major issue is the deterioration in the quality of generated data compared to the original, as the generator continuously self-learns from its outputs. This degradation can lead to the potential risk of catastrophic forgetting (CF) occurring in the classifier. To address this, we propose the Gradient Projection Class-Prototype Conditional Diffusion Model (GPPDM), a GR-based approach for continual learning that enhances image quality in generators and thus reduces the CF in classifiers. The cornerstone of GPPDM is a learnable class prototype that captures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsDiffusion