Augmenting Continual Learning of Diseases with LLM-Generated Visual Concepts

Jiantao Tan; Peixian Ma; Kanghao Chen; Zhiming Dai; Ruixuan Wang

arXiv:2508.03094·cs.CV·August 6, 2025

Augmenting Continual Learning of Diseases with LLM-Generated Visual Concepts

Jiantao Tan, Peixian Ma, Kanghao Chen, Zhiming Dai, Ruixuan Wang

PDF

TL;DR

This paper introduces a novel continual learning framework for medical image classification that leverages LLM-generated visual concepts and cross-modal attention to improve semantic understanding and performance.

Contribution

It proposes a dynamic visual concept pool and a cross-modal attention module to enhance continual learning with richer semantic guidance.

Findings

01

Achieves state-of-the-art performance on medical and natural image datasets.

02

Effectively leverages LLM-generated visual concepts for improved classification.

03

Demonstrates the superiority of semantic-guided continual learning methods.

Abstract

Continual learning is essential for medical image classification systems to adapt to dynamically evolving clinical environments. The integration of multimodal information can significantly enhance continual learning of image classes. However, while existing approaches do utilize textual modality information, they solely rely on simplistic templates with a class name, thereby neglecting richer semantic information. To address these limitations, we propose a novel framework that harnesses visual concepts generated by large language models (LLMs) as discriminative semantic guidance. Our method dynamically constructs a visual concept pool with a similarity-based filtering mechanism to prevent redundancy. Then, to integrate the concepts into the continual learning process, we employ a cross-modal image-concept attention module, coupled with an attention loss. Through attention, the module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.