Forging a Dynamic Memory: Retrieval-Guided Continual Learning for Generalist Medical Foundation Models
Zizhi Chen, Yizhen Gao, Minghao Han, Yizhou Liu, Zhaoyu Chen, Dingkang Yang, Lihua Zhang

TL;DR
This paper introduces a retrieval-guided continual learning framework for multimodal medical models, utilizing a large medical retrieval database and dynamic knowledge distillation to improve adaptation, retention, and real-time learning of complex medical tasks.
Contribution
It pioneers the integration of retrieval-augmented generation with dynamic knowledge distillation in continual learning for medical models, addressing domain gaps and intra-modality feature preservation.
Findings
Achieves state-of-the-art performance on the MGTIL benchmark.
Effectively balances domain adaptation and feature retention.
Demonstrates improved real-time learning of complex medical tasks.
Abstract
Multimodal biomedical Vision-Language Models (VLMs) exhibit immense potential in the field of Continual Learning (CL). However, they confront a core dilemma: how to preserve fine-grained intra-modality features while bridging the significant domain gap across different modalities. To address this challenge, we propose a comprehensive framework. Leveraging our 18-million multimodal and comprehensive medical retrieval database derived from PubMed scientific papers, we pioneer the integration of Retrieval-Augmented Generation (RAG) into CL. Specifically, we employ a multi-modal, multi-layer RAG system that provides real-time guidance for model fine-tuning through dynamic, on-demand knowledge retrieval. Building upon this, we introduce a dynamic knowledge distillation framework. This framework precisely resolves the aforementioned core dilemma by dynamically modulating the importance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
