PromptDSI: Prompt-based Rehearsal-free Continual Learning for Document Retrieval
Tuan-Luc Huynh, Thuy-Trang Vu, Weiqing Wang, Yinwei Wei, Trung Le, Dragan Gasevic, Yuan-Fang Li, Thanh-Toan Do

TL;DR
PromptDSI introduces a prompt-based, rehearsal-free continual learning method for document retrieval that efficiently updates models without re-training or accessing previous data, outperforming existing methods in challenging scenarios.
Contribution
The paper proposes PromptDSI, a novel prompt-based continual learning approach that eliminates the need for rehearsal buffers and improves efficiency and stability in document retrieval tasks.
Findings
PromptDSI outperforms rehearsal-based baselines in rehearsal-free setups.
PromptDSI matches cache-based baselines in mitigating forgetting.
Significantly improves retrieval performance on new document corpora.
Abstract
Differentiable Search Index (DSI) utilizes pre-trained language models to perform indexing and document retrieval via end-to-end learning without relying on external indexes. However, DSI requires full re-training to index new documents, causing significant computational inefficiencies. Continual learning (CL) offers a solution by enabling the model to incrementally update without full re-training. Existing CL solutions in document retrieval rely on memory buffers or generative models for rehearsal, which is infeasible when accessing previous training data is restricted due to privacy concerns. To this end, we introduce PromptDSI, a prompt-based, rehearsal-free continual learning approach for document retrieval. PromptDSI follows the Prompt-based Continual Learning (PCL) framework, using learnable prompts to efficiently index new documents without accessing previous documents or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Recommender Systems and Techniques
