Retrieval-augmented Prompt Learning for Pre-trained Foundation Models

Xiang Chen; Yixin Ou; Quan Feng; Lei Li; Piji Li; Haibo Ye; Sheng-Jun Huang; Shuofei Qiao; Shumin Deng; Huajun Chen; Ningyu Zhang

arXiv:2512.20145·cs.CL·December 24, 2025

Retrieval-augmented Prompt Learning for Pre-trained Foundation Models

Xiang Chen, Yixin Ou, Quan Feng, Lei Li, Piji Li, Haibo Ye, Sheng-Jun Huang, Shuofei Qiao, Shumin Deng, Huajun Chen, Ningyu Zhang

PDF

Open Access

TL;DR

RetroPrompt introduces a retrieval-augmented prompt learning method for pre-trained foundation models, improving generalization and reducing overfitting by leveraging external knowledge bases during training and inference.

Contribution

It proposes a novel retrieval-augmented prompt learning framework that decouples knowledge from memorization, enhancing performance in few-shot and zero-shot tasks.

Findings

01

Outperforms traditional prompt learning in various NLP and vision tasks

02

Reduces reliance on rote memorization, improving generalization

03

Effective in both zero-shot and few-shot scenarios

Abstract

The pre-trained foundation models (PFMs) have become essential for facilitating large-scale multimodal learning. Researchers have effectively employed the ``pre-train, prompt, and predict'' paradigm through prompt learning to induce improved few-shot performance. However, prompt learning approaches for PFMs still follow a parametric learning paradigm. As such, the stability of generalization in memorization and rote learning can be compromised. More specifically, conventional prompt learning might face difficulties in fully utilizing atypical instances and avoiding overfitting to shallow patterns with limited data during the process of fully-supervised training. To overcome these constraints, we present our approach, named RetroPrompt, which aims to achieve a balance between memorization and generalization by decoupling knowledge from mere memorization. Unlike traditional prompting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications