Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers
Weng Lam Tam, Xiao Liu, Kaixuan Ji, Lilong Xue, Xingjian Zhang, Yuxiao, Dong, Jiahua Liu, Maodi Hu, Jie Tang

TL;DR
This paper proposes a parameter-efficient prompt tuning method for neural text retrievers that enhances out-of-domain zero-shot generalization and mitigates parameter inefficiency, outperforming traditional fine-tuning methods.
Contribution
It introduces a novel prompt tuning approach for text retrieval that significantly improves cross-domain and cross-topic generalization with minimal parameter updates.
Findings
Prompt tuning improves out-of-domain zero-shot retrieval performance.
Updating only 0.1% of parameters outperforms full fine-tuning.
Curated a large cross-topic retrieval dataset with 87 topics.
Abstract
Prompt tuning attempts to update few task-specific parameters in pre-trained models. It has achieved comparable performance to fine-tuning of the full parameter set on both language understanding and generation tasks. In this work, we study the problem of prompt tuning for neural text retrievers. We introduce parameter-efficient prompt tuning for text retrieval across in-domain, cross-domain, and cross-topic settings. Through an extensive analysis, we show that the strategy can mitigate the two issues -- parameter-inefficiency and weak generalizability -- faced by fine-tuning based retrieval methods. Notably, it can significantly improve the out-of-domain zero-shot generalization of the retrieval models. By updating only 0.1% of the model parameters, the prompt tuning strategy can help retrieval models achieve better generalization performance than traditional methods in which all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
