I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval
Qian Dong, Yiding Liu, Qingyao Ai, Haitao Li, Shuaiqiang, Wang, Yiqun Liu, Dawei Yin, Shaoping Ma

TL;DR
The paper introduces I^3 Retriever, a passage retrieval model that incorporates implicit interaction via pseudo-queries, achieving a balance of high effectiveness and efficiency suitable for large-scale applications.
Contribution
It proposes a novel implicit interaction paradigm using pseudo-queries in dual-encoders, enabling end-to-end training and efficient inference without extensive computation.
Findings
Outperforms existing methods on MSMARCO and TREC2019 datasets.
Maintains efficiency comparable to vanilla dual encoders.
Compatible with pre-training and knowledge distillation techniques.
Abstract
Passage retrieval is a fundamental task in many information systems, such as web search and question answering, where both efficiency and effectiveness are critical concerns. In recent years, neural retrievers based on pre-trained language models (PLM), such as dual-encoders, have achieved huge success. Yet, studies have found that the performance of dual-encoders are often limited due to the neglecting of the interaction information between queries and candidate passages. Therefore, various interaction paradigms have been proposed to improve the performance of vanilla dual-encoders. Particularly, recent state-of-the-art methods often introduce late-interaction during the model inference process. However, such late-interaction based methods usually bring extensive computation and storage cost on large corpus. Despite their effectiveness, the concern of efficiency and space footprint is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
