Beyond Precision: A Study on Recall of Initial Retrieval with Neural Representations
Yan Xiao, Jiafeng Guo, Yixing Fan, Yanyan Lan, Jun Xu, and Xueqi Cheng

TL;DR
This paper investigates how neural representations can enhance the recall of initial document retrieval in information retrieval systems, addressing vocabulary mismatch issues and proposing hybrid search schemes to improve initial recall with minimal overhead.
Contribution
It introduces a neural index and two hybrid search schemes that combine neural and symbolic indices to improve initial retrieval recall in IR systems.
Findings
Hybrid schemes improve recall with small overhead
Neural index enhances initial retrieval effectiveness
Parallel and sequential search schemes outperform traditional methods
Abstract
Vocabulary mismatch is a central problem in information retrieval (IR), i.e., the relevant documents may not contain the same (symbolic) terms of the query. Recently, neural representations have shown great success in capturing semantic relatedness, leading to new possibilities to alleviate the vocabulary mismatch problem in IR. However, most existing efforts in this direction have been devoted to the re-ranking stage. That is to leverage neural representations to help re-rank a set of candidate documents, which are typically obtained from an initial retrieval stage based on some symbolic index and search scheme (e.g., BM25 over the inverted index). This naturally raises a question: if the relevant documents have not been found in the initial retrieval stage due to vocabulary mismatch, there would be no chance to re-rank them to the top positions later. Therefore, in this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Neural Networks and Applications
