SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval
Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin

TL;DR
SPRINT is a unified Python toolkit that enables fair comparison and evaluation of neural sparse retrieval models, especially in zero-shot out-of-domain scenarios, advancing practical retrieval system development.
Contribution
The paper introduces SPRINT, a comprehensive toolkit supporting multiple neural sparse retrieval models within a unified environment, facilitating reproducible evaluation and analysis.
Findings
SPLADEv2 achieves the highest average nDCG@10 score of 0.470 on BEIR.
Sparse representations with tokens outside the original query and document are key to SPLADEv2's performance.
The toolkit enables fair, reproducible benchmarking of neural sparse retrieval models in zero-shot settings.
Abstract
Traditionally, sparse retrieval systems relied on lexical representations to retrieve documents, such as BM25, dominated information retrieval tasks. With the onset of pre-trained transformer models such as BERT, neural sparse retrieval has led to a new paradigm within retrieval. Despite the success, there has been limited software supporting different sparse retrievers running in a unified, common environment. This hinders practitioners from fairly comparing different sparse models and obtaining realistic evaluation results. Another missing piece is, that a majority of prior work evaluates sparse retrieval models on in-domain retrieval, i.e. on a single dataset: MS MARCO. However, a key requirement in practical retrieval systems requires models that can generalize well to unseen out-of-domain, i.e. zero-shot retrieval tasks. In this work, we provide SPRINT, a unified Python toolkit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Dense Connections · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · TILDEv2 · WordPiece · Weight Decay
