Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers
Zhichao Geng, Yiwen Wang, Dongyu Ru, Yang Yang

TL;DR
This paper introduces novel training methods for inference-free sparse retrievers, significantly improving their search relevance to rival dense models while maintaining low latency.
Contribution
It proposes an IDF-aware penalty and a heterogeneous ensemble knowledge distillation framework to enhance inference-free sparse retriever performance.
Findings
Outperforms existing inference-free models by 3.3 NDCG@10 on BEIR
Achieves search relevance comparable to siamese sparse retrievers
Maintains client-side latency only 1.1x of BM25
Abstract
Learned sparse retrieval, which can efficiently perform retrieval through mature inverted-index engines, has garnered growing attention in recent years. Particularly, the inference-free sparse retrievers are attractive as they eliminate online model inference in the retrieval phase thereby avoids huge computational cost, offering reasonable throughput and latency. However, even the state-of-the-art (SOTA) inference-free sparse models lag far behind in terms of search relevance when compared to both sparse and dense siamese models. Towards competitive search relevance for inference-free sparse retrievers, we argue that they deserve dedicated training methods other than using same ones with siamese encoders. In this paper, we propose two different approaches for performance improvement. First, we propose an IDF-aware penalty for the matching function that suppresses the contribution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗opensearch-project/opensearch-neural-sparse-encoding-v1model· 3.2k dl· ♡ 143.2k dl♡ 14
- 🤗opensearch-project/opensearch-neural-sparse-encoding-doc-v1model· 3.1k dl· ♡ 33.1k dl♡ 3
- 🤗opensearch-project/opensearch-neural-sparse-encoding-v2-distillmodel· 204k dl· ♡ 10204k dl♡ 10
- 🤗opensearch-project/opensearch-neural-sparse-encoding-doc-v2-distillmodel· 993k dl· ♡ 19993k dl♡ 19
- 🤗opensearch-project/opensearch-neural-sparse-encoding-doc-v2-minimodel· 1.3k dl· ♡ 61.3k dl♡ 6
- 🤗opensearch-project/opensearch-neural-sparse-encoding-multilingual-v1model· 7.8k dl· ♡ 177.8k dl♡ 17
- 🤗opensearch-project/opensearch-neural-sparse-encoding-doc-v3-gtemodel· 4.0k dl· ♡ 134.0k dl♡ 13
- 🤗Frinkleko/opensearch-doc-v3-gte-finetune-limit-samples-2model
- 🤗Frinkleko/opensearch-doc-v3-gte-finetune-limit-samples-1700model
- 🤗seerware/opensearch-neural-sparse-encoding-doc-v2-minimodel· 80 dl80 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Information Retrieval and Search Behavior · Domain Adaptation and Few-Shot Learning
MethodsSoftmax · Attention Is All You Need · Knowledge Distillation
