Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers

Zhichao Geng; Yiwen Wang; Dongyu Ru; Yang Yang

arXiv:2411.04403·cs.IR·July 2, 2025·2 cites

Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers

Zhichao Geng, Yiwen Wang, Dongyu Ru, Yang Yang

PDF

Open Access 1 Repo 10 Models 2 Datasets

TL;DR

This paper introduces novel training methods for inference-free sparse retrievers, significantly improving their search relevance to rival dense models while maintaining low latency.

Contribution

It proposes an IDF-aware penalty and a heterogeneous ensemble knowledge distillation framework to enhance inference-free sparse retriever performance.

Findings

01

Outperforms existing inference-free models by 3.3 NDCG@10 on BEIR

02

Achieves search relevance comparable to siamese sparse retrievers

03

Maintains client-side latency only 1.1x of BM25

Abstract

Learned sparse retrieval, which can efficiently perform retrieval through mature inverted-index engines, has garnered growing attention in recent years. Particularly, the inference-free sparse retrievers are attractive as they eliminate online model inference in the retrieval phase thereby avoids huge computational cost, offering reasonable throughput and latency. However, even the state-of-the-art (SOTA) inference-free sparse models lag far behind in terms of search relevance when compared to both sparse and dense siamese models. Towards competitive search relevance for inference-free sparse retrievers, we argue that they deserve dedicated training methods other than using same ones with siamese encoders. In this paper, we propose two different approaches for performance improvement. First, we propose an IDF-aware penalty for the matching function that suppresses the contribution of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhichao-aws/opensearch-sparse-model-tuning-sample
pytorch

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Information Retrieval and Search Behavior · Domain Adaptation and Few-Shot Learning

MethodsSoftmax · Attention Is All You Need · Knowledge Distillation