Faster Learned Sparse Retrieval with Block-Max Pruning
Antonio Mallia, Torten Suel, Nicola Tonellotto

TL;DR
This paper introduces Block-Max Pruning, a dynamic pruning method for learned sparse retrieval indexes, significantly enhancing efficiency and accuracy in safe and approximate retrieval scenarios.
Contribution
The paper presents a novel block-max pruning strategy specifically designed for learned sparse retrieval indexes, addressing structural differences from traditional models.
Findings
BMP outperforms existing pruning strategies in efficiency
BMP improves tradeoffs between precision and efficiency
Experimental results demonstrate significant retrieval speedups
Abstract
Learned sparse retrieval systems aim to combine the effectiveness of contextualized language models with the scalability of conventional data structures such as inverted indexes. Nevertheless, the indexes generated by these systems exhibit significant deviations from the ones that use traditional retrieval models, leading to a discrepancy in the performance of existing query optimizations that were specifically developed for traditional structures. These disparities arise from structural variations in query and document statistics, including sub-word tokenization, leading to longer queries, smaller vocabularies, and different score distributions within posting lists. This paper introduces Block-Max Pruning (BMP), an innovative dynamic pruning strategy tailored for indexes arising in learned sparse retrieval environments. BMP employs a block filtering mechanism to divide the document…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
MethodsPruning
