Faster Learned Sparse Retrieval with Block-Max Pruning

Antonio Mallia; Torsten Suel; Nicola Tonellotto

arXiv:2405.01117 · cs.IR · May 3, 2024 · 1 citation

TL;DR

This paper introduces Block-Max Pruning (BMP), a dynamic pruning method for learned sparse retrieval indexes. BMP is more efficient than existing pruning strategies and offers a better tradeoff between precision and efficiency in approximate retrieval.

Contribution

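The paper presents a novel block-max pruning strategy designed specifically for learned sparse retrieval indexes, whose query and document statistics differ structurally from those of traditional retrieval models: longer queries, smaller vocabularies, and different score distributions within posting lists.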

Findings

01  BMP outperforms existing pruning strategies in efficiency

02  BMP improves tradeoffs between precision and efficiency

03  Experimental results demonstrate significant retrieval speedups

Abstract

Learned sparse retrieval systems aim to combine the effectiveness of contextualized language models with the scalability of conventional data structures such as inverted indexes. Nevertheless, the indexes generated by these systems exhibit significant deviations from the ones that use traditional retrieval models, leading to a discrepancy in the performance of existing query optimizations that were specifically developed for traditional structures. These disparities arise from structural variations in query and document statistics, including sub-word tokenization, leading to longer queries, smaller vocabularies, and different score distributions within posting lists. This paper introduces Block-Max Pruning (BMP), an innovative dynamic pruning strategy tailored for indexes arising in learned sparse retrieval environments. BMP employs a block filtering mechanism to divide the document…
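
The general idea of pruning with block-level score bounds can be illustrated with a small Python sketch. This is not the authors' implementation (see the repository listed under Code & Models for that). It assumes non-negative term weights, a fixed block size over consecutive document IDs, and simple dictionary-based postings; the names `build_index` and `search` are hypothetical. Each block gets an upper bound from the per-term maximum weights inside it, blocks are visited in decreasing order of that bound, and scoring stops once no remaining block can beat the current k-th best score.

```python
import heapq
from collections import defaultdict


def build_index(docs, block_size):
    """Build postings and per-block maxima (illustrative layout, an assumption).

    docs: list of {term: weight} dicts; a document's id is its list position.
    """
    n_blocks = (len(docs) + block_size - 1) // block_size
    postings = defaultdict(dict)                       # term -> {doc_id: weight}
    block_max = defaultdict(lambda: [0.0] * n_blocks)  # term -> max weight per block
    for doc_id, doc in enumerate(docs):
        b = doc_id // block_size
        for term, w in doc.items():
            postings[term][doc_id] = w
            block_max[term][b] = max(block_max[term][b], w)
    return postings, block_max, n_blocks


def search(query, postings, block_max, n_blocks, block_size, n_docs, k):
    """Return (top-k list of (score, doc_id), number of blocks fully scored)."""
    # Upper bound on the score of any document inside each block.
    ub = [0.0] * n_blocks
    for term, qw in query.items():
        if term in block_max:
            for b, m in enumerate(block_max[term]):
                ub[b] += qw * m

    # Visit the most promising blocks first.
    order = sorted(range(n_blocks), key=lambda b: ub[b], reverse=True)

    heap = []  # min-heap holding the current top-k (score, doc_id) pairs
    blocks_scored = 0
    for b in order:
        # Safe stop: no later block can contain a strictly better document.
        if len(heap) == k and ub[b] <= heap[0][0]:
            break
        blocks_scored += 1
        start = b * block_size
        end = min(start + block_size, n_docs)
        for doc_id in range(start, end):
            score = sum(
                qw * postings[t].get(doc_id, 0.0)
                for t, qw in query.items()
                if t in postings
            )
            if len(heap) < k:
                heapq.heappush(heap, (score, doc_id))
            elif score > heap[0][0]:
                heapq.heapreplace(heap, (score, doc_id))
    return sorted(heap, reverse=True), blocks_scored
```

In this sketch the stopping test keeps the result identical to exhaustive scoring (up to ties at the k-th score). Loosening it, for example by stopping after a fixed number of blocks, would trade exactness for speed, which is one plausible reading of the approximate setting the abstract mentions.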

Code & Models

Repositories

pisa-engine/BMP (official)

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning

Methods: Pruning