Faster Learned Sparse Retrieval with Guided Traversal
Antonio Mallia, Joel Mackenzie, Torsten Suel, Nicola Tonellotto

TL;DR
This paper introduces a guided traversal method for learned sparse retrieval models that significantly improves efficiency, reducing scoring operations by four times without sacrificing effectiveness.
Contribution
It proposes a novel index traversal technique that leverages traditional sparse models to accelerate learned sparse retrieval methods.
Findings
Boosts retrieval efficiency by a factor of four.
Maintains effectiveness comparable to existing learned sparse models.
Demonstrates practical applicability in neural information retrieval systems.
Abstract
Neural information retrieval architectures based on transformers such as BERT are able to significantly improve system effectiveness over traditional sparse models such as BM25. Though highly effective, these neural approaches are very expensive to run, making them difficult to deploy under strict latency constraints. To address this limitation, recent studies have proposed new families of learned sparse models that try to match the effectiveness of learned dense models, while leveraging the traditional inverted index data structure for efficiency. Current learned sparse models learn the weights of terms in documents and, sometimes, queries; however, they exploit different vocabulary structures, document expansion techniques, and query expansion strategies, which can make them slower than traditional sparse models such as BM25. In this work, we propose a novel indexing and query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Layer Normalization · Dense Connections · Attention Dropout · Softmax
