Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse   Retrieval Strategy Selection

Negar Arabzadeh; Xinyi Yan; Charles L. A. Clarke

arXiv:2109.10739·cs.IR·September 23, 2021

Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection

Negar Arabzadeh, Xinyi Yan, Charles L. A. Clarke

PDF

TL;DR

This paper introduces a classifier that dynamically chooses between sparse, dense, or hybrid retrieval strategies for individual queries, optimizing the trade-off between computational cost and retrieval effectiveness in information retrieval tasks.

Contribution

It proposes a novel query-based classifier for strategy selection, enabling resource-efficient retrieval without sacrificing performance.

Findings

01

The classifier effectively balances retrieval quality and computational resources.

02

Hybrid strategies outperform purely sparse or dense methods under certain constraints.

03

Query-specific strategy selection improves efficiency and effectiveness trade-offs.

Abstract

Over the last few years, contextualized pre-trained transformer models such as BERT have provided substantial improvements on information retrieval tasks. Recent approaches based on pre-trained transformer models such as BERT, fine-tune dense low-dimensional contextualized representations of queries and documents in embedding space. While these dense retrievers enjoy substantial retrieval effectiveness improvements compared to sparse retrievers, they are computationally intensive, requiring substantial GPU resources, and dense retrievers are known to be more expensive from both time and resource perspectives. In addition, sparse retrievers have been shown to retrieve complementary information with respect to dense retrievers, leading to proposals for hybrid retrievers. These hybrid retrievers leverage low-cost, exact-matching based sparse retrievers along with dense retrievers to bridge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Softmax · WordPiece · Layer Normalization · Residual Connection · Linear Warmup With Linear Decay · Dropout · Attention Dropout