Fast Passage Re-ranking with Contextualized Exact Term Matching and   Efficient Passage Expansion

Shengyao Zhuang; Guido Zuccon

arXiv:2108.08513·cs.IR·September 14, 2021·48 cites

Fast Passage Re-ranking with Contextualized Exact Term Matching and Efficient Passage Expansion

Shengyao Zhuang, Guido Zuccon

PDF

Open Access 1 Repo

TL;DR

TILDEv2 improves passage re-ranking by using contextualized exact term matching and efficient passage expansion, achieving higher effectiveness and significantly smaller indexes without increasing query latency, making it suitable for resource-constrained environments.

Contribution

TILDEv2 introduces a novel indexing and matching approach that reduces memory footprint and enhances effectiveness over the original TILDE, establishing a new state-of-the-art for CPU-only passage re-ranking.

Findings

01

Indexes are 99% smaller than TILDE.

02

Ranking effectiveness improves by 24%.

03

Query latency remains below 100ms on commodity hardware.

Abstract

BERT-based information retrieval models are expensive, in both time (query latency) and computational resources (energy, hardware cost), making many of these models impractical especially under resource constraints. The reliance on a query encoder that only performs tokenization and on the pre-processing of passage representations at indexing, has allowed the recently proposed TILDE method to overcome the high query latency issue typical of BERT-based models. This however is at the expense of a lower effectiveness compared to other BERT-based re-rankers and dense retrievers. In addition, the original TILDE method is characterised by indexes with a very high memory footprint, as it expands each passage into the size of the BERT vocabulary. In this paper, we propose TILDEv2, a new model that stems from the original TILDE but that addresses its limitations. TILDEv2 relies on contextualized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ielab/tilde
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Advanced Image and Video Retrieval Techniques

MethodsAttention Is All You Need · Linear Layer · TILDEv2 · Multi-Head Attention · WordPiece · Softmax · Residual Connection · Attention Dropout · Layer Normalization · Dropout