Fast Passage Re-ranking with Contextualized Exact Term Matching and Efficient Passage Expansion
Shengyao Zhuang, Guido Zuccon

TL;DR
TILDEv2 improves passage re-ranking by using contextualized exact term matching and efficient passage expansion, achieving higher effectiveness and significantly smaller indexes without increasing query latency, making it suitable for resource-constrained environments.
Contribution
TILDEv2 introduces a novel indexing and matching approach that reduces memory footprint and enhances effectiveness over the original TILDE, establishing a new state-of-the-art for CPU-only passage re-ranking.
Findings
Indexes are 99% smaller than TILDE.
Ranking effectiveness improves by 24%.
Query latency remains below 100ms on commodity hardware.
Abstract
BERT-based information retrieval models are expensive, in both time (query latency) and computational resources (energy, hardware cost), making many of these models impractical especially under resource constraints. The reliance on a query encoder that only performs tokenization and on the pre-processing of passage representations at indexing, has allowed the recently proposed TILDE method to overcome the high query latency issue typical of BERT-based models. This however is at the expense of a lower effectiveness compared to other BERT-based re-rankers and dense retrievers. In addition, the original TILDE method is characterised by indexes with a very high memory footprint, as it expands each passage into the size of the BERT vocabulary. In this paper, we propose TILDEv2, a new model that stems from the original TILDE but that addresses its limitations. TILDEv2 relies on contextualized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Advanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · Linear Layer · TILDEv2 · Multi-Head Attention · WordPiece · Softmax · Residual Connection · Attention Dropout · Layer Normalization · Dropout
