TL;DR
This paper evaluates the effectiveness of contextualized term-based models TILDE and TILDEv2 in query-by-example retrieval, compares them with BM25, and demonstrates that combining these models improves retrieval performance.
Contribution
It introduces a score interpolation method combining TILDE (or TILDEv2) with BM25, showing significant improvements in query-by-example retrieval accuracy.
Findings
BM25 performs competitively with TILDE and TILDEv2 in QBE.
Interpolating scores from TILDE/TILDEv2 and BM25 improves retrieval results.
Contextualized models capture different relevance signals than BM25.
Abstract
Term-based ranking with pre-trained transformer-based language models has recently gained attention as they bring the contextualization power of transformer models into the highly efficient term-based retrieval. In this work, we examine the generalizability of two of these deep contextualized term-based models in the context of query-by-example (QBE) retrieval in which a seed document acts as the query to find relevant documents. In this setting -- where queries are much longer than common keyword queries -- BERT inference at query time is problematic as it involves quadratic complexity. We investigate TILDE and TILDEv2, both of which leverage BERT tokenizer as their query encoder. With this approach, there is no need for BERT inference at query time, and also the query can be of any length. Our extensive evaluation on the four QBE tasks of SciDocs benchmark shows that in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dropout · Weight Decay · Linear Warmup With Linear Decay · Adam · Dense Connections
