A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE
Herv\'e D\'ejean, St\'ephane Clinchant, Thibault Formal

TL;DR
This study compares cross-encoder and LLM-based rerankers for SPLADE retrieval, revealing that traditional cross-encoders remain competitive, especially out-of-domain, despite GPT-4's impressive zero-shot performance, informing search system design.
Contribution
It provides a comprehensive evaluation of cross-encoders versus LLM rerankers across multiple datasets, highlighting their relative strengths and limitations in different scenarios.
Findings
Cross-encoders are hard to distinguish in in-domain re-ranking.
Model type and number of documents impact effectiveness out-of-domain.
GPT-4 shows impressive zero-shot performance but traditional models remain competitive.
Abstract
We present a comparative study between cross-encoder and LLMs rerankers in the context of re-ranking effective SPLADE retrievers. We conduct a large evaluation on TREC Deep Learning datasets and out-of-domain datasets such as BEIR and LoTTE. In the first set of experiments, we show how cross-encoder rerankers are hard to distinguish when it comes to re-rerank SPLADE on MS MARCO. Observations shift in the out-of-domain scenario, where both the type of model and the number of documents to re-rank have an impact on effectiveness. Then, we focus on listwise rerankers based on Large Language Models -- especially GPT-4. While GPT-4 demonstrates impressive (zero-shot) performance, we show that traditional cross-encoders remain very competitive. Overall, our findings aim to to provide a more nuanced perspective on the recent excitement surrounding LLM-based re-rankers -- by positioning them as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Absolute Position Encodings · Softmax · Layer Normalization · Multi-Head Attention · Dropout · Residual Connection · Position-Wise Feed-Forward Layer
