A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE

Herv\'e D\'ejean; St\'ephane Clinchant; Thibault Formal

arXiv:2403.10407·cs.IR·March 18, 2024·1 cites

A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE

Herv\'e D\'ejean, St\'ephane Clinchant, Thibault Formal

PDF

Open Access

TL;DR

This study compares cross-encoder and LLM-based rerankers for SPLADE retrieval, revealing that traditional cross-encoders remain competitive, especially out-of-domain, despite GPT-4's impressive zero-shot performance, informing search system design.

Contribution

It provides a comprehensive evaluation of cross-encoders versus LLM rerankers across multiple datasets, highlighting their relative strengths and limitations in different scenarios.

Findings

01

Cross-encoders are hard to distinguish in in-domain re-ranking.

02

Model type and number of documents impact effectiveness out-of-domain.

03

GPT-4 shows impressive zero-shot performance but traditional models remain competitive.

Abstract

We present a comparative study between cross-encoder and LLMs rerankers in the context of re-ranking effective SPLADE retrievers. We conduct a large evaluation on TREC Deep Learning datasets and out-of-domain datasets such as BEIR and LoTTE. In the first set of experiments, we show how cross-encoder rerankers are hard to distinguish when it comes to re-rerank SPLADE on MS MARCO. Observations shift in the out-of-domain scenario, where both the type of model and the number of documents to re-rank have an impact on effectiveness. Then, we focus on listwise rerankers based on Large Language Models -- especially GPT-4. While GPT-4 demonstrates impressive (zero-shot) performance, we show that traditional cross-encoders remain very competitive. Overall, our findings aim to to provide a more nuanced perspective on the recent excitement surrounding LLM-based re-rankers -- by positioning them as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies

MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Absolute Position Encodings · Softmax · Layer Normalization · Multi-Head Attention · Dropout · Residual Connection · Position-Wise Feed-Forward Layer