Re-Rankers as Relevance Judges
Chuan Meng, Jiqun Liu, Mohammad Aliannejadi, Fengran Mo, Jeff Dalton, Maarten de Rijke

TL;DR
This paper explores repurposing re-ranking models as relevance judges, demonstrating their effectiveness and biases in predicting relevance, and comparing them to existing LLM-based relevance judgment methods across multiple datasets.
Contribution
It introduces two adaptation strategies to use re-rankers as relevance judges and evaluates their performance and biases on extensive datasets, highlighting their potential and limitations.
Findings
Re-ranker-based judges outperform UMBRELA in 40-50% of cases.
They exhibit strong self-preference and cross-family bias.
Re-rankers can be effectively adapted for relevance judgment tasks.
Abstract
Using large language models (LLMs) to predict relevance judgments has shown promising results. Most studies treat this task as a distinct research line, e.g., focusing on prompt design for predicting relevance labels given a query and passage. However, predicting relevance judgments is essentially a form of relevance prediction, a problem extensively studied in tasks such as re-ranking. Despite this potential overlap, little research has explored reusing or adapting established re-ranking methods to predict relevance judgments, leading to potential resource waste and redundant development. To bridge this gap, we reproduce re-rankers in a re-ranker-as-relevance-judge setup. We design two adaptation strategies: (i) using binary tokens (e.g., "true" and "false") generated by a re-ranker as direct judgments, and (ii) converting continuous re-ranking scores into binary labels via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Explainable Artificial Intelligence (XAI)
