Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector

Haoyan Yang; Runxue Bao; Cao Xiao; Jun Ma; Parminder Bhatia; Shangqian Gao; Taha Kass-Hout

arXiv:2505.17100·cs.CL·October 29, 2025

Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector

Haoyan Yang, Runxue Bao, Cao Xiao, Jun Ma, Parminder Bhatia, Shangqian Gao, Taha Kass-Hout

PDF

2 Datasets

TL;DR

This paper introduces a reasoning-based bias detector (RBD) that externally identifies biases in large language model evaluations and guides self-correction, significantly improving evaluation reliability across multiple bias types and model scales.

Contribution

The paper presents RBD, a novel external module for bias detection and correction in LLM evaluators, addressing limitations of existing methods and demonstrating scalability and effectiveness.

Findings

01

RBD improves evaluation accuracy by 18.5% on average.

02

RBD enhances consistency by 10.9%.

03

RBD outperforms prompting baselines and fine-tuned judges.

Abstract

LLM-as-a-Judge has emerged as a promising tool for automatically evaluating generated outputs, but its reliability is often undermined by potential biases in judgment. Existing efforts to mitigate these biases face key limitations: in-context learning-based methods fail to address rooted biases due to the evaluator's limited capacity for self-reflection, whereas fine-tuning is not applicable to all evaluator types, especially closed-source models. To address this challenge, we introduce the Reasoning-based Bias Detector (RBD), which is a plug-in module that identifies biased evaluations and generates structured reasoning to guide evaluator self-correction. Rather than modifying the evaluator itself, RBD operates externally and engages in an iterative process of bias detection and feedback-driven revision. To support its development, we design a complete pipeline consisting of biased…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.