Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization
Hongli Zhou, Hui Huang, Rui Zhang, Kehai Chen, Bing Xu, Conghui Zhu, Tiejun Zhao, Muyun Yang

TL;DR
This paper introduces JudgeBiasBench, a comprehensive benchmark for evaluating biases in LLM-based judges, revealing significant biases and proposing bias-aware training methods to mitigate them, thereby improving evaluation reliability.
Contribution
It presents a systematic bias evaluation benchmark and novel bias mitigation techniques for LLM-based judges, addressing the lack of comprehensive bias analysis in prior work.
Findings
Current judges exhibit significant and diverse biases.
Bias-aware training reduces judgment biases effectively.
Proposed methods preserve evaluation capabilities while mitigating biases.
Abstract
Large language model (LLM)-based judges are widely adopted for automated evaluation and reward modeling, yet their judgments are often affected by judgment biases. Accurately evaluating these biases is essential for ensuring the reliability of LLM-based judges. However, existing studies typically investigate limited biases under a single judge formulation, either generative or discriminative, lacking a comprehensive evaluation. To bridge this gap, we propose JudgeBiasBench, a benchmark for systematically quantifying biases in LLM-based judges. JudgeBiasBench defines a taxonomy of judgment biases across 4 dimensions, and constructs bias-augmented evaluation instances through a controlled bias injection pipeline, covering 12 representative bias types. We conduct extensive experiments across both generative and discriminative judges, revealing that current judges exhibit significant and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
