Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs

Yinong Oliver Wang; Nivedha Sivakumar; Falaah Arif Khan; Rin Metcalf Susa; Adam Golinski; Natalie Mackraz; Barry-John Theobald; Luca Zappella; Nicholas Apostoloff

arXiv:2505.23996·cs.CL·June 2, 2025

Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs

Yinong Oliver Wang, Nivedha Sivakumar, Falaah Arif Khan, Rin Metcalf Susa, Adam Golinski, Natalie Mackraz, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces UCerF, an uncertainty-aware fairness metric for LLMs, along with a new dataset, to better evaluate and understand model fairness considering confidence and bias, surpassing traditional accuracy-based metrics.

Contribution

It proposes a novel fairness evaluation metric that incorporates model uncertainty and provides a new diverse dataset for more accurate fairness assessment of LLMs.

Findings

01

Mistral-7B shows fairness issues due to high confidence in errors.

02

UCerF captures fairness biases overlooked by traditional metrics.

03

Benchmark results highlight the importance of uncertainty in fairness evaluation.

Abstract

The recent rapid adoption of large language models (LLMs) highlights the critical need for benchmarking their fairness. Conventional fairness metrics, which focus on discrete accuracy-based evaluations (i.e., prediction correctness), fail to capture the implicit impact of model uncertainty (e.g., higher model confidence about one group over another despite similar accuracy). To address this limitation, we propose an uncertainty-aware fairness metric, UCerF, to enable a fine-grained evaluation of model fairness that is more reflective of the internal bias in model decisions compared to conventional fairness measures. Furthermore, observing data size, diversity, and clarity issues in current datasets, we introduce a new gender-occupation fairness evaluation dataset with 31,756 samples for co-reference resolution, offering a more diverse and suitable dataset for evaluating modern LLMs. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

apple/ml-synthbias
noneOfficial

Videos

Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs· slideslive

Taxonomy

TopicsDigitalization, Law, and Regulation · Safety Systems Engineering in Autonomy · Ethics and Social Impacts of AI

MethodsFocus