Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Yinong Oliver Wang, Nivedha Sivakumar, Falaah Arif Khan, Rin Metcalf Susa, Adam Golinski, Natalie Mackraz, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

TL;DR
This paper introduces UCerF, an uncertainty-aware fairness metric for LLMs, along with a new dataset, to better evaluate and understand model fairness considering confidence and bias, surpassing traditional accuracy-based metrics.
Contribution
It proposes a novel fairness evaluation metric that incorporates model uncertainty and provides a new diverse dataset for more accurate fairness assessment of LLMs.
Findings
Mistral-7B shows fairness issues due to high confidence in errors.
UCerF captures fairness biases overlooked by traditional metrics.
Benchmark results highlight the importance of uncertainty in fairness evaluation.
Abstract
The recent rapid adoption of large language models (LLMs) highlights the critical need for benchmarking their fairness. Conventional fairness metrics, which focus on discrete accuracy-based evaluations (i.e., prediction correctness), fail to capture the implicit impact of model uncertainty (e.g., higher model confidence about one group over another despite similar accuracy). To address this limitation, we propose an uncertainty-aware fairness metric, UCerF, to enable a fine-grained evaluation of model fairness that is more reflective of the internal bias in model decisions compared to conventional fairness measures. Furthermore, observing data size, diversity, and clarity issues in current datasets, we introduce a new gender-occupation fairness evaluation dataset with 31,756 samples for co-reference resolution, offering a more diverse and suitable dataset for evaluating modern LLMs. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDigitalization, Law, and Regulation · Safety Systems Engineering in Autonomy · Ethics and Social Impacts of AI
MethodsFocus
