Reliable fairness auditing with semi-supervised inference

Jianhui Gao; Jessica Gronsbell

arXiv:2505.12181·stat.ME·May 19, 2026

Reliable fairness auditing with semi-supervised inference

Jianhui Gao, Jessica Gronsbell

PDF

2 Repos

TL;DR

This paper introduces Infairness, a semi-supervised framework for fairness auditing in machine learning, reducing data labeling costs while maintaining robustness and efficiency.

Contribution

The authors propose a novel semi-supervised inference method for fairness auditing that is robust and more efficient than traditional supervised approaches.

Findings

01

Infairness reduces variance by approximately 50% in real-world audits.

02

The estimator is robust to model specification.

03

It effectively combines small labeled and large unlabeled datasets.

Abstract

Machine learning (ML) models often exhibit bias that can exacerbate inequities in biomedical applications. Fairness auditing, the process of evaluating a model's performance across subpopulations, is critical for identifying and mitigating these biases. However, audits typically rely on large volumes of labeled data, which are costly and labor-intensive to obtain. To address this challenge, we introduce $Infairness$ , a unified framework for auditing a wide range of fairness criteria using semi-supervised inference. Our approach combines a small labeled dataset with a large unlabeled dataset by imputing missing outcomes via regression with carefully selected nonlinear basis functions. Through extensive theoretical and empirical analyses, we show that our proposed estimator is (i) robust to specification of the ML or imputation model and (ii) substantially more efficient than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)