Multi-Group Fairness Evaluation via Conditional Value-at-Risk Testing
Lucas Monteiro Paes, Ananda Theertha Suresh, Alex Beutel, Flavio P., Calmon, Ahmad Beirami

TL;DR
This paper introduces a CVaR-based testing method to evaluate ML model fairness across multiple sensitive groups, significantly reducing sample complexity and addressing challenges of high-dimensional group attributes.
Contribution
It proposes a novel CVaR testing approach for fairness evaluation that lowers sample complexity exponentially and incorporates prior distribution and non-i.i.d. strategies.
Findings
Sample complexity is reduced exponentially with CVaR testing.
Re9nyi entropy characterizes the sample complexity under prior weighting.
Non-i.i.d. data collection can make sample complexity independent of group count.
Abstract
Machine learning (ML) models used in prediction and classification tasks may display performance disparities across population groups determined by sensitive attributes (e.g., race, sex, age). We consider the problem of evaluating the performance of a fixed ML model across population groups defined by multiple sensitive attributes (e.g., race and sex and age). Here, the sample complexity for estimating the worst-case performance gap across groups (e.g., the largest difference in error rates) increases exponentially with the number of group-denoting sensitive attributes. To address this issue, we propose an approach to test for performance disparities based on Conditional Value-at-Risk (CVaR). By allowing a small probabilistic slack on the groups over which a model has approximately equal performance, we show that the sample complexity required for discovering performance violations is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Statistical Methods and Inference · Advanced Statistical Process Monitoring
