Measuring and signing fairness as performance under multiple stakeholder distributions
David Lopez-Paz, Diane Bouchacourt, Levent Sagun, Nicolas Usunier

TL;DR
This paper proposes a new approach to fairness in machine learning by evaluating systems across multiple stakeholder-defined distributions, emphasizing context-dependent assessments and stakeholder involvement.
Contribution
It introduces a distribution-based fairness evaluation framework using stress tests, moving beyond rigid metrics to incorporate stakeholder-curated examples and interpretability.
Findings
Stress testing reveals fairness strengths and weaknesses.
Stakeholder involvement enhances fairness assessment.
Framework improves interpretability and contextual relevance.
Abstract
As learning machines increase their influence on decisions concerning human lives, analyzing their fairness properties becomes a subject of central importance. Yet, our best tools for measuring the fairness of learning systems are rigid fairness metrics encapsulated as mathematical one-liners, offer limited power to the stakeholders involved in the prediction task, and are easy to manipulate when we exhort excessive pressure to optimize them. To advance these issues, we propose to shift focus from shaping fairness metrics to curating the distributions of examples under which these are computed. In particular, we posit that every claim about fairness should be immediately followed by the tagline "Fair under what examples, and collected by whom?". By highlighting connections to the literature in domain generalization, we propose to measure fairness as the ability of the system to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
MethodsTest
