Fairness Under Feature Exemptions: Counterfactual and Observational Measures
Sanghamitra Dutta, Praveen Venkatesh, Piotr Mardziel, Anupam Datta,, Pulkit Grover

TL;DR
This paper introduces an information-theoretic method to quantify and decompose disparity in machine learning models into exempt and non-exempt components, enabling fairer decision-making when certain features are justified.
Contribution
It proposes a novel decomposition of disparity inspired by counterfactual fairness, satisfying key axioms, and bridges causality, Simpson's paradox, and information theory.
Findings
The measure satisfies all proposed axioms for non-exempt disparity.
An impossibility result shows observational measures cannot satisfy all properties.
Case studies demonstrate practical application in auditing and training models.
Abstract
With the growing use of ML in highly consequential domains, quantifying disparity with respect to protected attributes, e.g., gender, race, etc., is important. While quantifying disparity is essential, sometimes the needs of an occupation may require the use of certain features that are critical in a way that any disparity that can be explained by them might need to be exempted. E.g., in hiring a software engineer for a safety-critical application, coding-skills may be weighed strongly, whereas name, zip code, or reference letters may be used only to the extent that they do not add disparity. In this work, we propose an information-theoretic decomposition of the total disparity (a quantification inspired from counterfactual fairness) into two components: a non-exempt component which quantifies the part that cannot be accounted for by the critical features, and an exempt component that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
