Fairness in Supervised Learning: An Information Theoretic Approach
AmirEmad Ghassami, Sajad Khodadadian, Negar Kiyavash

TL;DR
This paper introduces an information theoretic framework for creating fair supervised learning models that prevent discrimination based on sensitive attributes by using data compression and equalized odds.
Contribution
It proposes a novel method that compresses data into an auxiliary variable to ensure fairness and generalization, addressing bias in decision-making systems.
Findings
Framework effectively reduces discrimination in predictions.
Auxiliary variable ensures fairness without sacrificing accuracy.
Method generalizes well across different datasets.
Abstract
Automated decision making systems are increasingly being used in real-world applications. In these systems for the most part, the decision rules are derived by minimizing the training error on the available historical data. Therefore, if there is a bias related to a sensitive attribute such as gender, race, religion, etc. in the data, say, due to cultural/historical discriminatory practices against a certain demographic, the system could continue discrimination in decisions by including the said bias in its decision rule. We present an information theoretic framework for designing fair predictors from data, which aim to prevent discrimination against a specified sensitive attribute in a supervised learning setting. We use equalized odds as the criterion for discrimination, which demands that the prediction should be independent of the protected attribute conditioned on the actual label.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
