FAIRER: Fairness as Decision Rationale Alignment
Tianlin Li, Qing Guo, Aishan Liu, Mengnan Du, Zhiming Li, Yang Liu

TL;DR
This paper introduces a novel approach to fairness in deep neural networks by aligning decision rationales across subgroups, improving fairness without sacrificing accuracy.
Contribution
It proposes decision rationale alignment as a new fairness task and introduces gradient-guided parity alignment to optimize neuron consistency across subgroups.
Findings
Significantly improves fairness metrics across datasets.
Maintains high accuracy comparable to baseline models.
Outperforms existing fairness regularization methods.
Abstract
Deep neural networks (DNNs) have made significant progress, but often suffer from fairness issues, as deep models typically show distinct accuracy differences among certain subgroups (e.g., males and females). Existing research addresses this critical issue by employing fairness-aware loss functions to constrain the last-layer outputs and directly regularize DNNs. Although the fairness of DNNs is improved, it is unclear how the trained network makes a fair prediction, which limits future fairness improvements. In this paper, we investigate fairness from the perspective of decision rationale and define the parameter parity score to characterize the fair decision process of networks by analyzing neuron influence in various subgroups. Extensive empirical studies show that the unfair issue could arise from the unaligned decision rationales of subgroups. Existing fairness regularization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEthics and Social Impacts of AI
Methodsfail
