TL;DR
This paper introduces a causal mediation analysis framework to distinguish between direct discrimination and structural inequality in AI credit decisions, providing new identification strategies and an open-source tool.
Contribution
It formalizes causal effects in credit fairness, proposes a novel estimator under treatment-induced confounding, and offers an open-source implementation for practical use.
Findings
77% of racial denial disparity is mediated by financial features influenced by structural inequality
Proposed estimator has semiparametric efficiency and robustness properties
Empirical analysis on mortgage data demonstrates the method's practical utility
Abstract
Statistical fairness metrics in AI-driven credit decisions conflate two causally distinct mechanisms: discrimination operating directly from a protected attribute to a credit outcome, and structural inequality propagating through legitimate financial features. We formalise this distinction using Pearl's framework of natural direct and indirect effects applied to the credit decision setting. Our primary theoretical contribution is an identification strategy for natural direct and indirect effects under treatment-induced confounding -- the prevalent setting in which protected attributes causally affect both financial mediators and the final decision, violating standard sequential ignorability. We show that interventional direct and indirect effects (IDE/IIE) are identified under the weaker Modified Sequential Ignorability assumption, and prove that IDE/IIE provide conservative bounds on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
