How Proxy Race Distorts Regression-Based Fairness Audits
Xi Xin, Giles Hooker, Fei Huang

TL;DR
This paper investigates how using proxy measures for race in regression-based fairness audits can distort results, often biasing disparity estimates and affecting regulatory assessments in high-stakes domains.
Contribution
It reveals the systematic biases introduced by proxy race measures in regression analyses and demonstrates these effects with empirical data, highlighting implications for fairness auditing.
Findings
Proxy race proxies cause systematic bias in disparity estimates.
Misclassification leads to attenuation or amplification of disparities.
Structured errors correlate with socioeconomic factors and influence results.
Abstract
Proxy-based race inference is increasingly used to conduct fairness assessments when protected-class data are unavailable or legally restricted -- most prominently in U.S. fair-lending enforcement, and now explicitly contemplated in emerging insurance regulation, including Colorado's draft SB21-169 testing framework and New York's Insurance Circular Letter No. 7. Despite this growing regulatory relevance, little is known about how standard regression-based discrimination analyses behave when race is measured with error through proxies such as Bayesian Improved Surname Geocoding (BISG) or Bayesian Improved First Name and Surname Geocoding (BIFSG). This paper studies the consequences of using proxy-imputed race as a categorical regressor in regression-based fairness assessments. Treating proxy race as a categorical covariate subject to misclassification, we show that proxy-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Occupational and Professional Licensing Regulation · Names, Identity, and Discrimination Research
