FairLogue: Evaluating Intersectional Fairness across Clinical Machine Learning Use Cases using the All of Us Research Program
Nick Souligne, Vignesh Subbian

TL;DR
FairLogue is a toolkit designed to evaluate intersectional fairness in clinical machine learning models, revealing larger disparities than single-attribute analyses and emphasizing the importance of intersectional bias auditing.
Contribution
The paper introduces FairLogue, a novel toolkit for intersectional fairness auditing in healthcare, applied to real clinical models using the All of Us dataset.
Findings
Intersectional analysis revealed larger disparities than single-attribute evaluations.
Most observed disparities were similar to those expected under randomized group membership.
FairLogue provides deeper insights into bias in clinical machine learning systems.
Abstract
Intersectional biases in healthcare data can produce compound disparities in clinical machine learning models, yet most fairness evaluations assess demographic attributes independently. FairLogue, a toolkit for intersectional fairness auditing, was applied across multiple clinical prediction tasks to evaluate disparities across combined demographic groups. Using the All of Us dataset, two published models were selected for replication and evaluation: (A) prediction of selective serotonin reuptake inhibitor associated bleeding events and (B) two-year stroke risk in patients with atrial fibrillation. Observational fairness metrics were computed across race, gender, and intersectional subgroups, followed by counterfactual analysis to evaluate whether disparities were attributable to group membership. Intersectional evaluation revealed larger disparities than single-axis analyses; however,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
