Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Jitao Wang, Chengchun Shi, John D. Piette, Joshua R. Loftus, Donglin, Zeng, Zhenke Wu

TL;DR
This paper introduces a framework for fair reinforcement learning in healthcare, using causal inference and data preprocessing to reduce disparities while maintaining optimal decision-making.
Contribution
It develops a theoretical characterization of counterfactually fair policies and proposes a practical sequential data preprocessing method for fair RL.
Findings
The proposed method effectively reduces unfairness in simulations.
It achieves fairer access to healthcare interventions in real data.
The approach maintains high policy performance while ensuring fairness.
Abstract
When applied in healthcare, reinforcement learning (RL) seeks to dynamically match the right interventions to subjects to maximize population benefit. However, the learned policy may disproportionately allocate efficacious actions to one subpopulation, creating or exacerbating disparities in other socioeconomically-disadvantaged subgroups. These biases tend to occur in multi-stage decision making and can be self-perpetuating, which if unaccounted for could cause serious unintended consequences that limit access to care or treatment benefit. Counterfactual fairness (CF) offers a promising statistical tool grounded in causal inference to formulate and study fairness. In this paper, we propose a general framework for fair sequential decision making. We theoretically characterize the optimal CF policy and prove its stationarity, which greatly simplifies the search for optimal CF policies by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
MethodsCausal inference
