Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing

Jitao Wang; Chengchun Shi; John D. Piette; Joshua R. Loftus; Donglin Zeng; Zhenke Wu

arXiv:2501.06366·stat.ML·January 15, 2025

Open Access

TL;DR

This paper introduces a framework for fair reinforcement learning in healthcare, using causal inference and data preprocessing to reduce disparities while maintaining optimal decision-making.

Contribution

It develops a theoretical characterization of counterfactually fair policies and proposes a practical sequential data preprocessing method for fair RL.

Findings

01. The proposed method effectively reduces unfairness in simulations.

02. It achieves fairer access to healthcare interventions in real data.

03. The approach maintains high policy performance while ensuring fairness.

Abstract

When applied in healthcare, reinforcement learning (RL) seeks to dynamically match the right interventions to subjects to maximize population benefit. However, the learned policy may disproportionately allocate efficacious actions to one subpopulation, creating or exacerbating disparities in other socioeconomically disadvantaged subgroups. These biases tend to occur in multi-stage decision making and can be self-perpetuating; if unaccounted for, they could cause serious unintended consequences that limit access to care or treatment benefit. Counterfactual fairness (CF) offers a promising statistical tool grounded in causal inference to formulate and study fairness. In this paper, we propose a general framework for fair sequential decision making. We theoretically characterize the optimal CF policy and prove its stationarity, which greatly simplifies the search for optimal CF policies by…
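To make the counterfactual fairness idea concrete, here is a minimal single-stage sketch in Python. It is not the paper's sequential preprocessing algorithm. It assumes a hypothetical additive structural model in which the sensitive attribute shifts the observed state by a group-specific constant while an individual's latent noise stays fixed. The names `residualize`, `policy`, `group_effect`, and `agreement` are illustrative and do not come from the paper.

```python
import random


def residualize(states, groups):
    """Subtract each group's empirical mean from its members' states.

    Assumption: the sensitive attribute acts on the state only through
    an additive, group-specific shift.
    """
    sums, counts = {}, {}
    for s, z in zip(states, groups):
        sums[z] = sums.get(z, 0.0) + s
        counts[z] = counts.get(z, 0) + 1
    means = {z: sums[z] / counts[z] for z in sums}
    adjusted = [s - means[z] for s, z in zip(states, groups)]
    return adjusted, means


def policy(x, threshold=0.0):
    """Toy threshold policy: treat (1) when the input exceeds the threshold."""
    return 1 if x > threshold else 0


rng = random.Random(0)
n = 1000
group_effect = {0: 0.0, 1: -1.5}  # hypothetical structural shift per group
groups = [rng.randint(0, 1) for _ in range(n)]
noise = [rng.gauss(0.0, 1.0) for _ in range(n)]  # individual latent noise
states = [group_effect[z] + u for z, u in zip(groups, noise)]

# Estimate the group means from the observed data only.
_, means = residualize(states, groups)


def agreement(decide):
    """Share of individuals whose action is the same under z=0 and z=1,
    holding their latent noise fixed (the counterfactual comparison)."""
    same = sum(decide(0, u) == decide(1, u) for u in noise)
    return same / n


# Policy applied to the raw state vs. the preprocessed (residualized) state.
agree_raw = agreement(lambda z, u: policy(group_effect[z] + u))
agree_fair = agreement(lambda z, u: policy(group_effect[z] + u - means[z]))

print("agreement on raw states:", agree_raw)
print("agreement on preprocessed states:", agree_fair)
```

In this toy setting, a policy that sees only the residualized state gives nearly every individual the same action under either value of the sensitive attribute, whereas the same policy on raw states does not. Agreement is not exactly 1 because the group means are estimated from finite data.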

Taxonomy

Topics: Ethics and Social Impacts of AI

Methods: Causal inference