Fairness in Reinforcement Learning: A Survey
Anka Reuel, Devin Ma

TL;DR
This survey reviews recent advances in fairness in reinforcement learning, highlighting definitions, methodologies, applications, and gaps to guide future research in responsible AI deployment.
Contribution
It provides a comprehensive overview of fairness concepts, methods, and applications in RL, identifying key gaps and future directions for operationalizing fair RL.
Findings
Various fairness definitions in RL are discussed.
Methodologies for implementing fairness in single- and multi-agent RL are summarized.
Identifies gaps such as fairness in RLHF and real-world deployment challenges.
Abstract
While our understanding of fairness in machine learning has significantly progressed, our understanding of fairness in reinforcement learning (RL) remains nascent. Most of the attention has been on fairness in one-shot classification tasks; however, real-world, RL-enabled systems (e.g., autonomous vehicles) are much more complicated in that agents operate in dynamic environments over a long period of time. To ensure the responsible development and deployment of these systems, we must better understand fairness in RL. In this paper, we survey the literature to provide the most up-to-date snapshot of the frontiers of fairness in RL. We start by reviewing where fairness considerations can arise in RL, then discuss the various definitions of fairness in RL that have been put forth thus far. We continue to highlight the methodologies researchers used to implement fairness in single- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Experimental Behavioral Economics Studies · Innovation Diffusion and Forecasting
