Group Fairness in Bandit Arm Selection
Candice Schumann, Zhi Lang, Nicholas Mattei, John P. Dickerson

TL;DR
This paper introduces a new approach to ensure group fairness in contextual multi-armed bandit algorithms, addressing biased feedback and societal biases to promote equitable decision-making across sensitive groups.
Contribution
It proposes a novel algorithm that accounts for multiple groups and learns societal bias, with theoretical regret bounds and validation on synthetic and real-world data.
Findings
The algorithm effectively mitigates bias in arm selection.
Theoretical regret bounds are established for the proposed method.
Empirical results demonstrate improved fairness across groups.
Abstract
We propose a novel formulation of group fairness with biased feedback in the contextual multi-armed bandit (CMAB) setting. In the CMAB setting, a sequential decision maker must, at each time step, choose an arm to pull from a finite set of arms after observing some context for each of the potential arm pulls. In our model, arms are partitioned into two or more sensitive groups based on some protected feature(s) (e.g., age, race, or socio-economic status). Initial rewards received from pulling an arm may be distorted due to some unknown societal or measurement bias. We assume that in reality these groups are equal despite the biased feedback received by the agent. To alleviate this, we learn a societal bias term which can be used to both find the source of bias and to potentially fix the problem outside of the algorithm. We provide a novel algorithm that can accommodate this notion of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Auction Theory and Applications
