AGR: Age Group fairness Reward for Bias Mitigation in LLMs
Shuirong Cao, Ruoxi Cheng, Zhiqiang Wang

TL;DR
This paper introduces ARG, an age fairness reward for LLMs, which reduces age-related bias and improves response consistency across age groups through new datasets and reinforcement learning techniques.
Contribution
It constructs age bias datasets and proposes ARG, a novel reward to enhance age fairness in LLMs, addressing a gap in bias mitigation research.
Findings
ARG significantly improves response accuracy across age groups
It reduces disparities in LLM performance related to age
The approach is validated through extensive experiments
Abstract
LLMs can exhibit age biases, resulting in unequal treatment of individuals across age groups. While much research has addressed racial and gender biases, age bias remains little explored. The scarcity of instruction-tuning and preference datasets for age bias hampers its detection and measurement, and existing fine-tuning methods seldom address age-related fairness. In this paper, we construct age bias preference datasets and instruction-tuning datasets for RLHF. We introduce ARG, an age fairness reward to reduce differences in the response quality of LLMs across different age groups. Extensive experiments demonstrate that this reward significantly improves response accuracy and reduces performance disparities across age groups. Our source code and datasets are available at the anonymous \href{https://anonymous.4open.science/r/FairRLHF-D445/readme.md}{link}.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLaw, Economics, and Judicial Systems · Dispute Resolution and Class Actions
