AGR: Age Group fairness Reward for Bias Mitigation in LLMs

Shuirong Cao; Ruoxi Cheng; Zhiqiang Wang

arXiv:2409.04340·cs.LG·September 9, 2024

AGR: Age Group fairness Reward for Bias Mitigation in LLMs

Shuirong Cao, Ruoxi Cheng, Zhiqiang Wang

PDF

Open Access

TL;DR

This paper introduces ARG, an age fairness reward for LLMs, which reduces age-related bias and improves response consistency across age groups through new datasets and reinforcement learning techniques.

Contribution

It constructs age bias datasets and proposes ARG, a novel reward to enhance age fairness in LLMs, addressing a gap in bias mitigation research.

Findings

01

ARG significantly improves response accuracy across age groups

02

It reduces disparities in LLM performance related to age

03

The approach is validated through extensive experiments

Abstract

LLMs can exhibit age biases, resulting in unequal treatment of individuals across age groups. While much research has addressed racial and gender biases, age bias remains little explored. The scarcity of instruction-tuning and preference datasets for age bias hampers its detection and measurement, and existing fine-tuning methods seldom address age-related fairness. In this paper, we construct age bias preference datasets and instruction-tuning datasets for RLHF. We introduce ARG, an age fairness reward to reduce differences in the response quality of LLMs across different age groups. Extensive experiments demonstrate that this reward significantly improves response accuracy and reduces performance disparities across age groups. Our source code and datasets are available at the anonymous \href{https://anonymous.4open.science/r/FairRLHF-D445/readme.md}{link}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaw, Economics, and Judicial Systems · Dispute Resolution and Class Actions