GenFair: Systematic Test Generation for Fairness Fault Detection in Large Language Models

Madhusudan Srinivasan; Jubril Abdel

arXiv:2506.03024·cs.SE·June 4, 2025

GenFair: Systematic Test Generation for Fairness Fault Detection in Large Language Models

Madhusudan Srinivasan, Jubril Abdel

PDF

Open Access

TL;DR

GenFair is a novel systematic testing framework that enhances fairness fault detection in large language models by generating diverse, realistic, and intersectional test cases using metamorphic testing techniques.

Contribution

It introduces a metamorphic fairness testing approach that improves detection of complex biases in LLMs over existing template-based methods.

Findings

01

GenFair achieves higher fault detection rates (0.73/0.69) than baselines.

02

It produces more diverse and coherent test cases.

03

GenFair effectively uncovers nuanced fairness violations.

Abstract

Large Language Models (LLMs) are increasingly deployed in critical domains, yet they often exhibit biases inherited from training data, leading to fairness concerns. This work focuses on the problem of effectively detecting fairness violations, especially intersectional biases that are often missed by existing template-based and grammar-based testing methods. Previous approaches, such as CheckList and ASTRAEA, provide structured or grammar-driven test generation but struggle with low test diversity and limited sensitivity to complex demographic interactions. To address these limitations, we propose GenFair, a metamorphic fairness testing framework that systematically generates source test cases using equivalence partitioning, mutation operators, and boundary value analysis. GenFair improves fairness testing by generating linguistically diverse, realistic, and intersectional test cases.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSafety Systems Engineering in Autonomy