GenFair: Systematic Test Generation for Fairness Fault Detection in Large Language Models
Madhusudan Srinivasan, Jubril Abdel

TL;DR
GenFair is a novel systematic testing framework that enhances fairness fault detection in large language models by generating diverse, realistic, and intersectional test cases using metamorphic testing techniques.
Contribution
It introduces a metamorphic fairness testing approach that improves detection of complex biases in LLMs over existing template-based methods.
Findings
GenFair achieves higher fault detection rates (0.73/0.69) than baselines.
It produces more diverse and coherent test cases.
GenFair effectively uncovers nuanced fairness violations.
Abstract
Large Language Models (LLMs) are increasingly deployed in critical domains, yet they often exhibit biases inherited from training data, leading to fairness concerns. This work focuses on the problem of effectively detecting fairness violations, especially intersectional biases that are often missed by existing template-based and grammar-based testing methods. Previous approaches, such as CheckList and ASTRAEA, provide structured or grammar-driven test generation but struggle with low test diversity and limited sensitivity to complex demographic interactions. To address these limitations, we propose GenFair, a metamorphic fairness testing framework that systematically generates source test cases using equivalence partitioning, mutation operators, and boundary value analysis. GenFair improves fairness testing by generating linguistically diverse, realistic, and intersectional test cases.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSafety Systems Engineering in Autonomy
