Efficient Fairness Testing in Large Language Models: Prioritizing Metamorphic Relations for Bias Detection

Suavis Giramata; Madhusudan Srinivasan; Venkat Naidu Gudivada; Upulee Kanewala

arXiv:2505.07870·cs.CL·May 14, 2025

Efficient Fairness Testing in Large Language Models: Prioritizing Metamorphic Relations for Bias Detection

Suavis Giramata, Madhusudan Srinivasan, Venkat Naidu Gudivada, Upulee Kanewala

PDF

TL;DR

This paper proposes a diversity-based prioritization method for metamorphic testing of large language models to efficiently detect fairness issues, significantly improving fault detection rates and reducing testing time.

Contribution

It introduces a novel diversity-based approach to prioritize metamorphic relations, enhancing fairness testing efficiency in large language models.

Findings

01

Improves fault detection rate by 22% over random prioritization

02

Reduces time to first failure by 15%

03

Performs within 5% of fault-based prioritization in effectiveness

Abstract

Large Language Models (LLMs) are increasingly deployed in various applications, raising critical concerns about fairness and potential biases in their outputs. This paper explores the prioritization of metamorphic relations (MRs) in metamorphic testing as a strategy to efficiently detect fairness issues within LLMs. Given the exponential growth of possible test cases, exhaustive testing is impractical; therefore, prioritizing MRs based on their effectiveness in detecting fairness violations is crucial. We apply a sentence diversity-based approach to compute and rank MRs to optimize fault detection. Experimental results demonstrate that our proposed prioritization approach improves fault detection rates by 22% compared to random prioritization and 12% compared to distance-based prioritization, while reducing the time to the first failure by 15% and 8%, respectively. Furthermore, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.