TL;DR
This paper introduces ExpGA, a novel explanation-guided genetic algorithm for fairness testing that improves efficiency and effectiveness in detecting discriminatory samples across various models and datasets.
Contribution
ExpGA combines explanation methods with genetic algorithms to enhance fairness testing, overcoming limitations of low efficiency, low effectiveness, and model-specificity in prior approaches.
Findings
ExpGA outperforms four state-of-the-art methods in efficiency and effectiveness.
It generalizes well across different models and data types.
Experiments on real-world benchmarks validate its superior performance.
Abstract
The fairness characteristic is a critical attribute of trusted AI systems. A plethora of research has proposed diverse methods for individual fairness testing. However, they are suffering from three major limitations, i.e., low efficiency, low effectiveness, and model-specificity. This work proposes ExpGA, an explanationguided fairness testing approach through a genetic algorithm (GA). ExpGA employs the explanation results generated by interpretable methods to collect high-quality initial seeds, which are prone to derive discriminatory samples by slightly modifying feature values. ExpGA then adopts GA to search discriminatory sample candidates by optimizing a fitness value. Benefiting from this combination of explanation results and GA, ExpGA is both efficient and effective to detect discriminatory individuals. Moreover, ExpGA only requires prediction probabilities of the tested model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGenetic Algorithms
