Perfectly Parallel Fairness Certification of Neural Networks
Caterina Urban, Maria Christakis, Valentin W\"ustholz, Fuyuan Zhang

TL;DR
This paper introduces a scalable, sound static analysis method for certifying the causal fairness of neural networks, providing guarantees or detailed bias descriptions, and is implemented in an open-source tool.
Contribution
It presents a novel perfectly parallel static analysis technique for fairness certification of neural networks, with practical scalability and precision features.
Findings
Effective on models trained with popular datasets
Provides definite fairness guarantees when certification succeeds
Quantifies biased behavior when certification fails
Abstract
Recently, there is growing concern that machine-learning models, which currently assist or even automate decision making, reproduce, and in the worst case reinforce, bias of the training data. The development of tools and techniques for certifying fairness of these models or describing their biased behavior is, therefore, critical. In this paper, we propose a perfectly parallel static analysis for certifying causal fairness of feed-forward neural networks used for classification of tabular data. When certification succeeds, our approach provides definite guarantees, otherwise, it describes and quantifies the biased behavior. We design the analysis to be sound, in practice also exact, and configurable in terms of scalability and precision, thereby enabling pay-as-you-go certification. We implement our approach in an open-source tool and demonstrate its effectiveness on models trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
