Justicia: A Stochastic SAT Approach to Formally Verify Fairness
Bishwamittra Ghosh, Debabrota Basu, Kuldeep S. Meel

TL;DR
This paper introduces Justicia, a stochastic SAT framework that formally verifies fairness metrics in machine learning models across datasets, offering scalability, robustness, and theoretical error bounds.
Contribution
The paper presents Justicia, a novel distribution-based stochastic SAT approach for verifying fairness in ML, addressing limitations of existing methods.
Findings
Verifies multiple fairness metrics like disparate impact and equalized odds.
Operates on non-Boolean and compound sensitive attributes.
Provides theoretical bounds on finite-sample verification error.
Abstract
As a technology ML is oblivious to societal good or bad, and thus, the field of fair machine learning has stepped up to propose multiple mathematical definitions, algorithms, and systems to ensure different notions of fairness in ML applications. Given the multitude of propositions, it has become imperative to formally verify the fairness metrics satisfied by different algorithms on different datasets. In this paper, we propose a stochastic satisfiability (SSAT) framework, Justicia, that formally verifies different fairness measures of supervised learning algorithms with respect to the underlying data distribution. We instantiate Justicia on multiple classification and bias mitigation algorithms, and datasets to verify different fairness metrics, such as disparate impact, statistical parity, and equalized odds. Justicia is scalable, accurate, and operates on non-Boolean and compound…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
