Transparency Tools for Fairness in AI (Luskin)
Mingliang Chen, Aria Shahverdi, Sarah Anderson, Se Yong Park, Justin, Zhang, Dana Dachman-Soled, Kristin Lauter, Min Wu

TL;DR
This paper introduces new fairness assessment and correction tools for AI, including a novel fairness definition, retraining algorithms, and parameter adjustment methods, tested on real datasets to reduce bias effectively.
Contribution
It proposes the concept of controlled fairness, along with algorithms for retraining and parameter adjustment to improve fairness in AI models.
Findings
Algorithms effectively reduce bias in tested datasets.
Tools help understand different bias dimensions.
Methods perform well in practical bias mitigation.
Abstract
We propose new tools for policy-makers to use when assessing and correcting fairness and bias in AI algorithms. The three tools are: - A new definition of fairness called "controlled fairness" with respect to choices of protected features and filters. The definition provides a simple test of fairness of an algorithm with respect to a dataset. This notion of fairness is suitable in cases where fairness is prioritized over accuracy, such as in cases where there is no "ground truth" data, only data labeled with past decisions (which may have been biased). - Algorithms for retraining a given classifier to achieve "controlled fairness" with respect to a choice of features and filters. Two algorithms are presented, implemented and tested. These algorithms require training two different models in two stages. We experiment with combinations of various types of models for the first and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
