TARA: Training and Representation Alteration for AI Fairness and Domain Generalization
William Paul, Armin Hadzic, Neil Joshi, Fady Alajaji, Phil Burlina

TL;DR
TARA introduces a dual approach combining representation learning alteration and intelligent data augmentation to effectively mitigate bias in AI models, improving fairness and accuracy across datasets.
Contribution
The paper presents a novel dual strategy for AI fairness that integrates adversarial independence and generative augmentation, advancing bias mitigation techniques.
Findings
TARA significantly reduces bias and improves accuracy in image analytics.
It outperforms existing debiasing methods with higher accuracy and lower bias gaps.
Proposes new metrics for better assessment of debiasing effectiveness.
Abstract
We propose a novel method for enforcing AI fairness with respect to protected or sensitive factors. This method uses a dual strategy performing training and representation alteration (TARA) for the mitigation of prominent causes of AI bias by including: a) the use of representation learning alteration via adversarial independence to suppress the bias-inducing dependence of the data representation from protected factors; and b) training set alteration via intelligent augmentation to address bias-causing data imbalance, by using generative models that allow the fine control of sensitive factors related to underrepresented populations via domain adaptation and latent space manipulation. When testing our methods on image analytics, experiments demonstrate that TARA significantly or fully debiases baseline models while outperforming competing debiasing methods that have the same amount of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
