Gradient Based Activations for Accurate Bias-Free Learning
Vinod K Kurmi, Rishabh Sharma, Yash Vardhan Sharma, Vinay P., Namboodiri

TL;DR
This paper introduces a gradient-based feature masking technique that leverages biased discriminators to effectively reduce bias and improve accuracy in machine learning models, outperforming existing adversarial methods.
Contribution
It proposes a novel gradient-based feature masking approach that uses biased discriminators to enhance bias mitigation without sacrificing accuracy.
Findings
Reduces bias effectively while improving accuracy.
Outperforms recent bias mitigation methods on benchmarks.
Enhances unbiased feature learning through gradient-based masking.
Abstract
Bias mitigation in machine learning models is imperative, yet challenging. While several approaches have been proposed, one view towards mitigating bias is through adversarial learning. A discriminator is used to identify the bias attributes such as gender, age or race in question. This discriminator is used adversarially to ensure that it cannot distinguish the bias attributes. The main drawback in such a model is that it directly introduces a trade-off with accuracy as the features that the discriminator deems to be sensitive for discrimination of bias could be correlated with classification. In this work we solve the problem. We show that a biased discriminator can actually be used to improve this bias-accuracy tradeoff. Specifically, this is achieved by using a feature masking approach using the discriminator's gradients. We ensure that the features favoured for the bias…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
