Controllable Feature Whitening for Hyperparameter-Free Bias Mitigation
Yooshin Cho, Hanbyel Cho, Janghyeon Lee, HyeongGwon Hong, Jaesung Ahn, Junmo Kim

TL;DR
The paper introduces a simple, controllable feature whitening method that effectively reduces bias in neural networks by removing linear correlations, improving fairness without complex regularization or adversarial training.
Contribution
It proposes a novel feature whitening framework that mitigates bias by eliminating linear correlations, handling multiple fairness criteria, and outperforming existing methods on benchmark datasets.
Findings
Significantly reduces bias in neural networks.
Effectively handles demographic parity and equalized odds.
Outperforms existing bias mitigation methods on four datasets.
Abstract
As the use of artificial intelligence rapidly increases, the development of trustworthy artificial intelligence has become important. However, recent studies have shown that deep neural networks are susceptible to learn spurious correlations present in datasets. To improve the reliability, we propose a simple yet effective framework called controllable feature whitening. We quantify the linear correlation between the target and bias features by the covariance matrix, and eliminate it through the whitening module. Our results systemically demonstrate that removing the linear correlations between features fed into the last linear classifier significantly mitigates the bias, while avoiding the need to model intractable higher-order dependencies. A particular advantage of the proposed method is that it does not require regularization terms or adversarial learning, which often leads to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
