Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization
Wei Zhu, Haitian Zheng, Haofu Liao, Weijian Li, Jiebo Luo

TL;DR
This paper introduces a cross-sample adversarial debiasing method that disentangles target and bias features in deep learning models, effectively reducing bias influence and improving generalization.
Contribution
The paper proposes a novel CSAD approach using cross-sample neural mutual information estimation to remove bias, enhancing model fairness and robustness.
Findings
Outperforms state-of-the-art debiasing methods on benchmark datasets
Effectively disentangles target and bias features in representations
Improves generalization and fairness in deep learning models
Abstract
Deep learning algorithms mine knowledge from the training data and thus would likely inherit the dataset's bias information. As a result, the obtained model would generalize poorly and even mislead the decision process in real-life applications. We propose to remove the bias information misused by the target task with a cross-sample adversarial debiasing (CSAD) method. CSAD explicitly extracts target and bias features disentangled from the latent representation generated by a feature extractor and then learns to discover and remove the correlation between the target and bias features. The correlation measurement plays a critical role in adversarial debiasing and is conducted by a cross-sample neural mutual information estimator. Moreover, we propose joint content and local structural representation learning to boost mutual information estimation for better performance. We conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
