Learning Debiased Representation via Disentangled Feature Augmentation
Jungsoo Lee, Eungyeup Kim, Juyoung Lee, Jihyeon Lee, Jaegul Choo

TL;DR
This paper introduces a novel data augmentation method that synthesizes diverse bias-conflicting samples by disentangling intrinsic and bias attributes, significantly improving debiasing and generalization in image classification models.
Contribution
It proposes a feature-level augmentation technique using disentangled representations to generate diverse bias-conflicting samples, enhancing debiasing effectiveness.
Findings
Achieves superior accuracy on synthetic datasets
Outperforms existing debiasing baselines on real-world datasets
Demonstrates the importance of diverse bias-conflicting samples
Abstract
Image classification models tend to make decisions based on peripheral attributes of data items that have strong correlation with a target variable (i.e., dataset bias). These biased models suffer from the poor generalization capability when evaluated on unbiased datasets. Existing approaches for debiasing often identify and emphasize those samples with no such correlation (i.e., bias-conflicting) without defining the bias type in advance. However, such bias-conflicting samples are significantly scarce in biased datasets, limiting the debiasing capability of these approaches. This paper first presents an empirical analysis revealing that training with "diverse" bias-conflicting samples beyond a given training set is crucial for debiasing as well as the generalization capability. Based on this observation, we propose a novel feature-level data augmentation technique in order to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
