TL;DR
EnD is a regularization method that improves neural network fairness by disentangling bias information at a specific layer, enhancing generalization without adding complexity.
Contribution
The paper introduces EnD, a novel bias correction regularizer that disentangles bias information within deep models without extra training components.
Findings
EnD improves model generalization on unbiased test sets.
EnD effectively removes hidden biases in COVID-19 radiographic image classification.
EnD requires no additional training complexity.
Abstract
Artificial neural networks perform state-of-the-art in an ever-growing number of tasks, and nowadays they are used to solve an incredibly large variety of tasks. There are problems, like the presence of biases in the training data, which question the generalization capability of these models. In this work we propose EnD, a regularization strategy whose aim is to prevent deep models from learning unwanted biases. In particular, we insert an "information bottleneck" at a certain point of the deep neural network, where we disentangle the information about the bias, still letting the useful information for the training task forward-propagating in the rest of the model. One big advantage of EnD is that we do not require additional training complexity (like decoders or extra layers in the model), since it is a regularizer directly applied on the trained model. Our experiments show that EnD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
