EnD: Entangling and Disentangling deep representations for bias   correction

Enzo Tartaglione; Carlo Alberto Barbano; Marco Grangetto

arXiv:2103.02023·cs.CV·December 29, 2022

EnD: Entangling and Disentangling deep representations for bias correction

Enzo Tartaglione, Carlo Alberto Barbano, Marco Grangetto

PDF

2 Repos

TL;DR

EnD is a regularization method that improves neural network fairness by disentangling bias information at a specific layer, enhancing generalization without adding complexity.

Contribution

The paper introduces EnD, a novel bias correction regularizer that disentangles bias information within deep models without extra training components.

Findings

01

EnD improves model generalization on unbiased test sets.

02

EnD effectively removes hidden biases in COVID-19 radiographic image classification.

03

EnD requires no additional training complexity.

Abstract

Artificial neural networks perform state-of-the-art in an ever-growing number of tasks, and nowadays they are used to solve an incredibly large variety of tasks. There are problems, like the presence of biases in the training data, which question the generalization capability of these models. In this work we propose EnD, a regularization strategy whose aim is to prevent deep models from learning unwanted biases. In particular, we insert an "information bottleneck" at a certain point of the deep neural network, where we disentangle the information about the bias, still letting the useful information for the training task forward-propagating in the rest of the model. One big advantage of EnD is that we do not require additional training complexity (like decoders or extra layers in the model), since it is a regularizer directly applied on the trained model. Our experiments show that EnD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.