Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
Itai Gat, Idan Schwartz, Alexander Schwing, Tamir Hazan

TL;DR
This paper introduces a novel regularization technique based on functional entropy to balance modality contributions in multi-modal classifiers, improving fairness and performance on datasets like VQA-CPv2 and SocialIQ.
Contribution
It proposes a new regularization method using functional entropy and the log-Sobolev inequality to enhance multi-modal classifier fairness and effectiveness.
Findings
Achieved state-of-the-art results on VQA-CPv2 and SocialIQ datasets.
Effectively balanced modality contributions in multi-modal classification.
Demonstrated improved performance on Colored MNIST.
Abstract
Many recent datasets contain a variety of different data modalities, for instance, image, question, and answer data in visual question answering (VQA). When training deep net classifiers on those multi-modal datasets, the modalities get exploited at different scales, i.e., some modalities can more easily contribute to the classification results than others. This is suboptimal because the classifier is inherently biased towards a subset of the modalities. To alleviate this shortcoming, we propose a novel regularization term based on the functional entropy. Intuitively, this term encourages to balance the contribution of each modality to the classification result. However, regularization with the functional entropy is challenging. To address this, we develop a method based on the log-Sobolev inequality, which bounds the functional entropy with the functional-Fisher-information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
