Learning Fairer Representations with FairVIC

Charmaine Barker; Daniel Bethell; Dimitar Kazakov

arXiv:2404.18134·cs.LG·October 22, 2025

Learning Fairer Representations with FairVIC

Charmaine Barker, Daniel Bethell, Dimitar Kazakov

PDF

Open Access 4 Reviews

TL;DR

FairVIC is a novel deep learning method that improves fairness by incorporating variance, invariance, and covariance into the loss function, achieving significant bias reduction without sacrificing accuracy.

Contribution

This paper introduces FairVIC, a new approach that enhances fairness in neural networks by abstracting fairness concepts and integrating specific statistical terms into the training loss.

Findings

01

Achieves approximately 70% improvement in fairness metrics

02

Maintains accuracy while significantly reducing bias

03

Effective across diverse datasets and fairness criteria

Abstract

Mitigating bias in automated decision-making systems, particularly in deep learning models, is a critical challenge due to nuanced definitions of fairness, dataset-specific biases, and the inherent trade-off between fairness and accuracy. To address these issues, we introduce FairVIC, an innovative approach that enhances fairness in neural networks by integrating variance, invariance, and covariance terms into the loss function during training. Unlike methods that rely on predefined fairness criteria, FairVIC abstracts fairness concepts to minimise dependency on protected characteristics. We evaluate FairVIC against comparable bias mitigation techniques on benchmark datasets, considering both group and individual fairness, and conduct an ablation study on the accuracy-fairness trade-off. FairVIC demonstrates significant improvements ( $\approx 70%$ ) in fairness across all tested metrics…

Peer Reviews

Decision·ICLR 2025 Conference Withdrawn Submission

Reviewer 01Rating 5Confidence 4

Strengths

- The method is broadly applicable to reduce a variety of biases at the same time - Weights on new loss terms can be dynamically balanced - The additional loss terms do not introduce a large computational overhead on the forward or backward passes

Weaknesses

See questions

Reviewer 02Rating 5Confidence 3

Strengths

1. This paper introduces a novel fairness regularizer that simultaneously optimizes for both group and individual fairness. 2. The proposed method is evaluated across two modalities, demonstrating optimal performance in terms of the fairness-accuracy trade-off when compared to baseline methods. 3. Additionally, the paper presents a method for automatically tuning the weights of the regularizer.

Weaknesses

1. The evaluation is limited by the number of datasets used. It would be beneficial to test the proposed method on a wider range of tabular and text classification datasets [1, 2, 3]. 2. FairVIC shares similarities with both fair representation learning methods and fairness penalty approaches; however, the authors only compare it with the adversarial debiasing algorithm by Zhang et al. It would be helpful to compare FairVIC with other fairness regularizers, such as those addressing equalized od

Reviewer 03Rating 3Confidence 5

Strengths

The authors propose fairness regularizers that aim to satisfy notions of group, individual, and counterfactual fairness simultaneously For the chosen datasets, the method is evaluated extensively against the chosen baselines. While there are a few additions and ablations I hope to see, the results are already considerable. I like the visualization provided by Figure 1.

Weaknesses

The introduction of the regularizers is somewhat instructable. Basic questions like how do they regularizers depend on the model’s predictions are left unanswered. The dismissal of prior approaches (e.g. regularizing using a metric grounded in legal precedent, post-processing methods at large) was rather uncharitable, in my opinion. What the authors do is describe potential limitations of these methods, but I think the claims are not properly substantiated, and a more detailed account for how t

Reviewer 04Rating 3Confidence 4

Strengths

This method is straightforward and comprehensible. The writing style is clear and easy to follow. Figure 1 effectively illustrates the distinctions among various methods.

Weaknesses

**Technical flaw:** My primary concern pertains to this point. The author introduces three loss terms in equations (1), (2), and (3). However, terms (1) and (2) *have no direct dependence on the network parameters*. Therefore, the gradient of the parameters with respect to those losses is always zero, making optimization of these terms nonsensical. This constitutes a significant technical flaw. **Lack of evaluation:** Given the plethora of in-processing methods available in fairness and robust

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications