Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression
Kuniko Paxton, Koorosh Aslansefat, Dhavalkumar Thakker, Yiannis Papadopoulos

TL;DR
This paper presents a novel fairness evaluation and bias mitigation technique for machine learning models handling tensor data like skin color, using Bayesian regression to improve equity in image classification without relying on annotations.
Contribution
It introduces a new approach to assess and mitigate bias in ML models dealing with tensor data, specifically skin color, by converting data into probability distributions and applying Bayesian regression.
Findings
Effective fairness evaluation without annotations
Improved bias mitigation in skin tone classification
Captures nuanced differences across skin color groups
Abstract
Fairness is a critical component of Trustworthy AI. In this paper, we focus on Machine Learning (ML) and the performance of model predictions when dealing with skin color. Unlike other sensitive attributes, the nature of skin color differs significantly. In computer vision, skin color is represented as tensor data rather than categorical values or single numerical points. However, much of the research on fairness across sensitive groups has focused on categorical features such as gender and race. This paper introduces a new technique for evaluating fairness in ML for image classification tasks, specifically without the use of annotation. To address the limitations of prior work, we handle tensor data, like skin color, without classifying it rigidly. Instead, we convert it into probability distributions and apply statistical distance measures. This novel approach allows us to capture…
Peer Reviews
Decision·ICLR 2025 Conference Withdrawn Submission
1. Extend categorical groups by representing skin color as distributions on which Wasserstein Distance can be applied. The method is generically applicable to multi- dimensional and continuous data. 2. A new latent bias mitigation method is proposed for individual fairness that leverages Bayesian regression estimation of performance.
1. Although skin color is an important fairness indicator and its continuity fits the motivation of the paper, it appears a significant limitation to only consider skin color. There are many other continuous sensitive features, and the paper didn’t consider in the experiment. Is it because they are too easy and do not unleash the full power of the method (which can be applied to tensors)? It will be interesting to see the effectiveness of the proposed method on other continuous valued attributes
1. Introduces a novel approach to fairness by representing skin color as continuous tensor data, avoiding traditional categorical groupings. 2. Uses Bayesian regression and Wasserstein Distance to capture individual-level fairness without requiring categorical annotations.
**Insufficient Coverage and Comparison with Related Works:** The paper does not provide a discussion on dependence-based methods [6-11] or adversarial representation learning approaches [1-5], both of which are established techniques for debiasing machine learning models. While the setting of this study is distinct, the continuous skin tone attribute extracted in the initial phase of this method could also be applied in models handling continuous attributes, aligning with those frameworks. I ha
The approach of treating physical characteristics as continuous variables, rather than discrete categories, is compelling. This applies not only to skin tone, but also to other demographic attributes (eg. age, perceived gender,...) and physical features (eg. hair color, perceived attractiveness, ...). While the idea of adopting continuous representations isn't novel [1,2] and the proposed method applies only for skin tone, the idea of implementing it without requiring annotated data presents an
1. Although I understand that the scope is to focus on skin tone, the study is a bit limited as the proposed methodology seemingly doesn't transfer to any other attribute of interest (also other attributes may benefit from treating them in a continuous range of values, rather than as categorical variables, eg. "age"). 2. The proposed methodology raises several concerns regarding its novelty and effectiveness. The required preprocessing step appears to be a general solution that could be applied
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Big Data Technologies and Applications
