A study on the distribution of social biases in self-supervised learning visual models
Kirill Sirotkin, Pablo Carballeira, Marcos Escudero-Vi\~nolo

TL;DR
This paper investigates social biases in self-supervised visual models trained on ImageNet, revealing that bias presence varies with model type and can be mitigated through careful selection without sacrificing performance.
Contribution
It provides an empirical analysis of social biases in SSL visual models, demonstrating the correlation with model type and offering insights for bias reduction strategies.
Findings
Bias varies with SSL model type
Bias number does not strictly depend on accuracy
Careful model selection can reduce biases
Abstract
Deep neural networks are efficient at learning the data distribution if it is sufficiently sampled. However, they can be strongly biased by non-relevant factors implicitly incorporated in the training data. These include operational biases, such as ineffective or uneven data sampling, but also ethical concerns, as the social biases are implicitly present\textemdash even inadvertently, in the training data or explicitly defined in unfair training schedules. In tasks having impact on human processes, the learning of social biases may produce discriminatory, unethical and untrustworthy consequences. It is often assumed that social biases stem from supervised learning on labelled data, and thus, Self-Supervised Learning (SSL) wrongly appears as an efficient and bias-free solution, as it does not require labelled data. However, it was recently proven that a popular SSL method also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Data Classification
