ProtoFair: Fair Self-Supervised Contrastive Learning via Pseudo-Counterfactual Pairs

Marah Halawa; Olaf Hellwich

arXiv:2605.01971·cs.CV·May 5, 2026

ProtoFair: Fair Self-Supervised Contrastive Learning via Pseudo-Counterfactual Pairs

Marah Halawa, Olaf Hellwich

PDF

TL;DR

ProtoFair introduces a fairness-aware contrastive loss that enhances demographic fairness in self-supervised visual representations without altering existing SSL objectives.

Contribution

It proposes a novel pseudo-counterfactual pairing method using prototype clustering, compatible with various SSL frameworks, to improve fairness.

Findings

01

Achieves fairness improvements on CelebA and UTKFace datasets.

02

Maintains competitive accuracy while reducing demographic bias.

03

Seamlessly integrates with SimCLR and SupCon frameworks.

Abstract

Self-supervised learning methods learn high-quality visual representations, yet recent studies show that these representations often capture demographic biases present in the training data. Existing fairness-aware methods address this by redesigning the self-supervised objective itself, limiting portability across the rapidly evolving landscape of self-supervised learning (SSL) frameworks. We propose ProtoFair, a fairness-aware contrastive loss designed to work alongside existing SSL objectives without modifying them. ProtoFair leverages unsupervised prototype clustering to identify pseudo-counterfactual pairs: samples sharing the same cluster assignment but belonging to different sensitive groups. By pulling these content-matched, cross-group samples together in the embedding space, ProtoFair encourages the encoder to learn representations that are invariant to the sensitive attribute.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.