CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs

David M\'endez; Gianpaolo Bontempo; Elisa Ficarra; Roberto Confalonieri; Natalia D\'iaz-Rodr\'iguez

arXiv:2505.11060·cs.CV·May 19, 2025

CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs

David M\'endez, Gianpaolo Bontempo, Elisa Ficarra, Roberto Confalonieri, Natalia D\'iaz-Rodr\'iguez

PDF

Open Access 1 Repo

TL;DR

CUBIC is a novel method that automatically identifies biases in vision models by analyzing concept influences in the image-text latent space, without needing annotated bias examples or predefined bias concepts.

Contribution

CUBIC introduces an unsupervised approach leveraging vision-language models and linear probes to discover bias-inducing concepts without prior bias annotations or failure examples.

Findings

01

CUBIC uncovers previously unknown biases in vision models.

02

It operates without requiring dataset failure samples or bias annotations.

03

The method effectively identifies influential concepts affecting model predictions.

Abstract

Deep vision models often rely on biases learned from spurious correlations in datasets. To identify these biases, methods that interpret high-level, human-understandable concepts are more effective than those relying primarily on low-level features like heatmaps. A major challenge for these concept-based methods is the lack of image annotations indicating potentially bias-inducing concepts, since creating such annotations requires detailed labeling for each dataset and concept, which is highly labor-intensive. We present CUBIC (Concept embeddings for Unsupervised Bias IdentifiCation), a novel method that automatically discovers interpretable concepts that may bias classifier behavior. Unlike existing approaches, CUBIC does not rely on predefined bias candidates or examples of model failures tied to specific biases, as such information is not always available. Instead, it leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

david-mnd/cubic
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning