Predictive Representativity: Uncovering Racial Bias in AI-based Skin Cancer Detection

Andr\'es Morales-Forero (1); Lili J. Rueda (2); Ronald Herrera (3); Samuel Bassetto (1); Eric Coatanea (4) ((1) Polytechnique Montr\'eal; (2) Universidad El Bosque; (3) Boehringer Ingelheim International GmbH; (4) Tampere University)

arXiv:2507.14176·cs.LG·July 22, 2025

Predictive Representativity: Uncovering Racial Bias in AI-based Skin Cancer Detection

Andr\'es Morales-Forero (1), Lili J. Rueda (2), Ronald Herrera (3), Samuel Bassetto (1), Eric Coatanea (4) ((1) Polytechnique Montr\'eal, (2) Universidad El Bosque, (3) Boehringer Ingelheim International GmbH, (4) Tampere University)

PDF

TL;DR

This paper introduces Predictive Representativity, a framework for fairness auditing in AI skin cancer detection, revealing performance disparities across skin types and emphasizing the importance of outcome-level equity and context-sensitive fairness assessment.

Contribution

It proposes a novel framework for fairness auditing called Predictive Representativity that focuses on outcome-level equity rather than dataset composition.

Findings

01

Classifiers underperform on darker skin phototypes despite proportional sampling.

02

Performance disparities persist across different datasets and contexts.

03

Highlights the need for post-hoc fairness auditing and inclusive validation pipelines.

Abstract

Artificial intelligence (AI) systems increasingly inform medical decision-making, yet concerns about algorithmic bias and inequitable outcomes persist, particularly for historically marginalized populations. This paper introduces the concept of Predictive Representativity (PR), a framework of fairness auditing that shifts the focus from the composition of the data set to outcomes-level equity. Through a case study in dermatology, we evaluated AI-based skin cancer classifiers trained on the widely used HAM10000 dataset and on an independent clinical dataset (BOSQUE Test set) from Colombia. Our analysis reveals substantial performance disparities by skin phototype, with classifiers consistently underperforming for individuals with darker skin, despite proportional sampling in the source data. We argue that representativity must be understood not as a static feature of datasets but as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.