Towards generating more interpretable counterfactuals via concept vectors: a preliminary study on chest X-rays

Bulat Maksudov; Kathleen Curran; Alessandra Mileo

arXiv:2506.04058·eess.IV·June 5, 2025

Towards generating more interpretable counterfactuals via concept vectors: a preliminary study on chest X-rays

Bulat Maksudov, Kathleen Curran, Alessandra Mileo

PDF

Open Access 1 Repo

TL;DR

This paper explores using concept vectors in generative models to produce interpretable counterfactual explanations for chest X-ray images, aiming to improve clinical interpretability of AI models.

Contribution

It introduces a method to map clinical concepts into the latent space of autoencoders for generating concept-based counterfactuals without explicit labels.

Findings

01

Concept vectors are stable across datasets.

02

Counterfactuals can exaggerate or reduce clinical features.

03

Promising results on large pathologies like cardiomegaly.

Abstract

An essential step in deploying medical imaging models is ensuring alignment with clinical knowledge and interpretability. We focus on mapping clinical concepts into the latent space of generative models to identify Concept Activation Vectors (CAVs). Using a simple reconstruction autoencoder, we link user-defined concepts to image-level features without explicit label training. The extracted concepts are stable across datasets, enabling visual explanations that highlight clinically relevant features. By traversing latent space along concept directions, we produce counterfactuals that exaggerate or reduce specific clinical features. Preliminary results on chest X-rays show promise for large pathologies like cardiomegaly, while smaller pathologies remain challenging due to reconstruction limits. Although not outperforming baselines, this approach offers a path toward interpretable,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Luab/cavisual
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · COVID-19 diagnosis using AI

MethodsFocus · Counterfactuals Explanations