Stable Vision Concept Transformers for Medical Diagnosis
Lijie Hu, Songning Lai, Yuan Hua, Shu Yang, Jingfeng Zhang, Di Wang

TL;DR
This paper introduces Stable Vision Concept Transformers (SVCT) for medical diagnosis, combining concept-based interpretability with robustness to input perturbations, while maintaining high accuracy on medical datasets.
Contribution
It proposes the SVCT model that fuses concept features with image features and uses diffusion smoothing for stable, faithful explanations in medical imaging.
Findings
SVCT maintains accuracy comparable to baseline models.
SVCT provides stable explanations under input perturbations.
Experiments on four datasets validate interpretability and robustness.
Abstract
Transparency is a paramount concern in the medical field, prompting researchers to delve into the realm of explainable AI (XAI). Among these XAI methods, Concept Bottleneck Models (CBMs) aim to restrict the model's latent space to human-understandable high-level concepts by generating a conceptual layer for extracting conceptual features, which has drawn much attention recently. However, existing methods rely solely on concept features to determine the model's predictions, which overlook the intrinsic feature embeddings within medical images. To address this utility gap between the original models and concept-based models, we propose Vision Concept Transformer (VCT). Furthermore, despite their benefits, CBMs have been found to negatively impact model performance and fail to provide stable explanations when faced with input perturbations, which limits their application in the medical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare
MethodsDropout · Absolute Position Encodings · Byte Pair Encoding · Softmax · Label Smoothing · Transformer · Diffusion · Dense Connections · Layer Normalization · Vision Transformer
