MedConcept: Unsupervised Concept Discovery for Interpretability in Medical VLMs
Md Rakibul Haque, KM Arefeen Sultan, Tushar Kataria, Shireen Elhabian

TL;DR
MedConcept is an unsupervised framework that discovers and interprets latent medical concepts in pretrained vision-language models, enhancing transparency and trust in clinical applications.
Contribution
It introduces a novel unsupervised method for uncovering and grounding medical concepts in VLMs, with a quantitative evaluation protocol for interpretability.
Findings
Identifies sparse neuron-level concept activations in medical VLMs.
Translates activations into pseudo-report summaries for clinician inspection.
Provides a quantitative semantic verification protocol for interpretability assessment.
Abstract
While medical Vision-Language models (VLMs) achieve strong performance on tasks such as tumor or organ segmentation and diagnosis prediction, their opaque latent representations limit clinical trust and the ability to explain predictions. Interpretability of these multimodal representations are therefore essential for the trustworthy clinical deployment of pretrained medical VLMs. However, current interpretability methods, such as gradient- or attention-based visualizations, are often limited to specific tasks such as classification. Moreover, they do not provide concept-level explanations derived from shared pretrained representations that can be reused across downstream tasks. We introduce MedConcept, a framework that uncovers latent medical concepts in a fully unsupervised manner and grounds them in clinically verifiable textual semantics. MedConcept identifies sparse neuron-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
