CXR-LanIC: Language-Grounded Interpretable Classifier for Chest X-Ray Diagnosis

Yiming Tang; Wenjia Zhong; Rushi Shah; Dianbo Liu

arXiv:2510.21464·cs.CV·May 20, 2026

TL;DR

CXR-LanIC is an interpretable framework for chest X-ray diagnosis that decomposes each prediction into visual patterns a clinician can verify, with the aim of improving transparency and clinical trust.

Contribution

The paper presents a task-aligned pattern discovery method using sparse autoencoders on multimodal embeddings to produce clinically relevant, interpretable visual features for chest X-ray diagnosis.

Findings

01. Discovered approximately 5,000 interpretable visual patterns across various radiological categories.

02. Achieved competitive diagnostic accuracy on five key chest X-ray findings.

03. Enabled transparent attribution of predictions through verifiable activation galleries.
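To make the idea of an activation gallery concrete, here is a minimal sketch. It assumes that a gallery is simply the set of images on which a given pattern fires most strongly; the paper summary does not spell out the construction, so the function name `build_activation_gallery`, the parameter `k`, and the toy data are all hypothetical.

```python
import heapq


def build_activation_gallery(activations, image_ids, k=3):
    """Return, for each pattern, the k images with the largest positive activation.

    activations[i][j] is the activation of pattern j on image i.
    The result maps pattern index -> list of (image_id, activation),
    sorted from strongest to weakest. This is an illustrative assumption
    about how a gallery could be built, not the authors' procedure.
    """
    n_patterns = len(activations[0])
    gallery = {}
    for j in range(n_patterns):
        # Keep only images on which pattern j is actually active.
        scored = [
            (row[j], image_id)
            for row, image_id in zip(activations, image_ids)
            if row[j] > 0
        ]
        top = heapq.nlargest(k, scored)
        gallery[j] = [(image_id, act) for act, image_id in top]
    return gallery


# Toy data: 3 images, 2 patterns (values are made up for illustration).
toy_activations = [[0.9, 0.0], [0.1, 0.5], [0.4, 0.0]]
toy_image_ids = ["img_a", "img_b", "img_c"]
toy_gallery = build_activation_gallery(toy_activations, toy_image_ids, k=2)
```

A reader could then inspect `toy_gallery[0]` to see which images drive pattern 0 and judge whether they share a coherent visual finding.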

Abstract

Deep learning models have achieved remarkable accuracy in chest X-ray diagnosis, yet their widespread clinical adoption remains limited by the black-box nature of their predictions. Clinicians require transparent, verifiable explanations to trust automated diagnoses and identify potential failure modes. We introduce CXR-LanIC (Language-Grounded Interpretable Classifier for Chest X-rays), a novel framework that addresses this interpretability challenge through task-aligned pattern discovery. Our approach trains transcoder-based sparse autoencoders on a BiomedCLIP diagnostic classifier to decompose medical image representations into interpretable visual patterns. By training an ensemble of 100 transcoders on multimodal embeddings from the MIMIC-CXR dataset, we discover approximately 5,000 monosemantic patterns spanning cardiac, pulmonary, pleural, structural, device, and artifact…
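As a rough illustration of the transcoder-based sparse autoencoder idea mentioned in the abstract, the sketch below shows one forward pass and a loss. It assumes a ReLU encoder, a linear decoder, and an L1 sparsity penalty; the abstract does not give the architecture or objective, so these choices, the function names, and the coefficient `l1_coeff` are assumptions rather than the authors' implementation.

```python
def matvec(weights, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


def transcoder_forward(x, w_enc, b_enc, w_dec, b_dec):
    """Encode x into non-negative latent activations, then decode.

    In a transcoder the decoder predicts a target representation
    (for example a later layer's output) rather than reconstructing x.
    """
    pre = [a + b for a, b in zip(matvec(w_enc, x), b_enc)]
    z = [max(0.0, p) for p in pre]  # ReLU keeps only active patterns
    y_hat = [a + b for a, b in zip(matvec(w_dec, z), b_dec)]
    return z, y_hat


def transcoder_loss(y_hat, y, z, l1_coeff=1e-3):
    """Mean squared prediction error plus an L1 penalty on the latents."""
    mse = sum((a - b) ** 2 for a, b in zip(y_hat, y)) / len(y)
    l1 = sum(abs(a) for a in z)
    return mse + l1_coeff * l1


# Toy example: 2-dimensional input, 3 latent patterns (hand-picked weights).
w_enc = [[1, 0], [0, 1], [-1, -1]]
b_enc = [0.0, 0.0, 0.0]
w_dec = [[1, 0, 0], [0, 1, 0]]
b_dec = [0.0, 0.0]

z, y_hat = transcoder_forward([1.0, -2.0], w_enc, b_enc, w_dec, b_dec)
loss = transcoder_loss(y_hat, [1.0, 0.0], z)
```

In this toy case only two of the three latents are active, which is the kind of sparsity that lets each active latent be inspected as a separate pattern.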
