CXR-LanIC: Language-Grounded Interpretable Classifier for Chest X-Ray Diagnosis
Yiming Tang, Wenjia Zhong, Rushi Shah, Dianbo Liu

TL;DR
CXR-LanIC introduces a novel interpretable framework for chest X-ray diagnosis that decomposes predictions into verifiable visual patterns, enhancing transparency and clinical trust.
Contribution
The paper presents a task-aligned pattern discovery method using sparse autoencoders on multimodal embeddings to produce clinically relevant, interpretable visual features for chest X-ray diagnosis.
Findings
Discovered approximately 5,000 interpretable visual patterns across various radiological categories.
Achieved competitive diagnostic accuracy on five key chest X-ray findings.
Enabled transparent attribution of predictions through verifiable activation galleries.
Abstract
Deep learning models have achieved remarkable accuracy in chest X-ray diagnosis, yet their widespread clinical adoption remains limited by the black-box nature of their predictions. Clinicians require transparent, verifiable explanations to trust automated diagnoses and identify potential failure modes. We introduce CXR-LanIC (Language-Grounded Interpretable Classifier for Chest X-rays), a novel framework that addresses this interpretability challenge through task-aligned pattern discovery. Our approach trains transcoder-based sparse autoencoders on a BiomedCLIP diagnostic classifier to decompose medical image representations into interpretable visual patterns. By training an ensemble of 100 transcoders on multimodal embeddings from the MIMIC-CXR dataset, we discover approximately 5,000 monosemantic patterns spanning cardiac, pulmonary, pleural, structural, device, and artifact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
