CLARiTy: A Vision Transformer for Multi-Label Classification and Weakly-Supervised Localization of Chest X-ray Pathologies
John M. Statheros, Hairong Wang, Richard Klein

TL;DR
CLARiTy is a vision transformer model that improves multi-label classification and weakly-supervised localization of chest X-ray pathologies, especially small ones, using class-specific attention and anatomical priors, with high efficiency and state-of-the-art results.
Contribution
Introduces CLARiTy, a novel ViT-based framework that jointly performs multi-label classification and localization with explicit anatomical priors and class-specific tokens, surpassing prior CNN and hybrid models.
Findings
Achieves competitive classification accuracy across 14 pathologies.
Sets new state-of-the-art in localization performance on 8 pathologies.
Outperforms prior methods by 50.7% in localization accuracy.
Abstract
The interpretation of chest X-rays (CXRs) poses significant challenges, particularly in achieving accurate multi-label pathology classification and spatial localization. These tasks demand different levels of annotation granularity but are frequently constrained by the scarcity of region-level (dense) annotations. We introduce CLARiTy (Class Localizing and Attention Refining Image Transformer), a vision transformer-based model for joint multi-label classification and weakly-supervised localization of thoracic pathologies. CLARiTy employs multiple class-specific tokens to generate discriminative attention maps, and a SegmentCAM module for foreground segmentation and background suppression using explicit anatomical priors. Trained on image-level labels from the NIH ChestX-ray14 dataset, it leverages distillation from a ConvNeXtV2 teacher for efficiency. Evaluated on the official NIH…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · AI in cancer detection · Advanced Neural Network Applications
