MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation
Thuy Truong Tran, Minh Kha Do, Phuc Nguyen Duy, Min Hun Lee

TL;DR
MedSAD-CLIP introduces a supervised adaptation of CLIP with token-patch cross-attention and contrastive loss, significantly improving medical anomaly detection and segmentation accuracy across multiple datasets.
Contribution
This work presents MedSAD-CLIP, a novel method that enhances CLIP for medical anomaly detection by integrating fine-grained cross-attention and domain-specific training techniques.
Findings
Outperforms state-of-the-art in segmentation and classification
Effective across diverse medical datasets
Maintains CLIP's generalization while improving localization
Abstract
Medical anomaly detection (MAD) and segmentation play a critical role in assisting clinical diagnosis by identifying abnormal regions in medical images and localizing pathological regions. Recent CLIP-based studies are promising for anomaly detection in zero-/few-shot settings, and typically rely on global representations and weak supervision, often producing coarse localization and limited segmentation quality. In this work, we study supervised adaptation of CLIP for MAD under a realistic clinical setting where a limited yet meaningful amount of labeled abnormal data is available. Our model MedSAD-CLIP leverages fine-grained text-visual cues via the Token-Patch Cross-Attention(TPCA) to improve lesion localization while preserving the generalization capability of CLIP representations. Lightweight image adapters and learnable prompt tokens efficiently adapt the pretrained CLIP encoder to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Adversarial Robustness in Machine Learning
