MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation

Thuy Truong Tran; Minh Kha Do; Phuc Nguyen Duy; Min Hun Lee

arXiv:2603.17325·cs.CV·March 19, 2026

MedSAD-CLIP: Supervised CLIP with Token-Patch Cross-Attention for Medical Anomaly Detection and Segmentation

Thuy Truong Tran, Minh Kha Do, Phuc Nguyen Duy, Min Hun Lee

PDF

Open Access

TL;DR

MedSAD-CLIP introduces a supervised adaptation of CLIP with token-patch cross-attention and contrastive loss, significantly improving medical anomaly detection and segmentation accuracy across multiple datasets.

Contribution

This work presents MedSAD-CLIP, a novel method that enhances CLIP for medical anomaly detection by integrating fine-grained cross-attention and domain-specific training techniques.

Findings

01

Outperforms state-of-the-art in segmentation and classification

02

Effective across diverse medical datasets

03

Maintains CLIP's generalization while improving localization

Abstract

Medical anomaly detection (MAD) and segmentation play a critical role in assisting clinical diagnosis by identifying abnormal regions in medical images and localizing pathological regions. Recent CLIP-based studies are promising for anomaly detection in zero-/few-shot settings, and typically rely on global representations and weak supervision, often producing coarse localization and limited segmentation quality. In this work, we study supervised adaptation of CLIP for MAD under a realistic clinical setting where a limited yet meaningful amount of labeled abnormal data is available. Our model MedSAD-CLIP leverages fine-grained text-visual cues via the Token-Patch Cross-Attention(TPCA) to improve lesion localization while preserving the generalization capability of CLIP representations. Lightweight image adapters and learnable prompt tokens efficiently adapt the pretrained CLIP encoder to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Adversarial Robustness in Machine Learning