Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery   with SAM

Xin Hu; Janet Wang; Jihun Hamm; Rie R Yotsu; Zhengming Ding

arXiv:2409.09520·cs.CV·January 17, 2025

Enhancing Skin Disease Diagnosis: Interpretable Visual Concept Discovery with SAM

Xin Hu, Janet Wang, Jihun Hamm, Rie R Yotsu, Zhengming Ding

PDF

Open Access

TL;DR

This paper introduces a novel framework that leverages the Segment Anything Model (SAM) to generate interpretable visual concepts for skin disease diagnosis, addressing challenges of noisy clinical images and limited annotations.

Contribution

It proposes a cross-attentive fusion approach that combines SAM-generated visual concepts with global features for improved interpretability and diagnosis accuracy in skin lesion classification.

Findings

01

Effective lesion diagnosis demonstrated on two datasets.

02

Enhanced interpretability of skin disease predictions.

03

Improved performance over baseline models.

Abstract

Current AI-assisted skin image diagnosis has achieved dermatologist-level performance in classifying skin cancer, driven by rapid advancements in deep learning architectures. However, unlike traditional vision tasks, skin images in general present unique challenges due to the limited availability of well-annotated datasets, complex variations in conditions, and the necessity for detailed interpretations to ensure patient safety. Previous segmentation methods have sought to reduce image noise and enhance diagnostic performance, but these techniques require fine-grained, pixel-level ground truth masks for training. In contrast, with the rise of foundation models, the Segment Anything Model (SAM) has been introduced to facilitate promptable segmentation, enabling the automation of the segmentation process with simple yet effective prompts. Efforts applying SAM predominantly focus on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Imaging for Blood Diseases

MethodsFocus · Segment Anything Model