MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin   Lesion Diagnosis

Sadia Kamal; Tim Oates

arXiv:2501.06887·cs.CV·January 14, 2025

MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis

Sadia Kamal, Tim Oates

PDF

TL;DR

This paper introduces MedGrad E-CLIP, a novel vision-language model that improves transparency and trust in AI skin lesion diagnosis by providing explainable visual and textual insights into model decisions.

Contribution

It proposes MedGrad E-CLIP, an enhanced gradient-based CLIP method with a weighted entropy mechanism tailored for medical imaging, improving interpretability in skin lesion classification.

Findings

01

Enhanced model transparency with visual explanations.

02

Improved trustworthiness in AI diagnosis.

03

Effective classification of skin lesions using descriptions.

Abstract

As deep learning models gain attraction in medical data, ensuring transparent and trustworthy decision-making is essential. In skin cancer diagnosis, while advancements in lesion detection and classification have improved accuracy, the black-box nature of these methods poses challenges in understanding their decision processes, leading to trust issues among physicians. This study leverages the CLIP (Contrastive Language-Image Pretraining) model, trained on different skin lesion datasets, to capture meaningful relationships between visual features and diagnostic criteria terms. To further enhance transparency, we propose a method called MedGrad E-CLIP, which builds on gradient-based E-CLIP by incorporating a weighted entropy mechanism designed for complex medical imaging like skin lesions. This approach highlights critical image regions linked to specific diagnostic descriptions. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Language-Image Pre-training