MedGrad E-CLIP: Enhancing Trust and Transparency in AI-Driven Skin Lesion Diagnosis
Sadia Kamal, Tim Oates

TL;DR
This paper introduces MedGrad E-CLIP, a novel vision-language model that improves transparency and trust in AI skin lesion diagnosis by providing explainable visual and textual insights into model decisions.
Contribution
It proposes MedGrad E-CLIP, an enhanced gradient-based CLIP method with a weighted entropy mechanism tailored for medical imaging, improving interpretability in skin lesion classification.
Findings
Enhanced model transparency with visual explanations.
Improved trustworthiness in AI diagnosis.
Effective classification of skin lesions using descriptions.
Abstract
As deep learning models gain attraction in medical data, ensuring transparent and trustworthy decision-making is essential. In skin cancer diagnosis, while advancements in lesion detection and classification have improved accuracy, the black-box nature of these methods poses challenges in understanding their decision processes, leading to trust issues among physicians. This study leverages the CLIP (Contrastive Language-Image Pretraining) model, trained on different skin lesion datasets, to capture meaningful relationships between visual features and diagnostic criteria terms. To further enhance transparency, we propose a method called MedGrad E-CLIP, which builds on gradient-based E-CLIP by incorporating a weighted entropy mechanism designed for complex medical imaging like skin lesions. This approach highlights critical image regions linked to specific diagnostic descriptions. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Language-Image Pre-training
