Validation of a Dermatology-Focused Multimodal Large Language Model in Classification of Pigmented Skin Lesions
Joshua Mijares, Neil Jairath, Andrew Zhang, Syril Keena T. Que

TL;DR
A dermatology-focused AI model, DermFlow, outperformed both a general AI and clinicians in diagnosing pigmented skin lesions, showing high accuracy and potential for clinical use.
Contribution
The paper introduces and validates a dermatology-specific multimodal AI model for pigmented lesion classification.
Findings
DermFlow achieved 93.9% sensitivity and 89.5% specificity in lesion diagnosis.
DermFlow outperformed both clinicians and the general AI model Claude in diagnostic accuracy.
DermFlow recommended biopsy in 95.6% of cases, significantly higher than Claude's 82.4%.
Abstract
Background: Artificial intelligence (AI) has shown significant promise in augmenting diagnostic capabilities across medical specialties. Recent advancements in generative AI allow for synthesis and interpretation of complex clinical data including imaging and patient history to assess disease risk. Objective: To evaluate the diagnostic performance of a dermatology-trained multimodal large language model (DermFlow, Delaware, USA) in assessing malignancy risk of pigmented skin lesions. Methods: This retrospective study utilized data from 59 patients with 68 biopsy-proven pigmented skin lesions seen at Indiana University clinics from February 2023 to May 2025. De-identified patient histories and clinical images were input into DermFlow, and clinical images only were input into Claude Sonnet 4 (Claude) to generate differential diagnoses. Clinician pre-operative diagnoses were extracted from…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Artificial Intelligence in Healthcare and Education · AI in cancer detection
