Evaluating Strategies for Synthesizing Clinical Notes for Medical Multimodal AI
Niccolo Marini, Zhaohui Liang, Sivaramakrishnan Rajaraman, Zhiyun Xue, Sameer Antani

TL;DR
This paper explores methods for generating synthetic clinical notes using LLMs to improve multimodal dermatology AI models, enhancing classification accuracy and enabling cross-modal retrieval despite limited real data.
Contribution
It introduces strategies for prompt design and metadata inclusion to synthesize clinical notes, improving model robustness and cross-modal capabilities in medical AI.
Findings
Synthetic notes improve classification accuracy under domain shift
Enhanced cross-modal retrieval capabilities are demonstrated
Synthetic data mitigates data scarcity issues in dermatology AI
Abstract
Multimodal (MM) learning is emerging as a promising paradigm in biomedical artificial intelligence (AI) applications, integrating complementary modality, which highlight different aspects of patient health. The scarcity of large heterogeneous biomedical MM data has restrained the development of robust models for medical AI applications. In the dermatology domain, for instance, skin lesion datasets typically include only images linked to minimal metadata describing the condition, thereby limiting the benefits of MM data integration for reliable and generalizable predictions. Recent advances in Large Language Models (LLMs) enable the synthesis of textual description of image findings, potentially allowing the combination of image and text representations. However, LLMs are not specifically trained for use in the medical domain, and their naive inclusion has raised concerns about the risk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Machine Learning in Healthcare · Multimodal Machine Learning Applications
