Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback
Shenghuan Sun, Gregory M. Goldgof, Atul Butte, Ahmed M. Alaa

TL;DR
This paper presents a human-in-the-loop framework that uses expert pathologist feedback to improve the clinical plausibility and quality of synthetic medical images generated by diffusion models.
Contribution
It introduces a novel pathologist-in-the-loop approach that trains a reward model from expert feedback to enhance synthetic medical image generation.
Findings
Human feedback improves image fidelity and diversity.
The reward model effectively guides diffusion model finetuning.
Enhanced images are more useful for downstream clinical applications.
Abstract
Generative models capable of capturing nuanced clinical features in medical images hold great promise for facilitating clinical data sharing, enhancing rare disease datasets, and efficiently synthesizing annotated medical images at scale. Despite their potential, assessing the quality of synthetic medical images remains a challenge. While modern generative models can synthesize visually-realistic medical images, the clinical validity of these images may be called into question. Domain-agnostic scores, such as FID score, precision, and recall, cannot incorporate clinical knowledge and are, therefore, not suitable for assessing clinical sensibility. Additionally, there are numerous unpredictable ways in which generative models may fail to synthesize clinically plausible images, making it challenging to anticipate potential failures and manually design scores for their detection. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · AI in cancer detection · Cell Image Analysis Techniques
MethodsDiffusion
