Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model
Peng Huang, Bowen Guo, Shuyu Liang, Junhu Fu, Yuanyuan Wang, Yi Guo

TL;DR
This paper introduces Diff-CXR, a diffusion-based framework for generating chest X-ray images from reports, incorporating disease knowledge and noise filtering to improve realism and accuracy, with promising clinical potential.
Contribution
The novel Diff-CXR framework integrates disease knowledge and noise filtering strategies into report-to-CXR generation, advancing medical image synthesis methods.
Findings
Outperforms state-of-the-art methods by 33.4% in FID on MIMIC-CXR
Achieves 23.8% improvement in mAUC score on disease classification
Enables effective training with limited real data using synthetic images.
Abstract
Text-To-Image (TTI) generation is significant for controlled and diverse image generation with broad potential applications. Although current medical TTI methods have made some progress in report-to-Chest-Xray (CXR) generation, their generation performance may be limited due to the intrinsic characteristics of medical data. In this paper, we propose a novel disease-knowledge enhanced Diffusion-based TTI learning framework, named Diff-CXR, for medical report-to-CXR generation. First, to minimize the negative impacts of noisy data on generation, we devise a Latent Noise Filtering Strategy that gradually learns the general patterns of anomalies and removes them in the latent space. Then, an Adaptive Vision-Aware Textual Learning Strategy is designed to learn concise and important report embeddings in a domain-specific Vision-Language Model, providing textual guidance for Chest-Xray…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChemokine receptors and signaling
MethodsDiffusion
