A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data
Xinyi Wang, Grazziela Figueredo, Ruizhe Li, Wei Emma Zhang, Weitong, Chen, Xin Chen

TL;DR
This survey reviews recent deep learning techniques for radiology report generation from multimodal data, highlighting key methods, workflows, datasets, and future challenges in the field.
Contribution
It provides a comprehensive overview of the latest deep learning approaches, including large models and explainability, for multimodal radiology report generation, and offers a unified workflow and comparative analysis.
Findings
Summarizes key techniques and workflows in multimodal report generation.
Highlights state-of-the-art methods and datasets in the field.
Provides a quantitative comparison of different approaches.
Abstract
Automatic radiology report generation can alleviate the workload for physicians and minimize regional disparities in medical resources, therefore becoming an important topic in the medical image analysis field. It is a challenging task, as the computational model needs to mimic physicians to obtain information from multi-modal input data (i.e., medical images, clinical information, medical knowledge, etc.), and produce comprehensive and accurate reports. Recently, numerous works have emerged to address this issue using deep-learning-based methods, such as transformers, contrastive learning, and knowledge-base construction. This survey summarizes the key techniques developed in the most recent works and proposes a general workflow for deep-learning-based report generation with five main components, including multi-modality data acquisition, data preparation, feature learning, feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Radiomics and Machine Learning in Medical Imaging
