Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting
Chantal Pellegrini, Matthias Keicher, Ege \"Ozsoy, Nassir Navab

TL;DR
Rad-ReStruct introduces a new benchmark dataset and a hierarchical VQA method for automating structured radiology reporting, aiming to improve accuracy and efficiency in medical image analysis.
Contribution
It presents the Rad-ReStruct dataset and the hi-VQA method, enabling automated, structured radiology report generation with hierarchical question answering.
Findings
hi-VQA achieves competitive results on VQARad.
hi-VQA outperforms non-domain pretraining methods.
Rad-ReStruct provides a new benchmark for future research.
Abstract
Radiology reporting is a crucial part of the communication between radiologists and other medical professionals, but it can be time-consuming and error-prone. One approach to alleviate this is structured reporting, which saves time and enables a more accurate evaluation than free-text reports. However, there is limited research on automating structured reporting, and no public benchmark is available for evaluating and comparing different methods. To close this gap, we introduce Rad-ReStruct, a new benchmark dataset that provides fine-grained, hierarchically ordered annotations in the form of structured reports for X-Ray images. We model the structured reporting task as hierarchical visual question answering (VQA) and propose hi-VQA, a novel method that considers prior context in the form of previously asked questions and answers for populating a structured radiology report. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
