A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images
Pablo Messina, Pablo Pino, Denis Parra, Alvaro Soto, Cecilia Besa,, Sergio Uribe, Marcelo and\'ia, Cristian Tejos, Claudia Prieto, Daniel, Capurro

TL;DR
This survey reviews deep learning methods for automatic medical report generation from images, highlighting recent advances, challenges in evaluation metrics, and the importance of explainability in clinical AI applications.
Contribution
It provides a comprehensive overview of datasets, architectures, explainability, and evaluation metrics in the field, identifying key challenges and future directions.
Findings
Current evaluation metrics are inadequate for medical correctness.
Deep neural networks are increasingly used for report generation.
Explainability remains a significant challenge in clinical AI.
Abstract
Every year physicians face an increasing demand of image-based diagnosis from patients, a problem that can be addressed with recent artificial intelligence methods. In this context, we survey works in the area of automatic report generation from medical images, with emphasis on methods using deep neural networks, with respect to: (1) Datasets, (2) Architecture Design, (3) Explainability and (4) Evaluation Metrics. Our survey identifies interesting developments, but also remaining challenges. Among them, the current evaluation of generated reports is especially weak, since it mostly relies on traditional Natural Language Processing (NLP) metrics, which do not accurately capture medical correctness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Medical Imaging and Analysis
