Bimodal network architectures for automatic generation of image   annotation from text

Mehdi Moradi; Ali Madani; Yaniv Gur; Yufan Guo; Tanveer Syeda-Mahmood

arXiv:1809.01610·cs.CV·September 9, 2018·1 cites

Bimodal network architectures for automatic generation of image annotation from text

Mehdi Moradi, Ali Madani, Yaniv Gur, Yufan Guo, Tanveer Syeda-Mahmood

PDF

Open Access

TL;DR

This paper introduces two deep neural network architectures that automatically generate image annotations from text reports, significantly reducing the need for costly manual labeling in medical imaging.

Contribution

It proposes two novel bimodal neural network architectures for automatic ROI annotation in medical images using text, with one leveraging pre-trained image features for improved accuracy.

Findings

01

Both architectures successfully estimate ROIs from text in chest X-ray images.

02

The pre-trained image network architecture outperforms the other, with ROI centroids within 5.1% of ground truth.

03

The methods are validated against clinical annotations, demonstrating practical applicability.

Abstract

Medical image analysis practitioners have embraced big data methodologies. This has created a need for large annotated datasets. The source of big data is typically large image collections and clinical reports recorded for these images. In many cases, however, building algorithms aimed at segmentation and detection of disease requires a training dataset with markings of the areas of interest on the image that match with the described anomalies. This process of annotation is expensive and needs the involvement of clinicians. In this work we propose two separate deep neural network architectures for automatic marking of a region of interest (ROI) on the image best representing a finding location, given a textual report or a set of keywords. One architecture consists of LSTM and CNN components and is trained end to end with images, matching text, and markings of ROIs for those images. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · AI in cancer detection

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory