Multi-Label Annotation of Chest Abdomen Pelvis Computed Tomography Text Reports Using Deep Learning
Vincent M. D'Anniballe, Fakrul Islam Tushar, Khrystyna Faryna, Songyue, Han, Maciej A. Mazurowski, Geoffrey D. Rubin, Joseph Y. Lo

TL;DR
This study developed a deep learning-based pipeline for automated, high-throughput multi-label annotation of chest, abdomen, and pelvis CT reports, achieving high accuracy and robustness across diverse diseases and organ systems.
Contribution
The paper introduces a novel combination of rule-based algorithms and attention-guided RNNs for scalable, accurate multi-label annotation of radiology reports, surpassing previous methods.
Findings
Pre-trained models outperform random initialization.
High AUC (>0.95) for all disease outcomes.
Robust performance even with reduced training data.
Abstract
Purpose: To develop high throughput multi-label annotators for body (chest, abdomen, and pelvis) Computed Tomography (CT) reports that can be applied across a variety of abnormalities, organs, and disease states. Approach: We used a dictionary approach to develop rule-based algorithms (RBA) for extraction of disease labels from radiology text reports. We targeted three organ systems (lungs/pleura, liver/gallbladder, kidneys/ureters) with four diseases per system based on their prevalence in our dataset. To expand the algorithms beyond pre-defined keywords, attention-guided recurrent neural networks (RNN) were trained using the RBA-extracted labels to classify reports as being positive for one or more diseases or normal for each organ system. Confounding effects on model performance were evaluated using random initialization or pre-trained embedding as well as different sizes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Biomedical Text Mining and Ontologies
