Radiology Text Analysis System (RadText): Architecture and Evaluation
Song Wang, Mingquan Lin, Ying Ding, George Shih, Zhiyong Lu, Yifan, Peng

TL;DR
RadText is an open-source, modular system for automated radiology report analysis that improves efficiency, privacy, and standardization, demonstrated by high accuracy on the MIMIC-CXR dataset.
Contribution
This work introduces RadText, a flexible, hybrid, open-source radiology text analysis system with standardized interfaces and improved privacy features.
Findings
High classification accuracy with F1 score of 0.92
Supports multiple processing modules and data privacy
Standardized input/output for observational research
Abstract
Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis. In this work, we present RadText, an open-source radiology text analysis system developed by Python. RadText offers an easy-to-use text analysis pipeline, including de-identification, section segmentation, sentence split and word tokenization, named entity recognition, parsing, and negation detection. RadText features a flexible modular design, provides a hybrid text processing schema, and supports raw text processing and local processing, which enables better usability and improved data privacy. RadText adopts BioC as the unified interface, and also standardizes the input / output into a structured representation compatible with Observational Medical Outcomes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
