RadLex Normalization in Radiology Reports

Surabhi Datta; Jordan Godfrey-Stovall; Kirk Roberts

arXiv:2009.05128·cs.CL·September 14, 2020

RadLex Normalization in Radiology Reports

Surabhi Datta, Jordan Godfrey-Stovall, Kirk Roberts

PDF

Open Access

TL;DR

This paper introduces a deep learning approach using BERT for normalizing radiology report entities to RadLex, improving standardization and facilitating better data extraction in radiology.

Contribution

It presents the first application of RadLex normalization in radiology reports using BERT-based models and constructs a new annotated corpus for this task.

Findings

01

Best accuracy of 78.44% with span detector

02

Effective use of BM25 for candidate retrieval

03

Identified challenges in corpus construction

Abstract

Radiology reports have been widely used for extraction of various clinically significant information about patients' imaging studies. However, limited research has focused on standardizing the entities to a common radiology-specific vocabulary. Further, no study to date has attempted to leverage RadLex for standardization. In this paper, we aim to normalize a diverse set of radiological entities to RadLex terms. We manually construct a normalization corpus by annotating entities from three types of reports. This contains 1706 entity mentions. We propose two deep learning-based NLP methods based on a pre-trained language model (BERT) for automatic normalization. First, we employ BM25 to retrieve candidate concepts for the BERT-based models (re-ranker and span detector) to predict the normalized concept. The results are promising, with the best accuracy (78.44%) obtained by the span…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques