Supervised Machine Learning Algorithm for Detecting Consistency between Reported Findings and the Conclusions of Mammography Reports
Alexander Berdichevsky, Mor Peleg, and Daniel L. Rubin

TL;DR
This paper presents a supervised machine learning system that automatically detects discrepancies between mammography report findings and conclusions by classifying reports into BI-RADS categories and assessing semantic similarity.
Contribution
It introduces a novel approach combining term normalization, classification, and semantic similarity to identify inconsistencies in mammography reports.
Findings
97% accuracy in BI-RADS descriptor identification
76% precision and 83% recall in report classification
Effective detection of report-conclusion discrepancies
Abstract
Objective. Mammography reports document the diagnosis of patients' conditions. However, many reports contain non-standard terms (non-BI-RADS descriptors) and incomplete statements, which can lead to conclusions that are not well-supported by the reported findings. Our aim was to develop a tool to detect such discrepancies by comparing the reported conclusions to those that would be expected based on the reported radiology findings. Materials and Methods. A deidentified data set from an academic hospital containing 258 mammography reports supplemented by 120 reports found on the web was used for training and evaluation. Spell checking and term normalization was used to unambiguously determine the reported BI-RADS descriptors. The resulting data were input into seven classifiers that classify mammography reports, based on their Findings sections, into seven BI-RADS final assessment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques
MethodsAttentive Walk-Aggregating Graph Neural Network
