Labeling of Multilingual Breast MRI Reports

Chen-Han Tsai; Nahum Kiryati; Eli Konen; Miri Sklair-Levy; Arnaldo; Mayer

arXiv:2007.03028·cs.CV·November 12, 2020

Labeling of Multilingual Breast MRI Reports

Chen-Han Tsai, Nahum Kiryati, Eli Konen, Miri Sklair-Levy, Arnaldo, Mayer

PDF

TL;DR

This paper introduces LAMBR, a custom language model for automatically classifying multilingual breast MRI reports, reducing the need for manual labeling and improving extraction accuracy in clinical settings.

Contribution

The paper presents a novel multilingual report classifier using LAMBR, addressing practical challenges and outperforming conventional methods in medical report labeling.

Findings

01

LAMBR improves label extraction accuracy

02

Reduces manual annotation effort

03

Effective across multiple languages

Abstract

Medical reports are an essential medium in recording a patient's condition throughout a clinical trial. They contain valuable information that can be extracted to generate a large labeled dataset needed for the development of clinical tools. However, the majority of medical reports are stored in an unregularized format, and a trained human annotator (typically a doctor) must manually assess and label each case, resulting in an expensive and time consuming procedure. In this work, we present a framework for developing a multilingual breast MRI report classifier using a custom-built language representation called LAMBR. Our proposed method overcomes practical challenges faced in clinical settings, and we demonstrate improved performance in extracting labels from medical reports when compared with conventional approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.