Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation
Jikai Zhang, Maciej M. Mazurowski, Brian C. Allen, and Benjamin, Wildman-Torbiner

TL;DR
MADLaP is a deep-learning tool that automates the annotation of thyroid ultrasound images, significantly reducing manual effort and enabling larger datasets for machine learning, though it faces challenges with heterogeneous data sources.
Contribution
This study introduces MADLaP, a novel multi-step AI system that automates thyroid ultrasound image labeling using natural language processing, image segmentation, and OCR.
Findings
MADLaP achieved 83% accuracy in labeling.
Yield increased through the processing pipeline.
Lower accuracy observed in certain examination sites.
Abstract
Machine learning (ML) for diagnosis of thyroid nodules on ultrasound is an active area of research. However, ML tools require large, well-labelled datasets, the curation of which is time-consuming and labor-intensive. The purpose of our study was to develop and test a deep-learning-based tool to facilitate and automate the data annotation process for thyroid nodules; we named our tool Multistep Automated Data Labelling Procedure (MADLaP). MADLaP was designed to take multiple inputs included pathology reports, ultrasound images, and radiology reports. Using multiple step-wise modules including rule-based natural language processing, deep-learning-based imaging segmentation, and optical character recognition, MADLaP automatically identified images of a specific thyroid nodule and correctly assigned a pathology label. The model was developed using a training set of 378 patients across our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection
MethodsTest
