Rethinking Annotation Granularity for Overcoming Shortcuts in Deep Learning-based Radiograph Diagnosis: A Multicenter Study
Luyang Luo, Hao Chen, Yongjie Xiao, Yanning Zhou, Xi Wang, Varut, Vardhanabhuti, Mingxiang Wu, Chu Han, Zaiyi Liu, Xin Hao Benjamin Fang,, Efstratios Tsougenis, Huangjing Lin, and Pheng-Ann Heng

TL;DR
This multicenter study demonstrates that using fine-grained lesion annotations in deep learning models improves generalization and lesion localization in radiograph diagnosis, surpassing models trained with only coarse annotations and matching radiologist performance.
Contribution
The study shows that fine-grained lesion annotations overcome shortcut learning, enhancing model generalizability and lesion detection in radiograph diagnosis.
Findings
CheXDet outperformed CheXNet in external classification tasks.
CheXDet achieved higher lesion localization performance across datasets.
Models with fine-grained annotations matched radiologist performance when trained sufficiently.
Abstract
Two DL models were developed using radiograph-level annotations (yes or no disease) and fine-grained lesion-level annotations (lesion bounding boxes), respectively named CheXNet and CheXDet. The models' internal classification performance and lesion localization performance were compared on a testing set (n=2,922), external classification performance was compared on NIH-Google (n=4,376) and PadChest (n=24,536) datasets, and external lesion localization performance was compared on NIH-ChestX-ray14 dataset (n=880). The models were also compared to radiologists on a subset of the internal testing set (n=496). Given sufficient training data, both models performed comparably to radiologists. CheXDet achieved significant improvement for external classification, such as in classifying fracture on NIH-Google (CheXDet area under the ROC curve [AUC]: 0.67, CheXNet AUC: 0.51; p<.001) and PadChest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment
MethodsXRP Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Max Pooling · 1x1 Convolution · Concatenated Skip Connection · Dropout · Dense Block · Convolution · Average Pooling
