Did You Get What You Paid For? Rethinking Annotation Cost of Deep Learning Based Computer Aided Detection in Chest Radiographs
Tae Soo Kim, Geonwoon Jang, Sanghyup Lee, Thijs Kooi

TL;DR
This study examines how different aspects of annotation cost—quantity, quality, and granularity—affect deep learning model performance for chest X-ray detection, highlighting cost-effective strategies for data annotation.
Contribution
It provides empirical evidence on the impact of annotation cost dimensions on CAD performance and proposes cost-efficient annotation strategies for chest radiograph analysis.
Findings
Large amounts of cost-efficient annotations yield high performance.
Combining cheap and expensive labels reduces overall annotation costs.
Cost-efficient annotations can be nearly as effective as gold-standard labels.
Abstract
As deep networks require large amounts of accurately labeled training data, a strategy to collect sufficiently large and accurate annotations is as important as innovations in recognition methods. This is especially true for building Computer Aided Detection (CAD) systems for chest X-rays where domain expertise of radiologists is required to annotate the presence and location of abnormalities on X-ray images. However, there lacks concrete evidence that provides guidance on how much resource to allocate for data annotation such that the resulting CAD system reaches desired performance. Without this knowledge, practitioners often fall back to the strategy of collecting as much detail as possible on as much data as possible which is cost inefficient. In this work, we investigate how the cost of data annotation ultimately impacts the CAD model performance on classification and segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · Lung Cancer Diagnosis and Treatment
