How much data is needed to train a medical image deep learning system to achieve necessary high accuracy?
Junghwan Cho, Kyewook Lee, Ellie Shin, Garry Choy, Synho Do

TL;DR
This study investigates the optimal amount of training data needed for CNNs to accurately classify medical images, specifically CT scans, using a learning curve approach to inform data requirements for high accuracy.
Contribution
The paper introduces a methodology to determine the minimum training data size needed for high accuracy in medical image classification using CNNs.
Findings
Optimal training data size varies with target accuracy.
Learning curve approach effectively predicts required data volume.
Method can be generalized to other medical imaging tasks.
Abstract
The use of Convolutional Neural Networks (CNN) in natural image classification systems has produced very impressive results. Combined with the inherent nature of medical images that make them ideal for deep-learning, further application of such systems to medical image classification holds much promise. However, the usefulness and potential impact of such a system can be completely negated if it does not reach a target accuracy. In this paper, we present a study on determining the optimum size of the training data set necessary to achieve high classification accuracy with low variance in medical image classification systems. The CNN was applied to classify axial Computed Tomography (CT) images into six anatomical classes. We trained the CNN using six different sizes of training data set (5, 10, 20, 50, 100, and 200) and then tested the resulting system with a total of 6000 CT images.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Advanced X-ray and CT Imaging
