Exploring Patient Data Requirements in Training Effective AI Models for MRI-based Breast Cancer Classification
Solha Kang, Wesley De Neve, Francois Rameau, Utku Ozbulak

TL;DR
This study investigates the amount of MRI patient data needed for effective breast cancer AI models, finding that a relatively small dataset suffices when using foundation models, with larger datasets offering diminishing returns.
Contribution
It demonstrates that effective breast cancer detection models can be trained with limited data by leveraging foundation models, reducing data requirements for medical institutions.
Findings
Models trained with fewer than 50 patients perform competitively.
Beyond 50 patients, additional data has minimal impact on performance.
Simple ensemble methods further enhance model accuracy.
Abstract
The past decade has witnessed a substantial increase in the number of startups and companies offering AI-based solutions for clinical decision support in medical institutions. However, the critical nature of medical decision-making raises several concerns about relying on external software. Key issues include potential variations in image modalities and the medical devices used to obtain these images, potential legal issues, and adversarial attacks. Fortunately, the open-source nature of machine learning research has made foundation models publicly available and straightforward to use for medical applications. This accessibility allows medical institutions to train their own AI-based models, thereby mitigating the aforementioned concerns. Given this context, an important question arises: how much data do medical institutions need to train effective AI models? In this study, we explore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
