Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection

Daniel Frees; Moritz Bolling; Aditri Bhagirath

arXiv:2508.17567·cs.CV·August 27, 2025

Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection

Daniel Frees, Moritz Bolling, Aditri Bhagirath

PDF

TL;DR

This study investigates optimal CNN architectures for medical image classification, comparing pre-training datasets, and finds that certain architectures perform best, but RadImageNet pre-training does not always improve results.

Contribution

It extends prior work by systematically evaluating CNN architectures and pre-training effects for breast lesion and ACL tear detection, providing practical guidelines for model selection.

Findings

01

ResNet50 with partial backbone unfreezing yields top performance.

02

Best models achieve AUCs of 0.9969 for ACL tears and 0.9641 for breast lesions.

03

RadImageNet pre-training does not consistently outperform ImageNet pre-training.

Abstract

Modern computer vision models have proven to be highly useful for medical imaging classification and segmentation tasks, but the scarcity of medical imaging data often limits the efficacy of models trained from scratch. Transfer learning has emerged as a pivotal solution to this, enabling the fine-tuning of high-performance models on small data. Mei et al. (2022) found that pre-training CNNs on a large dataset of radiologist-labeled images (RadImageNet) enhanced model performance on downstream tasks compared to ImageNet pretraining. The present work extends Mei et al. (2022) by conducting a comprehensive investigation to determine optimal CNN architectures for breast lesion malignancy detection and ACL tear detection, as well as performing statistical analysis to compare the effect of RadImageNet and ImageNet pre-training on downstream model performance. Our findings suggest that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.