Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision
Pranav Jeevan, Amit Sethi

TL;DR
This paper systematically compares resource-efficient CNN backbones across diverse computer vision domains and dataset sizes to guide practitioners in selecting optimal models, especially for small datasets.
Contribution
It provides a comprehensive evaluation of lightweight pre-trained CNN backbones across multiple domains, highlighting their relative performance and guiding model selection.
Findings
CNN architectures like ConvNeXt, RegNet, and EfficientNet perform well across domains.
Attention-based architectures underperform in low-data fine-tuning scenarios.
Resource-efficient CNNs are effective choices for diverse computer vision tasks.
Abstract
In contemporary computer vision applications, particularly image classification, architectural backbones pre-trained on large datasets like ImageNet are commonly employed as feature extractors. Despite the widespread use of these pre-trained convolutional neural networks (CNNs), there remains a gap in understanding the performance of various resource-efficient backbones across diverse domains and dataset sizes. Our study systematically evaluates multiple lightweight, pre-trained CNN backbones under consistent training settings across a variety of datasets, including natural images, medical images, galaxy images, and remote sensing images. This comprehensive analysis aims to aid machine learning practitioners in selecting the most suitable backbone for their specific problem, especially in scenarios involving small datasets where fine-tuning a pre-trained network is crucial. Even though…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Sparse Evolutionary Training · Batch Normalization · Dropout · Dense Connections · Squeeze-and-Excitation Block · Pointwise Convolution · Convolution · ConvNeXt · Average Pooling
