Which Backbone to Use: A Resource-efficient Domain Specific Comparison   for Computer Vision

Pranav Jeevan; Amit Sethi

arXiv:2406.05612·cs.CV·March 11, 2025·2 cites

Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision

Pranav Jeevan, Amit Sethi

PDF

Open Access 1 Repo

TL;DR

This paper systematically compares resource-efficient CNN backbones across diverse computer vision domains and dataset sizes to guide practitioners in selecting optimal models, especially for small datasets.

Contribution

It provides a comprehensive evaluation of lightweight pre-trained CNN backbones across multiple domains, highlighting their relative performance and guiding model selection.

Findings

01

CNN architectures like ConvNeXt, RegNet, and EfficientNet perform well across domains.

02

Attention-based architectures underperform in low-data fine-tuning scenarios.

03

Resource-efficient CNNs are effective choices for diverse computer vision tasks.

Abstract

In contemporary computer vision applications, particularly image classification, architectural backbones pre-trained on large datasets like ImageNet are commonly employed as feature extractors. Despite the widespread use of these pre-trained convolutional neural networks (CNNs), there remains a gap in understanding the performance of various resource-efficient backbones across diverse domains and dataset sizes. Our study systematically evaluates multiple lightweight, pre-trained CNN backbones under consistent training settings across a variety of datasets, including natural images, medical images, galaxy images, and remote sensing images. This comprehensive analysis aims to aid machine learning practitioners in selecting the most suitable backbone for their specific problem, especially in scenarios involving small datasets where fine-tuning a pre-trained network is crucial. Even though…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pranavphoenix/Backbones
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Sparse Evolutionary Training · Batch Normalization · Dropout · Dense Connections · Squeeze-and-Excitation Block · Pointwise Convolution · Convolution · ConvNeXt · Average Pooling