Infrared Object Detection with Ultra Small ConvNets: Is ImageNet Pretraining Still Useful?

Srikanth Muralidharan; Heitor R. Medeiros; Masih Aminbeidokhti; Eric Granger; Marco Pedersoli

arXiv:2508.02927·cs.CV·January 21, 2026

Infrared Object Detection with Ultra Small ConvNets: Is ImageNet Pretraining Still Useful?

Srikanth Muralidharan, Heitor R. Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

PDF

TL;DR

This paper investigates the impact of ImageNet pretraining on ultra-small infrared object detection models, revealing that pretraining remains beneficial but offers diminishing returns as model size decreases, affecting robustness in varied conditions.

Contribution

The study systematically evaluates the effect of ImageNet pretraining on ultra-small models for infrared detection, providing insights into their robustness and practical recommendations.

Findings

01

Pretraining improves ultra-small model performance.

02

Diminishing returns of pretraining beyond a size threshold.

03

Smaller models are less robust to out-of-distribution data.

Abstract

Many real-world applications require recognition models that are robust to different operational conditions and modalities, but at the same time run on small embedded devices, with limited hardware. While for normal size models, pre-training is known to be very beneficial in accuracy and robustness, for small models, that can be employed for embedded and edge devices, its effect is not clear. In this work, we investigate the effect of ImageNet pretraining on increasingly small backbone architectures (ultra-small models, with less than 1M parameters) with respect to robustness in downstream object detection tasks in the infrared visual modality. Using scaling laws derived from standard object recognition architectures, we construct two ultra-small backbone families and systematically study their performance. Our experiments on three different datasets reveal that while ImageNet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.