Beyond ImageNet: Understanding Cross-Dataset Robustness of Lightweight Vision Models
Weidong Zhang, Pak Lun Kevin Ding, Huan Liu

TL;DR
This paper systematically evaluates lightweight vision models across multiple datasets, introduces the xScore metric for robustness, and identifies architectural features that enhance cross-domain generalization.
Contribution
It presents the first comprehensive cross-dataset robustness evaluation of lightweight models, introduces the xScore metric, and analyzes architectural elements influencing generalization.
Findings
ImageNet accuracy does not predict performance on other datasets.
xScore effectively estimates model robustness from limited data.
Certain architectural features improve cross-domain generalization.
Abstract
Lightweight vision classification models such as MobileNet, ShuffleNet, and EfficientNet are increasingly deployed in mobile and embedded systems, yet their performance has been predominantly benchmarked on ImageNet. This raises critical questions: Do models that excel on ImageNet also generalize across other domains? How can cross-dataset robustness be systematically quantified? And which architectural elements consistently drive generalization under tight resource constraints? Here, we present the first systematic evaluation of 11 lightweight vision models (2.5M parameters), trained under a fixed 100-epoch schedule across 7 diverse datasets. We introduce the Cross-Dataset Score (xScore), a unified metric that quantifies the consistency and robustness of model performance across diverse visual domains. Our results show that (1) ImageNet accuracy does not reliably predict performance on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
