Beyond ImageNet: Understanding Cross-Dataset Robustness of Lightweight Vision Models

Weidong Zhang; Pak Lun Kevin Ding; Huan Liu

arXiv:2511.00335·cs.CV·December 25, 2025

Beyond ImageNet: Understanding Cross-Dataset Robustness of Lightweight Vision Models

Weidong Zhang, Pak Lun Kevin Ding, Huan Liu

PDF

TL;DR

This paper systematically evaluates lightweight vision models across multiple datasets, introduces the xScore metric for robustness, and identifies architectural features that enhance cross-domain generalization.

Contribution

It presents the first comprehensive cross-dataset robustness evaluation of lightweight models, introduces the xScore metric, and analyzes architectural elements influencing generalization.

Findings

01

ImageNet accuracy does not predict performance on other datasets.

02

xScore effectively estimates model robustness from limited data.

03

Certain architectural features improve cross-domain generalization.

Abstract

Lightweight vision classification models such as MobileNet, ShuffleNet, and EfficientNet are increasingly deployed in mobile and embedded systems, yet their performance has been predominantly benchmarked on ImageNet. This raises critical questions: Do models that excel on ImageNet also generalize across other domains? How can cross-dataset robustness be systematically quantified? And which architectural elements consistently drive generalization under tight resource constraints? Here, we present the first systematic evaluation of 11 lightweight vision models (2.5M parameters), trained under a fixed 100-epoch schedule across 7 diverse datasets. We introduce the Cross-Dataset Score (xScore), a unified metric that quantifies the consistency and robustness of model performance across diverse visual domains. Our results show that (1) ImageNet accuracy does not reliably predict performance on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.