Is it enough to optimize CNN architectures on ImageNet?
Lukas Tuggener, J\"urgen Schmidhuber, Thilo Stadelmann

TL;DR
This paper questions whether optimizing CNN architectures solely on ImageNet leads to universally effective models, showing that dataset-specific performance varies and can be improved by using restricted ImageNet subsets.
Contribution
The study provides an extensive empirical analysis of CNN performance across multiple datasets, revealing dataset dependency and proposing methods to improve generalization beyond ImageNet.
Findings
Performance of CNNs is highly dataset dependent.
Using ImageNet subsets with fewer classes increases correlation with other datasets.
Over-reliance on ImageNet may limit generalization to diverse domains.
Abstract
Classification performance based on ImageNet is the de-facto standard metric for CNN development. In this work we challenge the notion that CNN architecture design solely based on ImageNet leads to generally effective convolutional neural network (CNN) architectures that perform well on a diverse set of datasets and application domains. To this end, we investigate and ultimately improve ImageNet as a basis for deriving such architectures. We conduct an extensive empirical study for which we train CNN architectures, sampled from the broad AnyNetX design space, on ImageNet as well as additional well known image classification benchmark datasets from a diverse array of application domains. We observe that the performances of the architectures are highly dataset dependent. Some datasets even exhibit a negative error correlation with ImageNet across all architectures. We show how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Advanced Neural Network Applications · Neural Networks and Applications
