NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
Rameswar Panda, Michele Merler, Mayoore Jaiswal, Hui Wu, Kandan, Ramakrishnan, Ulrich Finkler, Chun-Fu Chen, Minsik Cho, David Kung, Rogerio, Feris, Bishwaranjan Bhattacharjee

TL;DR
This paper investigates how well neural architecture search methods transfer architectures from small datasets to large-scale benchmarks like ImageNet, revealing that dataset size and domain have limited impact, but search strategy choice is crucial.
Contribution
It provides a comprehensive empirical analysis of architecture transferability across different NAS methods and datasets, highlighting key factors influencing transfer performance.
Findings
Proxy set size and domain have minimal impact on transfer performance.
Different NAS methods perform similarly on source datasets but vary significantly when transferred.
Random sampling is a strong baseline, but strategic search improves results.
Abstract
Neural Architecture Search (NAS) is an open and challenging problem in machine learning. While NAS offers great promise, the prohibitive computational demand of most of the existing NAS methods makes it difficult to directly search the architectures on large-scale tasks. The typical way of conducting large scale NAS is to search for an architectural building block on a small dataset (either using a proxy set from the large dataset or a completely different small scale dataset) and then transfer the block to a larger dataset. Despite a number of recent results that show the promise of transfer from proxy datasets, a comprehensive evaluation of different NAS methods studying the impact of different source datasets has not yet been addressed. In this work, we propose to analyze the architecture transferability of different NAS methods by performing a series of experiments on large scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
