Zero-Shot Neural Architecture Search with Weighted Response Correlation
Kun Jing, Luoyu Chen, Jungang Xu, Jianwei Tai, Yiyu Wang, Shuaimin Li

TL;DR
This paper introduces a novel zero-shot neural architecture search method using weighted response correlation (WRCor), which efficiently estimates architecture quality without training, outperforming existing proxies and finding competitive architectures on ImageNet-1k.
Contribution
The paper proposes WRCor, a new training-free proxy for NAS that measures expressivity and generalizability, improving efficiency and effectiveness over existing methods.
Findings
WRCor outperforms existing proxies in efficiency and accuracy.
The NAS algorithm discovers architectures with 22.1% test error on ImageNet-1k.
The method finds architectures within 4 GPU hours.
Abstract
Neural architecture search (NAS) is a promising approach for automatically designing neural network architectures. However, the architecture estimation of NAS is computationally expensive and time-consuming because of training multiple architectures from scratch. Although existing zero-shot NAS methods use training-free proxies to accelerate the architecture estimation, their effectiveness, stability, and generality are still lacking. We present a novel training-free estimation proxy called weighted response correlation (WRCor). WRCor utilizes correlation coefficient matrices of responses across different input samples to calculate the proxy scores of estimated architectures, which can measure their expressivity and generalizability. Experimental results on proxy evaluation demonstrate that WRCor and its voting proxies are more efficient estimation strategies than existing proxies. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Image Processing Techniques and Applications
