Variation Matters: from Mitigating to Embracing Zero-Shot NAS Ranking Function Variation
Pavel Rumiantsev, Mark Coates

TL;DR
This paper addresses the variability in zero-shot NAS ranking functions by modeling their outputs as random variables and using stochastic ordering to improve architecture selection, leading to better search performance.
Contribution
It introduces a novel approach that accounts for ranking function variation through stochastic ordering, enhancing zero-shot NAS effectiveness.
Findings
Stochastic ordering improves NAS search outcomes.
Modeling ranking outputs as random variables is beneficial.
Proposed method outperforms traditional averaging approaches.
Abstract
Neural Architecture Search (NAS) is a powerful automatic alternative to manual design of a neural network. In the zero-shot version, a fast ranking function is used to compare architectures without training them. The outputs of the ranking functions often vary significantly due to different sources of randomness, including the evaluated architecture's weights' initialization or the batch of data used for calculations. A common approach to addressing the variation is to average a ranking function output over several evaluations. We propose taking into account the variation in a different manner, by viewing the ranking function output as a random variable representing a proxy performance metric. During the search process, we strive to construct a stochastic ordering of the performance metrics to determine the best architecture. Our experiments show that the proposed stochastic ordering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Advanced Graph Neural Networks
