Dextr: Zero-Shot Neural Architecture Search with Singular Value Decomposition and Extrinsic Curvature
Rohan Asthana, Joschua Conrad, Maurits Ortmanns, Vasileios Belagiannis

TL;DR
This paper introduces Dextr, a zero-shot neural architecture search proxy that uses SVD and extrinsic curvature to predict network performance without labeled data, improving efficiency and accuracy across various search spaces.
Contribution
We propose a novel label-free zero-cost proxy combining SVD and extrinsic curvature to evaluate neural architectures, addressing limitations of existing proxies.
Findings
Accurately predicts network performance with a single unlabeled sample.
Outperforms existing proxies on multiple benchmarks.
Effective across CNN and Transformer search spaces.
Abstract
Zero-shot Neural Architecture Search (NAS) typically optimises the architecture search process by exploiting the network or gradient properties at initialisation through zero-cost proxies. The existing proxies often rely on labelled data, which is usually unavailable in real-world settings. Furthermore, the majority of the current methods focus either on optimising the convergence and generalisation attributes or solely on the expressivity of the network architectures. To address both limitations, we first demonstrate how channel collinearity affects the convergence and generalisation properties of a neural network. Then, by incorporating the convergence, generalisation and expressivity in one approach, we propose a zero-cost proxy that omits the requirement of labelled data for its computation. In particular, we leverage the Singular Value Decomposition (SVD) of the neural network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
