Graph HyperNetworks for Neural Architecture Search
Chris Zhang, Mengye Ren, Raquel Urtasun

TL;DR
This paper introduces Graph HyperNetworks (GHNs) that efficiently generate neural network weights based on architecture topology, enabling faster neural architecture search and improved speed-accuracy tradeoffs.
Contribution
The paper presents GHNs that model architecture topology for rapid weight generation, significantly reducing NAS search time and improving performance over manual designs.
Findings
GHNs enable nearly 10x faster NAS on CIFAR-10 and ImageNet.
GHNs produce more accurate performance predictions than traditional hypernetworks.
GHNs discover networks with superior speed-accuracy tradeoffs in anytime prediction settings.
Abstract
Neural architecture search (NAS) automatically finds the best task-specific neural network topology, outperforming many manual architecture designs. However, it can be prohibitively expensive as the search requires training thousands of different networks, while each can last for hours. In this work, we propose the Graph HyperNetwork (GHN) to amortize the search cost: given an architecture, it directly generates the weights by running inference on a graph neural network. GHNs model the topology of an architecture and therefore can predict network performance more accurately than regular hypernetworks and premature early stopping. To perform NAS, we randomly sample architectures and use the validation accuracy of networks with GHN generated weights as the surrogate search signal. GHNs are fast -- they can search nearly 10 times faster than other random search methods on CIFAR-10 and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsRandom Search · HyperNetwork
