GraphPNAS: Learning Distribution of Good Neural Architectures via Deep Graph Generative Models
Muchen Li, Jeffrey Yunfan Liu, Leonid Sigal, Renjie Liao

TL;DR
GraphPNAS introduces a deep graph generative model for neural architecture search, learning a distribution of high-performing architectures to improve flexibility and efficiency over traditional point-estimation methods.
Contribution
The paper proposes GraphPNAS, a novel GNN-based generative model for NAS that captures architecture topologies and relations, enabling probabilistic search and outperforming existing methods.
Findings
GraphPNAS outperforms RNN-based generators in NAS tasks.
It achieves comparable or better results than state-of-the-art NAS methods.
Effective on diverse search spaces including complex RandWire.
Abstract
Neural architectures can be naturally viewed as computational graphs. Motivated by this perspective, we, in this paper, study neural architecture search (NAS) through the lens of learning random graph models. In contrast to existing NAS methods which largely focus on searching for a single best architecture, i.e, point estimation, we propose GraphPNAS a deep graph generative model that learns a distribution of well-performing architectures. Relying on graph neural networks (GNNs), our GraphPNAS can better capture topologies of good neural architectures and relations between operators therein. Moreover, our graph generator leads to a learnable probabilistic search method that is more flexible and efficient than the commonly used RNN generator and random search methods. Finally, we learn our generator via an efficient reinforcement learning formulation for NAS. To assess the effectiveness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Topic Modeling
Methods1x1 Convolution · Average Pooling · Convolution · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Global Average Pooling · Batch Normalization · Softmax · Random Search · RandWire
