Neural Architecture Optimization

Renqian Luo; Fei Tian; Tao Qin; Enhong Chen; Tie-Yan Liu

arXiv:1808.07233·cs.LG·September 5, 2019·432 cites

Neural Architecture Optimization

Renqian Luo, Fei Tian, Tao Qin, Enhong Chen, Tie-Yan Liu

PDF

Open Access 5 Repos

TL;DR

This paper introduces Neural Architecture Optimization (NAO), a continuous optimization-based method for automatic neural architecture design that is more efficient and achieves competitive results on image classification and language modeling tasks.

Contribution

The paper presents a novel continuous optimization approach for neural architecture search, including an encoder, predictor, and decoder, enabling gradient-based search in a continuous space.

Findings

01

Achieved 1.93% error on CIFAR-10

02

Attained 56.0 perplexity on PTB

03

Reduced computational resources significantly

Abstract

Automatic neural architecture design has shown its potential in discovering powerful neural network architectures. Existing methods, no matter based on reinforcement learning or evolutionary algorithms (EA), conduct architecture search in a discrete space, which is highly inefficient. In this paper, we propose a simple and efficient method to automatic neural architecture design based on continuous optimization. We call this new approach neural architecture optimization (NAO). There are three key components in our proposed approach: (1) An encoder embeds/maps neural network architectures into a continuous space. (2) A predictor takes the continuous representation of a network as input and predicts its accuracy. (3) A decoder maps a continuous representation of a network back to its architecture. The performance predictor and the encoder enable us to perform gradient based optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Machine Learning and Data Classification