Understanding Neural Architecture Search Techniques

George Adam; Jonathan Lorraine

arXiv:1904.00438·cs.LG·November 22, 2019·29 cites

Understanding Neural Architecture Search Techniques

George Adam, Jonathan Lorraine

PDF

Open Access

TL;DR

This paper investigates the limitations of ENAS in neural architecture search, revealing its inability to learn structural similarities and proposing a memory buffer solution to improve controller interpretability.

Contribution

It identifies the failure mode of ENAS controllers in learning architecture similarities and introduces a memory buffer training method to enhance interpretability.

Findings

01

ENAS does not significantly outperform random search with weight sharing.

02

Models from identical controller states lack correlation with architecture similarity metrics.

03

Memory buffer training improves controller interpretability.

Abstract

Automatic methods for generating state-of-the-art neural network architectures without human experts have generated significant attention recently. This is because of the potential to remove human experts from the design loop which can reduce costs and decrease time to model deployment. Neural architecture search (NAS) techniques have improved significantly in their computational efficiency since the original NAS was proposed. This reduction in computation is enabled via weight sharing such as in Efficient Neural Architecture Search (ENAS). However, recently a body of work confirms our discovery that ENAS does not do significantly better than random search with weight sharing, contradicting the initial claims of the authors. We provide an explanation for this phenomenon by investigating the interpretability of the ENAS controller's hidden state. We find models sampled from identical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Machine Learning in Materials Science

MethodsInterpretability · Random Search · Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory