Understanding Neural Architecture Search Techniques
George Adam, Jonathan Lorraine

TL;DR
This paper investigates the limitations of ENAS in neural architecture search, revealing its inability to learn structural similarities and proposing a memory buffer solution to improve controller interpretability.
Contribution
It identifies the failure mode of ENAS controllers in learning architecture similarities and introduces a memory buffer training method to enhance interpretability.
Findings
ENAS does not significantly outperform random search with weight sharing.
Models from identical controller states lack correlation with architecture similarity metrics.
Memory buffer training improves controller interpretability.
Abstract
Automatic methods for generating state-of-the-art neural network architectures without human experts have generated significant attention recently. This is because of the potential to remove human experts from the design loop which can reduce costs and decrease time to model deployment. Neural architecture search (NAS) techniques have improved significantly in their computational efficiency since the original NAS was proposed. This reduction in computation is enabled via weight sharing such as in Efficient Neural Architecture Search (ENAS). However, recently a body of work confirms our discovery that ENAS does not do significantly better than random search with weight sharing, contradicting the initial claims of the authors. We provide an explanation for this phenomenon by investigating the interpretability of the ENAS controller's hidden state. We find models sampled from identical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Machine Learning in Materials Science
MethodsInterpretability · Random Search · Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
