HM-NAS: Efficient Neural Architecture Search via Hierarchical Masking
Shen Yan, Biyi Fang, Faen Zhang, Yu Zheng, Xiao Zeng, Hui Xu, Mi Zhang

TL;DR
HM-NAS introduces a hierarchical masking approach for neural architecture search, enabling more flexible architecture exploration and automatic learning of optimal structures, outperforming existing weight sharing methods.
Contribution
It proposes a hierarchical masking scheme and multi-level encoding to improve NAS flexibility and eliminate reliance on hand-designed heuristics.
Findings
HM-NAS achieves better search performance than state-of-the-art methods.
The searched architectures are more flexible and meaningful.
Models evaluated show competitive accuracy.
Abstract
The use of automatic methods, often referred to as Neural Architecture Search (NAS), in designing neural network architectures has recently drawn considerable attention. In this work, we present an efficient NAS approach, named HM- NAS, that generalizes existing weight sharing based NAS approaches. Existing weight sharing based NAS approaches still adopt hand-designed heuristics to generate architecture candidates. As a consequence, the space of architecture candidates is constrained in a subset of all possible architectures, making the architecture search results sub-optimal. HM-NAS addresses this limitation via two innovations. First, HM-NAS incorporates a multi-level architecture encoding scheme to enable searching for more flexible network architectures. Second, it discards the hand-designed heuristics and incorporates a hierarchical masking scheme that automatically learns and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
