Efficient Decoupled Neural Architecture Search by Structure and   Operation Sampling

Heung-Chang Lee; Do-Guk Kim; Bohyung Han

arXiv:1910.10397·cs.LG·April 28, 2020

Efficient Decoupled Neural Architecture Search by Structure and Operation Sampling

Heung-Chang Lee, Do-Guk Kim, Bohyung Han

PDF

1 Repo

TL;DR

This paper introduces a decoupled neural architecture search method using reinforcement learning that independently samples structures and operations, significantly improving efficiency while maintaining competitive accuracy.

Contribution

It presents a novel decoupled search algorithm that enhances efficiency and interpretability in neural architecture search compared to traditional RNN controller-based methods.

Findings

01

Achieves competitive accuracy with reduced search cost

02

Provides interpretable policy vectors during training

03

Outperforms state-of-the-art methods in efficiency

Abstract

We propose a novel neural architecture search algorithm via reinforcement learning by decoupling structure and operation search processes. Our approach samples candidate models from the multinomial distribution on the policy vectors defined on the two search spaces independently. The proposed technique improves the efficiency of architecture search process significantly compared to the conventional methods based on reinforcement learning with the RNN controllers while achieving competitive accuracy and model size in target tasks. Our policy vectors are easily interpretable throughout the training procedure, which allows to analyze the search progress and the discovered architectures; the black-box characteristics of the RNN controllers hamper understanding training progress in terms of policy parameter updates. Our experiments demonstrate outstanding performance compared to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

logue311/EDNAS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory