TL;DR
This paper introduces a Self-Evaluated Template Network (SETN) for neural architecture search that predicts promising candidates to evaluate, significantly reducing training costs and achieving state-of-the-art results on CIFAR and ImageNet.
Contribution
The paper proposes a novel SETN framework combining a candidate evaluator with a shared-parameter template network for efficient NAS.
Findings
Achieves state-of-the-art performance on CIFAR and ImageNet.
Reduces computational costs compared to traditional NAS methods.
Effectively predicts promising architectures for evaluation.
Abstract
Neural architecture search (NAS) aims to automate the search procedure of architecture instead of manual design. Even if recent NAS approaches finish the search within days, lengthy training is still required for a specific architecture candidate to get the parameters for its accurate evaluation. Recently one-shot NAS methods are proposed to largely squeeze the tedious training process by sharing parameters across candidates. In this way, the parameters for each candidate can be directly extracted from the shared parameters instead of training them from scratch. However, they have no sense of which candidate will perform better until evaluation so that the candidates to evaluate are randomly sampled and the top-1 candidate is considered the best. In this paper, we propose a Self-Evaluated Template Network (SETN) to improve the quality of the architecture candidates for evaluation so…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
