NAS evaluation is frustratingly hard
Antoine Yang, Pedro M. Esperan\c{c}a, Fabio M. Carlucci

TL;DR
This paper benchmarks eight NAS methods across five datasets, revealing that many struggle to outperform simple baselines and highlighting the importance of evaluation protocols, search space design, and macro-structure in NAS performance.
Contribution
It introduces a standardized benchmark for NAS methods, proposes a relative improvement metric, and provides insights into factors affecting NAS effectiveness and reproducibility.
Findings
Many NAS methods do not significantly outperform the average architecture baseline.
Evaluation tricks greatly influence reported NAS performance.
Macro-structure design impacts NAS success more than micro-operations.
Abstract
Neural Architecture Search (NAS) is an exciting new field which promises to be as much as a game-changer as Convolutional Neural Networks were in 2012. Despite many great works leading to substantial improvements on a variety of tasks, comparison between different methods is still very much an open issue. While most algorithms are tested on the same datasets, there is no shared experimental protocol followed by all. As such, and due to the under-use of ablation studies, there is a lack of clarity regarding why certain methods are more effective than others. Our first contribution is a benchmark of NAS methods on datasets. To overcome the hurdle of comparing methods with different search spaces, we propose using a method's relative improvement over the randomly sampled average architecture, which effectively removes advantages arising from expertly engineered search spaces or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Biology Techniques and Applications · Radiomics and Machine Learning in Medical Imaging · Advanced X-ray and CT Imaging
MethodsDifferentiable Architecture Search
