TL;DR
UNAS is a unified neural architecture search framework that combines differentiable and reinforcement learning approaches, enabling efficient search for architectures with both differentiable and non-differentiable criteria, achieving state-of-the-art results.
Contribution
It introduces a unified framework that integrates differentiable and RL-based NAS methods, along with a new generalization gap-based objective for better architecture selection.
Findings
UNAS achieves state-of-the-art accuracy on CIFAR-10, CIFAR-100, and ImageNet.
UNAS outperforms existing architectures in the ProxylessNAS search space.
The framework maintains low search cost while optimizing complex criteria.
Abstract
Neural architecture search (NAS) aims to discover network architectures with desired properties such as high accuracy or low latency. Recently, differentiable NAS (DNAS) has demonstrated promising results while maintaining a search cost orders of magnitude lower than reinforcement learning (RL) based NAS. However, DNAS models can only optimize differentiable loss functions in search, and they require an accurate differentiable approximation of non-differentiable criteria. In this work, we present UNAS, a unified framework for NAS, that encapsulates recent DNAS and RL-based approaches under one framework. Our framework brings the best of both worlds, and it enables us to search for architectures with both differentiable and non-differentiable criteria in one unified framework while maintaining a low search cost. Further, we introduce a new objective function for search based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
UNAS: Differentiable Architecture Search Meets Reinforcement Learning· youtube
Taxonomy
MethodsDifferentiable Neural Architecture Search · Differentiable Architecture Search · REINFORCE · Gumbel Softmax · Depthwise Convolution · Pointwise Convolution · Differentiable Neural Architecture Search · Depthwise Separable Convolution · Batch Normalization · DropPath
