Meta Architecture Search

Albert Shaw; Wei Wei; Weiyang Liu; Le Song; Bo Dai

arXiv:1812.09584·cs.LG·November 18, 2019·26 cites

Meta Architecture Search

Albert Shaw, Wei Wei, Weiyang Liu, Le Song, Bo Dai

PDF

Open Access 1 Repo

TL;DR

This paper introduces Meta Architecture Search (MAS), a task-agnostic approach that learns a prior for neural architecture search, significantly reducing computational costs while maintaining high performance across multiple tasks.

Contribution

The paper proposes the Bayesian Meta Architecture Search (BASE) framework, the first to learn a task-agnostic prior for NAS, enabling faster adaptation and reduced computation.

Findings

01

Achieves 25.7% top-1 error on ImageNet with less than an hour of adaptation.

02

Reduces NAS computational cost by learning a good prior.

03

Finds competitive models for unseen datasets with quick adaptation.

Abstract

Neural Architecture Search (NAS) has been quite successful in constructing state-of-the-art models on a variety of tasks. Unfortunately, the computational cost can make it difficult to scale. In this paper, we make the first attempt to study Meta Architecture Search which aims at learning a task-agnostic representation that can be used to speed up the process of architecture search on a large number of tasks. We propose the Bayesian Meta Architecture SEarch (BASE) framework which takes advantage of a Bayesian formulation of the architecture search problem to learn over an entire set of tasks simultaneously. We show that on Imagenet classification, we can find a model that achieves 25.7% top-1 error and 8.1% top-5 error by adapting the architecture in less than an hour from an 8 GPU days pretrained meta-network. By learning a good prior for NAS, our method dramatically decreases the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ashaw596/meta_architecture_search
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings