Hyperparameter Optimization in Neural Networks via Structured Sparse Recovery

Minsu Cho, Mohammadreza Soltani, and Chinmay Hegde

arXiv:2007.04087·cs.LG·July 9, 2020·1 cites

Open Access

TL;DR

This paper casts hyperparameter optimization (HPO) and neural architecture search (NAS) as structured sparse recovery problems, pairing a group-sparse recovery formulation with HyperBand for HPO and proposing the CoNAS algorithm for NAS.

Contribution

It establishes a new connection between HPO/NAS and structured sparse recovery, and proposes algorithms that improve search efficiency and discover new neural architectures.

Findings

01 Improved hyperparameter optimization performance on CIFAR-10.

02 Proposed CoNAS algorithm outperforms existing NAS methods.

03 Theoretical bounds on validation error measurements for NAS.

Abstract

In this paper, we study two important problems in the automated design of neural networks, Hyper-parameter Optimization (HPO) and Neural Architecture Search (NAS), through the lens of sparse recovery methods. In the first part of this paper, we establish a novel connection between HPO and structured sparse recovery. In particular, we show that a special encoding of the hyperparameter space enables a natural group-sparse recovery formulation, which, when coupled with HyperBand (a multi-armed bandit strategy), leads to improvement over existing hyperparameter optimization methods. Experimental results on image datasets such as CIFAR-10 confirm the benefits of our approach. In the second part of this paper, we establish a connection between NAS and structured sparse recovery. Building upon "one-shot" approaches in NAS, we propose a novel algorithm that we call CoNAS by merging ideas…
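To make the phrase "group-sparse recovery" concrete, the following is a minimal generic sketch of a group lasso solved by proximal gradient descent. It is not the paper's algorithm: the paper's hyperparameter encoding and its coupling with HyperBand are not reproduced here, and the function names, the random ±1 measurement matrix, the group sizes, and the value of `lam` are all assumptions chosen for illustration.

```python
import numpy as np

def group_soft_threshold(x, groups, tau):
    """Shrink each group's l2 norm by tau; groups with norm <= tau become zero."""
    out = np.zeros_like(x)
    for idx in groups:
        norm = np.linalg.norm(x[idx])
        if norm > tau:
            out[idx] = (1.0 - tau / norm) * x[idx]
    return out

def group_lasso_ista(A, y, groups, lam, n_iter=500):
    """Minimise 0.5*||A x - y||^2 + lam * sum_g ||x_g||_2 by proximal gradient."""
    # Step size 1/L, where L is the Lipschitz constant of the smooth term's gradient.
    step = 1.0 / np.linalg.norm(A, 2) ** 2
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)
        x = group_soft_threshold(x - step * grad, groups, step * lam)
    return x

def demo(seed=0):
    """Hypothetical toy problem: 6 groups of 4 coefficients, only group 1 active."""
    rng = np.random.default_rng(seed)
    groups = [np.arange(4 * g, 4 * g + 4) for g in range(6)]
    x_true = np.zeros(24)
    x_true[groups[1]] = [1.0, -2.0, 1.5, 0.5]
    A = rng.choice([-1.0, 1.0], size=(60, 24))  # assumed random +/-1 measurements
    y = A @ x_true                              # noiseless observations
    x_hat = group_lasso_ista(A, y, groups, lam=0.5)
    energies = [float(np.linalg.norm(x_hat[idx])) for idx in groups]
    return x_true, x_hat, energies
```

Calling `demo()` returns the ground-truth vector, the recovered vector, and the per-group l2 energies of the estimate; in this toy setting the energy should concentrate on the single active group. The group penalty is what distinguishes this from ordinary lasso: coefficients are kept or discarded a whole group at a time rather than individually.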

Taxonomy

Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Machine Learning and Algorithms

Methods: Hyper-parameter optimization