Hyper-parameter optimization of Deep Convolutional Networks for object   recognition

Sachin S. Talathi

arXiv:1501.07645·cs.CV·May 19, 2015

Hyper-parameter optimization of Deep Convolutional Networks for object recognition

Sachin S. Talathi

PDF

TL;DR

This paper explores the use of sequential model-based optimization (SMBO) to automatically tune hyper-parameters of deep convolutional networks for object recognition, achieving competitive results on benchmark datasets.

Contribution

It introduces a simple SMBO strategy for architecture search in deep convolutional networks, demonstrating its effectiveness in finding high-performing models.

Findings

01

SMBO can identify architectures with performance comparable to state-of-the-art.

02

The proposed method effectively automates hyper-parameter tuning for DCNs.

03

Results are validated on standard object recognition benchmarks.

Abstract

Recently sequential model based optimization (SMBO) has emerged as a promising hyper-parameter optimization strategy in machine learning. In this work, we investigate SMBO to identify architecture hyper-parameters of deep convolution networks (DCNs) object recognition. We propose a simple SMBO strategy that starts from a set of random initial DCN architectures to generate new architectures, which on training perform well on a given dataset. Using the proposed SMBO strategy we are able to identify a number of DCN architectures that produce results that are comparable to state-of-the-art results on object recognition benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.