Designing Neural Network Architectures using Reinforcement Learning

Bowen Baker; Otkrist Gupta; Nikhil Naik; Ramesh Raskar

arXiv:1611.02167·cs.LG·March 24, 2017·424 cites



TL;DR

This paper presents MetaQNN, a reinforcement learning-based method that automatically designs CNN architectures, outperforming handcrafted models and existing meta-modeling approaches on image classification benchmarks.

Contribution

Introduction of MetaQNN, a reinforcement learning algorithm that automatically generates high-performing CNN architectures, reducing reliance on human expertise and manual experimentation.

Findings

01. MetaQNN-designed networks outperform existing networks built from the same standard layer types (convolution, pooling, and fully-connected).

02. The method is competitive with state-of-the-art complex models.

03. MetaQNN outperforms existing meta-modeling approaches.

Abstract

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using $Q$-learning with an $\epsilon$-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and…
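The abstract's core loop (sequential layer selection with $Q$-learning, $\epsilon$-greedy exploration, and experience replay) can be illustrated with a small tabular sketch. This is not the authors' implementation. The layer vocabulary, the depth limit `MAX_DEPTH`, the linear $\epsilon$ decay, the hyperparameters, and above all `toy_reward` (a hand-written stand-in for the validation accuracy of a trained network) are assumptions made purely for illustration.

```python
import random
from collections import defaultdict

# Hypothetical toy layer vocabulary; the paper's real search space is richer.
LAYERS = ["conv", "pool", "fc"]
TERMINATE = "terminate"
MAX_DEPTH = 4  # assumed depth limit so the search space stays finite


def actions(state):
    """Actions available from a state of the form (depth, last_layer)."""
    depth, _ = state
    if depth >= MAX_DEPTH:
        return [TERMINATE]
    return LAYERS + [TERMINATE]


def step(state, action):
    """Deterministic transition: appending a layer increases depth by one."""
    depth, _ = state
    return (depth + 1, action)


def toy_reward(architecture):
    """Assumed stand-in for validation accuracy; no network is trained here."""
    score = 0.1 * architecture.count("conv") + 0.05 * architecture.count("pool")
    if architecture and architecture[-1] == "fc":
        score += 0.2
    return score


def sample_architecture(q, epsilon, rng):
    """Roll out one architecture with an epsilon-greedy policy over Q."""
    state, trajectory = (0, "start"), []
    while True:
        choices = actions(state)
        if rng.random() < epsilon:
            action = rng.choice(choices)  # explore
        else:
            action = max(choices, key=lambda a: q[(state, a)])  # exploit
        trajectory.append((state, action))
        if action == TERMINATE:
            return trajectory
        state = step(state, action)


def q_update(q, trajectory, reward, alpha=0.1, gamma=1.0):
    """Apply the Q-learning update backwards along one stored trajectory."""
    for state, action in reversed(trajectory):
        if action == TERMINATE:
            target = reward  # reward arrives only when the network is complete
        else:
            nxt = step(state, action)
            target = gamma * max(q[(nxt, a)] for a in actions(nxt))
        q[(state, action)] += alpha * (target - q[(state, action)])


def train(episodes=300, replay_samples=10, seed=0):
    rng = random.Random(seed)
    q = defaultdict(float)
    replay = []  # experience replay buffer of (trajectory, reward) pairs
    for episode in range(episodes):
        epsilon = max(0.1, 1.0 - episode / episodes)  # assumed linear decay
        trajectory = sample_architecture(q, epsilon, rng)
        layers = [a for _, a in trajectory if a != TERMINATE]
        replay.append((trajectory, toy_reward(layers)))
        for _ in range(replay_samples):
            past_trajectory, past_reward = rng.choice(replay)
            q_update(q, past_trajectory, past_reward)
    return q


def greedy_architecture(q):
    """Read off the architecture that the learned Q-values prefer."""
    trajectory = sample_architecture(q, epsilon=0.0, rng=random.Random(0))
    return [a for _, a in trajectory if a != TERMINATE]
```

Calling `greedy_architecture(train())` returns a list of layer names of length at most `MAX_DEPTH`. In a realistic setting, `toy_reward` would be replaced by training the sampled network and measuring its validation accuracy, which is the expensive step that makes replaying stored experiences worthwhile.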


Taxonomy

Topics: Adversarial Robustness in Machine Learning · Machine Learning and Data Classification · Reinforcement Learning in Robotics