DeepArchitect: Automatically Designing and Training Deep Architectures
Renato Negrinho, Geoff Gordon

TL;DR
DeepArchitect introduces an automated framework for designing and training deep neural network architectures, enabling efficient exploration of complex search spaces with various algorithms, outperforming random search on CIFAR-10.
Contribution
The paper presents a modular language for representing complex architecture search spaces and integrates multiple search algorithms, advancing automated model discovery.
Findings
MCTS and SMBO outperform random search on CIFAR-10
Framework effectively discovers competitive models with minimal human effort
Search space structure facilitates efficient exploration
Abstract
In deep learning, performance is strongly affected by the choice of architecture and hyperparameters. While there has been extensive work on automatic hyperparameter optimization for simple spaces, complex spaces such as the space of deep architectures remain largely unexplored. As a result, the choice of architecture is done manually by the human expert through a slow trial and error process guided mainly by intuition. In this paper we describe a framework for automatically designing and training deep models. We propose an extensible and modular language that allows the human expert to compactly represent complex search spaces over architectures and their hyperparameters. The resulting search spaces are tree-structured and therefore easy to traverse. Models can be automatically compiled to computational graphs once values for all hyperparameters have been chosen. We can leverage the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Artificial Intelligence in Games · Advanced Multi-Objective Optimization Algorithms
