Automated Architecture Design for Deep Neural Networks

Steven Abreu

arXiv:1908.10714·cs.LG·August 29, 2019·27 cites

Automated Architecture Design for Deep Neural Networks

Steven Abreu

PDF

Open Access

TL;DR

This paper explores automated methods for designing deep neural network architectures to reduce complexity and improve efficiency, addressing the manual trial-and-error process traditionally used.

Contribution

It introduces novel approaches for automating neural network architecture design, aiming to simplify and optimize the creation of deep learning models.

Findings

01

Automated design methods can produce less complex neural networks.

02

Automated approaches achieve competitive performance.

03

Reduction in manual effort for neural network architecture creation.

Abstract

Machine learning has made tremendous progress in recent years and received large amounts of public attention. Though we are still far from designing a full artificially intelligent agent, machine learning has brought us many applications in which computers solve human learning tasks remarkably well. Much of this progress comes from a recent trend within machine learning, called deep learning. Deep learning models are responsible for many state-of-the-art applications of machine learning. Despite their success, deep learning models are hard to train, very difficult to understand, and often times so complex that training is only possible on very large GPU clusters. Lots of work has been done on enabling neural networks to learn efficiently. However, the design and architecture of such neural networks is often done manually through trial and error and expert knowledge. This thesis inspects…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Neural Networks and Applications