Continuously Constructive Deep Neural Networks

Ozan İrsoy; Ethem Alpaydın

arXiv:1804.02491·cs.LG·April 10, 2018

TL;DR

This paper introduces two methods that adapt a neural network's architecture during training by adjusting its complexity continuously, demonstrated on the synthetic two-spirals data and on the real MNIST and MIRFLICKR data sets.

Contribution

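The paper encodes depth, or additional complexity, continuously in the parameter space through control parameters, so the network structure is updated while the weights are learned. Tunnel networks apply this at the level of a hidden unit, and budding perceptrons at the level of a network layer.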

Findings

01. The methods adjust network complexity to the difficulty of the task.

02. They are applied to the synthetic two-spirals data and to the real MNIST and MIRFLICKR data sets.

03. They achieve the correct complexity without hyperparameter tuning.

Abstract

Traditionally, deep learning algorithms update the network weights whereas the network architecture is chosen manually, using a process of trial and error. In this work, we propose two novel approaches that automatically update the network structure while also learning its weights. The novelty of our approach lies in our parameterization where the depth, or additional complexity, is encapsulated continuously in the parameter space through control parameters that add additional complexity. We propose two methods: In tunnel networks, this selection is done at the level of a hidden unit, and in budding perceptrons, this is done at the level of a network layer; updating this control parameter introduces either another hidden unit or another hidden layer. We show the effectiveness of our methods on the synthetic two-spirals data and on two real data sets of MNIST and MIRFLICKR, where we see…
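
The idea of a control parameter that continuously introduces extra structure can be illustrated with a small sketch. This is not the authors' formulation: the blending rule, the sigmoid gate, and the names `gated_layer` and `control` are all assumptions made for illustration. The sketch mixes an identity path with a tanh hidden layer, so that a very negative `control` leaves the input nearly unchanged (no added layer) and a very positive `control` behaves like a full nonlinear layer.

```python
import math


def sigmoid(z):
    """Logistic function, used here to squash the control parameter into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def gated_layer(x, weights, bias, control):
    """Blend an identity path with a tanh layer (hypothetical gating rule).

    g = sigmoid(control) is close to 0 when the layer is effectively absent
    and close to 1 when the layer is fully present. Because g is a smooth
    function of `control`, it could be trained by gradient descent together
    with the weights.
    """
    g = sigmoid(control)
    hidden = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, bias)
    ]
    # Convex combination of the pass-through input and the nonlinear output.
    return [(1.0 - g) * xi + g * hi for xi, hi in zip(x, hidden)]


x = [0.5, -1.0]
W = [[1.0, 0.0], [0.0, 1.0]]  # square weights so both paths share a dimension
b = [0.0, 0.0]

almost_identity = gated_layer(x, W, b, control=-20.0)  # gate nearly closed
almost_full = gated_layer(x, W, b, control=20.0)       # gate nearly open
```

With the gate nearly closed the output stays close to `x`, and with the gate nearly open it approaches `tanh(Wx + b)`. The identity path requires the layer's input and output to have the same size, which is a simplification of this sketch rather than a constraint stated in the abstract.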
