Neural networks adapting to datasets: learning network size and topology

Romuald A. Janik; Aleksandra Nowak

arXiv:2006.12195·cs.LG·July 16, 2020

Neural networks adapting to datasets: learning network size and topology

Romuald A. Janik, Aleksandra Nowak

PDF

Open Access

TL;DR

This paper presents a method for neural networks to learn their size and topology during training, resulting in task-specific graph structures that encode dataset characteristics.

Contribution

It introduces a flexible training setup enabling neural networks to adapt their structure dynamically, which is a novel approach to architecture learning during standard training.

Findings

01

Networks can be trained from scratch with similar performance.

02

The resulting architectures encode dataset-specific properties.

03

Systematic regularities observed across different datasets.

Abstract

We introduce a flexible setup allowing for a neural network to learn both its size and topology during the course of a standard gradient-based training. The resulting network has the structure of a graph tailored to the particular learning task and dataset. The obtained networks can also be trained from scratch and achieve virtually identical performance. We explore the properties of the network architectures for a number of datasets of varying difficulty observing systematic regularities. The obtained graphs can be therefore understood as encoding nontrivial characteristics of the particular classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning