Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data

Felipe Petroski Such; Aditya Rawal; Joel Lehman; Kenneth O. Stanley; Jeff Clune

arXiv:1912.07768·cs.LG·December 18, 2019·48 cites

TL;DR

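This paper introduces Generative Teaching Networks (GTNs), deep neural networks that generate synthetic training data for a learner and are updated via meta-gradients so that the learner performs well on a target task. The authors report faster learning and a more efficient neural architecture search (GTN-NAS) that reaches competitive performance with less computation.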

Contribution

The paper presents GTNs as a novel approach to automatically generate training data and environments, enabling faster learning and more efficient neural architecture search.

Findings

01 GTNs can substantially accelerate learning processes.

02 GTN-NAS improves neural architecture search efficiency.

03 GTNs achieve competitive performance with less computation.

Abstract

This paper investigates the intriguing question of whether we can create learning algorithms that automatically generate training data, learning environments, and curricula in order to help AI agents rapidly learn. We show that such algorithms are possible via Generative Teaching Networks (GTNs), a general approach that is, in theory, applicable to supervised, unsupervised, and reinforcement learning, although our experiments only focus on the supervised case. GTNs are deep neural networks that generate data and/or training environments that a learner (e.g. a freshly initialized neural network) trains on for a few SGD steps before being tested on a target task. We then differentiate through the entire learning process via meta-gradients to update the GTN parameters to improve performance on the target task. GTNs have the beneficial property that they can theoretically generate any type…
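The loop the abstract describes (generate data, train a fresh learner for a few SGD steps, evaluate on the target task, push the gradient back through training to the generator) can be illustrated with a deliberately tiny sketch. This is not the authors' implementation. It assumes the "teacher" is a single scalar `theta` (the label of one synthetic example) rather than a deep generator network, the learner is a one-weight linear model, and the derivative through the unrolled SGD steps is tracked by hand instead of by an autodiff library. All function and variable names are hypothetical.

```python
# Toy sketch of the GTN idea in pure Python (all names are illustrative).
# Teacher parameter: theta, the label of one synthetic example (x=1, y=theta).
# Learner: a single weight w, trained from scratch on that synthetic example.

def train_learner(theta, inner_steps=3, inner_lr=0.5):
    """Run a few SGD steps on the synthetic example.

    Returns the final weight w and dw/dtheta, where the derivative is
    propagated through every SGD update (forward-mode, done by hand).
    """
    w, dw_dtheta = 0.0, 0.0  # fresh learner; initially independent of theta
    for _ in range(inner_steps):
        grad_w = w - theta               # d/dw of 0.5 * (w * 1 - theta) ** 2
        dgrad_dtheta = dw_dtheta - 1.0   # derivative of grad_w w.r.t. theta
        w = w - inner_lr * grad_w
        dw_dtheta = dw_dtheta - inner_lr * dgrad_dtheta
    return w, dw_dtheta


def target_loss_and_grad(w, real_data):
    """Mean squared error of the trained learner on real data, and dL/dw."""
    n = len(real_data)
    loss = sum(0.5 * (w * x - y) ** 2 for x, y in real_data) / n
    dloss_dw = sum((w * x - y) * x for x, y in real_data) / n
    return loss, dloss_dw


def meta_train(real_data, outer_steps=200, outer_lr=0.5):
    """Update theta with the meta-gradient dL/dtheta = dL/dw * dw/dtheta."""
    theta = 0.0
    for _ in range(outer_steps):
        w, dw_dtheta = train_learner(theta)
        _, dloss_dw = target_loss_and_grad(w, real_data)
        theta -= outer_lr * dloss_dw * dw_dtheta
    return theta


# Target task (an assumption for this sketch): fit y = 3x.
real_data = [(x, 3.0 * x) for x in (-1.0, 0.5, 2.0)]
theta = meta_train(real_data)
w, _ = train_learner(theta)
final_loss, _ = target_loss_and_grad(w, real_data)
print(f"theta={theta:.4f}  w={w:.4f}  target loss={final_loss:.2e}")
```

In this toy setting three SGD steps with learning rate 0.5 only move the learner 87.5% of the way toward the synthetic label, so the meta-learned `theta` settles near 3 / 0.875 ≈ 3.43 rather than at the true slope 3. The synthetic example is therefore not a copy of the real data; it is whatever makes the short training run end in the right place.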

Code & Models

Videos

Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data · SlidesLive

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

Methods: Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory · Stochastic Gradient Descent