Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data
Felipe Petroski Such, Aditya Rawal, Joel Lehman, Kenneth O. Stanley,, Jeff Clune

TL;DR
This paper introduces Generative Teaching Networks (GTNs), neural networks that generate training data to accelerate learning and neural architecture search, demonstrating significant speed-ups and improved performance over existing methods.
Contribution
The paper presents GTNs as a novel approach to automatically generate training data and environments, enabling faster learning and more efficient neural architecture search.
Findings
GTNs can substantially accelerate learning processes.
GTN-NAS improves neural architecture search efficiency.
GTNs achieve competitive performance with less computation.
Abstract
This paper investigates the intriguing question of whether we can create learning algorithms that automatically generate training data, learning environments, and curricula in order to help AI agents rapidly learn. We show that such algorithms are possible via Generative Teaching Networks (GTNs), a general approach that is, in theory, applicable to supervised, unsupervised, and reinforcement learning, although our experiments only focus on the supervised case. GTNs are deep neural networks that generate data and/or training environments that a learner (e.g. a freshly initialized neural network) trains on for a few SGD steps before being tested on a target task. We then differentiate through the entire learning process via meta-gradients to update the GTN parameters to improve performance on the target task. GTNs have the beneficial property that they can theoretically generate any type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory · Stochastic Gradient Descent
