Universal Neural Functionals
Allan Zhou, Chelsea Finn, James Harrison

TL;DR
This paper introduces universal neural functionals (UNFs), algorithms that automatically create permutation equivariant models for any neural network weight space, improving learned optimizers for various architectures.
Contribution
It presents a novel algorithm to construct permutation equivariant models applicable to any neural network architecture's weight space.
Findings
UNFs improve optimization of small image classifiers.
UNFs enhance language model training.
Open-source library available for constructing UNFs.
Abstract
A challenging problem in many modern machine learning tasks is to process weight-space features, i.e., to transform or extract information from the weights and gradients of a neural network. Recent works have developed promising weight-space models that are equivariant to the permutation symmetries of simple feedforward networks. However, they are not applicable to general architectures, since the permutation symmetries of a weight space can be complicated by recurrence or residual connections. This work proposes an algorithm that automatically constructs permutation equivariant models, which we refer to as universal neural functionals (UNFs), for any weight space. Among other applications, we demonstrate how UNFs can be substituted into existing learned optimizer designs, and find promising improvements over prior methods when optimizing small image classifiers and language models. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsLib
