Universal Neural Functionals

Allan Zhou; Chelsea Finn; James Harrison

arXiv:2402.05232 · cs.LG · February 9, 2024 · 2 cites

TL;DR

This paper introduces an algorithm that automatically constructs permutation equivariant models, called universal neural functionals (UNFs), for any neural network weight space. Substituted into existing learned optimizer designs, UNFs show promising improvements when optimizing small image classifiers and language models.

Contribution

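The paper presents an algorithm that automatically constructs permutation equivariant models for the weight space of any neural network architecture, including architectures whose permutation symmetries are complicated by recurrence or residual connections.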

Findings

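01. Substituted into existing learned optimizer designs, UNFs show promising improvements over prior methods when optimizing small image classifiers.

02. The same substitution shows promising improvements over prior methods when optimizing language models.

03. An open-source library is available for constructing UNFs.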

Abstract

A challenging problem in many modern machine learning tasks is to process weight-space features, i.e., to transform or extract information from the weights and gradients of a neural network. Recent works have developed promising weight-space models that are equivariant to the permutation symmetries of simple feedforward networks. However, they are not applicable to general architectures, since the permutation symmetries of a weight space can be complicated by recurrence or residual connections. This work proposes an algorithm that automatically constructs permutation equivariant models, which we refer to as universal neural functionals (UNFs), for any weight space. Among other applications, we demonstrate how UNFs can be substituted into existing learned optimizer designs, and find promising improvements over prior methods when optimizing small image classifiers and language models. Our…
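To make the terms in the abstract concrete, here is a minimal sketch of a permutation symmetry in the simplest case, a two-layer feedforward network, together with a toy equivariant map. It uses only the Python standard library and is not the paper's construction or its library's API. The names `mlp`, `permute_hidden`, and `toy_equivariant_layer`, the `tanh` activation, and the coefficients `a` and `c` are all assumptions made for illustration.

```python
import math
import random

def mlp(x, W1, b1, W2):
    """Two-layer MLP, y = W2 @ tanh(W1 @ x + b1), using plain Python lists."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) for row in W2]

def permute_hidden(W1, b1, W2, perm):
    """Reorder hidden neurons: rows of W1, entries of b1, columns of W2."""
    W1_p = [W1[j] for j in perm]
    b1_p = [b1[j] for j in perm]
    W2_p = [[row[j] for j in perm] for row in W2]
    return W1_p, b1_p, W2_p

def toy_equivariant_layer(b, a=0.5, c=2.0):
    """Hypothetical DeepSets-style map on a bias vector: a * b_i + c * mean(b).

    Illustrative only; it is not the layer constructed in the paper.
    """
    mean = sum(b) / len(b)
    return [a * bi + c * mean for bi in b]

def close(u, v):
    """Elementwise comparison that tolerates floating-point reordering."""
    return all(math.isclose(p, q, rel_tol=1e-9, abs_tol=1e-9)
               for p, q in zip(u, v))

rng = random.Random(0)
n_in, n_hidden, n_out = 3, 4, 2
W1 = [[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_hidden)]
b1 = [rng.gauss(0, 1) for _ in range(n_hidden)]
W2 = [[rng.gauss(0, 1) for _ in range(n_hidden)] for _ in range(n_out)]
x = [rng.gauss(0, 1) for _ in range(n_in)]
perm = [2, 0, 3, 1]  # an arbitrary reordering of the 4 hidden neurons

W1_p, b1_p, W2_p = permute_hidden(W1, b1, W2, perm)

# Symmetry: the permuted weights compute the same function.
same_function = close(mlp(x, W1, b1, W2), mlp(x, W1_p, b1_p, W2_p))

# Equivariance: permuting then mapping equals mapping then permuting.
mapped = toy_equivariant_layer(b1)
equivariant = close([mapped[j] for j in perm], toy_equivariant_layer(b1_p))

print(same_function, equivariant)
```

In this feedforward case each hidden layer can be permuted independently. With recurrence or residual connections, as the abstract notes, the symmetries become more complicated, which is the situation the paper's algorithm is designed to handle automatically.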

Code & Models

Repositories

allanyangzhou/universal_neural_functional (JAX, official)

Taxonomy

Topics: Neural Networks and Applications

Methods: Lib