Equivariant Architectures for Learning in Deep Weight Spaces

Aviv Navon; Aviv Shamsian; Idan Achituve; Ethan Fetaya; Gal Chechik,; Haggai Maron

arXiv:2301.12780·cs.LG·June 2, 2023·5 cites

Equivariant Architectures for Learning in Deep Weight Spaces

Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik,, Haggai Maron

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a novel neural network architecture designed to process deep weight spaces by leveraging permutation equivariance, enabling tasks like domain adaptation and object editing in function representations.

Contribution

It presents the first architecture for learning in deep weight spaces that is equivariant to permutation symmetries, with a full characterization of invariant and equivariant layers.

Findings

01

Outperforms baseline models in various learning tasks

02

Effectively encodes permutation symmetries in weight spaces

03

Demonstrates versatility in adapting pre-trained networks

Abstract

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very challenging. If successful, such architectures would be capable of performing a wide range of intriguing tasks, from adapting a pre-trained network to a new domain to editing objects represented as functions (INRs or NeRFs). As a first step towards this goal, we present here a novel network architecture for learning in deep weight spaces. It takes as input a concatenation of weights and biases of a pre-trained MLP and processes it using a composition of layers that are equivariant to the natural permutation symmetry of the MLP's weights: Changing the order of neurons in intermediate layers of the MLP does not affect the function it represents. We provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Equivariant Architectures for Learning in Deep Weight Spaces· slideslive

Taxonomy

TopicsMachine Learning in Materials Science · Neural Networks and Applications · Machine Learning in Bioinformatics