TL;DR
This paper introduces quasi-equivariance in metanetworks, allowing more flexible symmetry preservation in neural architectures to improve expressivity and robustness.
Contribution
It proposes the concept of quasi-equivariance, extending strict equivariance to better preserve functional identity while enhancing model expressiveness.
Findings
Quasi-equivariant metanetworks outperform strictly equivariant models in expressivity.
The framework applies broadly across feedforward, convolutional, and transformer architectures.
Empirical results show improved trade-offs between symmetry and expressiveness.
Abstract
Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping is inherently non-injective: distinct parameter configurations may yield identical input-output behaviors. As a result, metanetworks that rely solely on raw parameters risk overlooking the intrinsic symmetries of the architecture. Reasoning about functional identity is therefore essential for effective metanetwork design, motivating the development of equivariant metanetworks, which incorporate equivariance principles to respect architectural symmetries. Existing approaches, however, typically enforce strict equivariance, which imposes rigid constraints and often leads to sparse and less expressive models. To address this limitation, we introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
