Symmetry-Aware Graph Metanetwork Autoencoders: Model Merging through Parameter Canonicalization
Odysseas Boufalis, Jorge Carrasco-Pollo, Joshua Rosenthal, Eduardo Terres-Caballero, Alejandro Garc\'ia-Castellanos

TL;DR
This paper introduces a symmetry-aware autoencoder framework using ScaleGMNs that aligns neural networks under permutation and scaling symmetries, enabling effective model merging without complex assignment computations.
Contribution
It extends previous permutation symmetry methods by incorporating scaling symmetries, providing a more comprehensive approach to align and merge neural network models.
Findings
Aligns INRs and CNNs under symmetries without solving assignment problems
Enables smooth model interpolation within shared loss basins
Facilitates model merging through symmetry-aware alignment
Abstract
Neural network parameterizations exhibit inherent symmetries that yield multiple equivalent minima within the loss landscape. Scale Graph Metanetworks (ScaleGMNs) explicitly leverage these symmetries by proposing an architecture equivariant to both permutation and parameter scaling transformations. Previous work by Ainsworth et al. (2023) addressed permutation symmetries through a computationally intensive combinatorial assignment problem, demonstrating that leveraging permutation symmetries alone can map networks into a shared loss basin. In this work, we extend their approach by also incorporating scaling symmetries, presenting an autoencoder framework utilizing ScaleGMNs as invariant encoders. Experimental results demonstrate that our method aligns Implicit Neural Representations (INRs) and Convolutional Neural Networks (CNNs) under both permutation and scaling symmetries without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Graph Theory and Algorithms
