Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors
Putri A. van der Linden, Alejandro Garc\'ia-Castellanos, Sharvaree, Vadgama, Thijs P. Kuipers, Erik J. Bekkers

TL;DR
This paper introduces a method to learn symmetry-based weight sharing in neural networks using doubly stochastic matrices, enabling dynamic discovery of symmetries and improving model flexibility.
Contribution
It proposes a novel approach to learn weight-sharing schemes via learnable doubly stochastic matrices, generalizing group equivariance to discover symmetries adaptively.
Findings
Permutation matrices converge to regular group representations with strong dataset symmetries
Method effectively captures partial symmetries in data
Learned symmetries improve model generalization and robustness
Abstract
Group equivariance has emerged as a valuable inductive bias in deep learning, enhancing generalization, data efficiency, and robustness. Classically, group equivariant methods require the groups of interest to be known beforehand, which may not be realistic for real-world data. Additionally, baking in fixed group equivariance may impose overly restrictive constraints on model architecture. This highlights the need for methods that can dynamically discover and apply symmetries as soft constraints. For neural network architectures, equivariance is commonly achieved through group transformations of a canonical weight tensor, resulting in weight sharing over a given group . In this work, we propose to learn such a weight-sharing scheme by defining a collection of learnable doubly stochastic matrices that act as soft permutation matrices on canonical weight tensors, which can take regular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Computational Physics and Python Applications
