A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis
Damien Ferbach, Christos Tsirigotis, Gauthier Gidel, and Avishek, (Joey) Bose

TL;DR
This paper extends the Strong Lottery Ticket Hypothesis to general G-equivariant neural networks, providing a unified theoretical framework and empirical validation for pruning overparameterized models to match trained network performance.
Contribution
It generalizes the SLTH to G-equivariant networks, proves optimal overparameterization bounds, and applies the theory to various architectures including CNNs, GNNs, and steerable CNNs.
Findings
Theoretical proof of G-equivariant SLTH with high probability.
Optimal overparameterization bounds as a function of error tolerance.
Empirical validation on E(2)-steerable CNNs and GNNs matching trained network performance.
Abstract
The Strong Lottery Ticket Hypothesis (SLTH) stipulates the existence of a subnetwork within a sufficiently overparameterized (dense) neural network that -- when initialized randomly and without any training -- achieves the accuracy of a fully trained target network. Recent works by Da Cunha et. al 2022; Burkholz 2022 demonstrate that the SLTH can be extended to translation equivariant networks -- i.e. CNNs -- with the same level of overparametrization as needed for the SLTs in dense networks. However, modern neural networks are capable of incorporating more than just translation symmetry, and developing general equivariant architectures such as rotation and permutation has been a powerful design principle. In this paper, we generalize the SLTH to functions that preserve the action of the group -- i.e. -equivariant network -- and prove, with high probability, that one can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning in Materials Science · Machine Learning and Data Classification
MethodsPruning
