A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis

Damien Ferbach, Christos Tsirigotis, Gauthier Gidel, and Avishek (Joey) Bose

arXiv:2206.04270·cs.LG·February 17, 2023·1 cites

TL;DR

This paper extends the Strong Lottery Ticket Hypothesis to general G-equivariant neural networks, providing a unified theoretical framework and empirical validation for pruning overparameterized models to match trained network performance.

Contribution

It generalizes the SLTH to G-equivariant networks, proves optimal overparameterization bounds, and applies the theory to various architectures including CNNs, GNNs, and steerable CNNs.

Findings

01. A proof that the SLTH holds for G-equivariant networks with high probability.

02. Overparameterization bounds that are optimal as a function of the error tolerance.

03. Empirical validation on E(2)-steerable CNNs and GNNs, where pruned networks match trained network performance.

Abstract

The Strong Lottery Ticket Hypothesis (SLTH) stipulates the existence of a subnetwork within a sufficiently overparameterized (dense) neural network that -- when initialized randomly and without any training -- achieves the accuracy of a fully trained target network. Recent works by da Cunha et al. (2022) and Burkholz (2022) demonstrate that the SLTH can be extended to translation equivariant networks -- i.e., CNNs -- with the same level of overparameterization as needed for the SLTs in dense networks. However, modern neural networks are capable of incorporating more than just translation symmetry, and developing architectures equivariant to more general symmetries, such as rotation and permutation, has been a powerful design principle. In this paper, we generalize the SLTH to functions that preserve the action of the group $G$ -- i.e., $G$-equivariant networks -- and prove, with high probability, that one can…
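To make the notion of equivariance in the abstract concrete, here is a minimal sketch in plain Python. It is not the paper's construction or code. It checks the defining property f(g · x) = g · f(x) for the simplest case mentioned above, translation symmetry, using a 1-D circular convolution and the cyclic group of shifts. It also checks that zeroing some entries of a random, untrained kernel leaves the layer equivariant, because a masked kernel is still a convolution kernel. The helper names (`circular_conv`, `shift`, `is_equivariant`) and the mask values are assumptions chosen for illustration.

```python
import random

def circular_conv(x, kernel):
    """Circular (periodic) 1-D convolution of signal x with kernel."""
    n = len(x)
    return [sum(kernel[j] * x[(i - j) % n] for j in range(len(kernel)))
            for i in range(n)]

def shift(x, s):
    """Cyclic shift by s: the action of the cyclic group on a length-n signal."""
    n = len(x)
    return [x[(i - s) % n] for i in range(n)]

def is_equivariant(f, x, s, tol=1e-9):
    """Check f(g . x) == g . f(x) for the shift g = s."""
    lhs = f(shift(x, s))
    rhs = shift(f(x), s)
    return all(abs(a - b) <= tol for a, b in zip(lhs, rhs))

rng = random.Random(0)
n = 8
x = [rng.uniform(-1, 1) for _ in range(n)]

# Random, untrained kernel weights (illustrative only).
kernel = [rng.uniform(-1, 1) for _ in range(5)]

# Hypothetical pruning mask: 1 keeps a weight, 0 removes it.
mask = [1, 0, 1, 1, 0]
pruned = [m * w for m, w in zip(mask, kernel)]

# Test every group element (all n cyclic shifts) for both layers.
dense_ok = all(
    is_equivariant(lambda v: circular_conv(v, kernel), x, s) for s in range(n)
)
pruned_ok = all(
    is_equivariant(lambda v: circular_conv(v, pruned), x, s) for s in range(n)
)
print(dense_ok, pruned_ok)
```

The sketch covers only the translation case. For other groups such as rotations or permutations, `shift` would be replaced by the corresponding group action, and the layer by one that is equivariant to it.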

Videos

A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis · SlidesLive

Taxonomy

Topics: Advanced Neural Network Applications · Machine Learning in Materials Science · Machine Learning and Data Classification

Methods: Pruning