Geometric sparsification in recurrent neural networks
Wyatt Mackey, Ioannis Schizas, Jared Deighton, David L. Boothe, Jr.,, Vasileios Maroulas

TL;DR
This paper introduces a geometric regularization technique called moduli regularization for sparsifying recurrent neural networks, enabling high sparsity while maintaining performance across diverse tasks.
Contribution
It proposes a novel geometric regularization method that guides the sparsification of RNNs, providing a priori architecture insights and improved stability.
Findings
Achieves up to 90% sparsity with maintained performance in navigation tasks.
Induces more stable RNNs in NLP and addition tasks.
Demonstrates the effectiveness of geometric regularization across multiple domains.
Abstract
A common technique for ameliorating the computational costs of running large neural models is sparsification, or the pruning of neural connections during training. Sparse models are capable of maintaining the high accuracy of state of the art models, while functioning at the cost of more parsimonious models. The structures which underlie sparse architectures are, however, poorly understood and not consistent between differently trained models and sparsification schemes. In this paper, we propose a new technique for sparsification of recurrent neural nets (RNNs), called moduli regularization, in combination with magnitude pruning. Moduli regularization leverages the dynamical system induced by the recurrent structure to induce a geometric relationship between neurons in the hidden state of the RNN. By making our regularizing term explicitly geometric, we provide the first, to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Numerical Analysis Techniques
MethodsPruning
