Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum
Caleb Zheng, Eli Shlizerman

TL;DR
This paper introduces LS-based Hyperpruning (LSH), a novel method that efficiently identifies high-performing pruned RNN variants by predicting their accuracy early, significantly reducing search time while surpassing baseline models.
Contribution
The paper proposes a Lyapunov Spectrum-based distance metric integrated with hyperparameter optimization to efficiently find optimal pruned RNNs, outperforming traditional methods.
Findings
LSH reduces search time by an order of magnitude.
Pruned models outperform baseline and dense models.
Method effective on multiple RNN architectures and datasets.
Abstract
A variety of pruning methods have been introduced for over-parameterized Recurrent Neural Networks to improve efficiency in terms of power consumption and storage utilization. These advances motivate a new paradigm, termed `hyperpruning', which seeks to identify the most suitable pruning strategy for a given network architecture and application. Unlike conventional hyperparameter search, where the optimal configuration's accuracy remains uncertain, in the context of network pruning, the accuracy of the dense model sets the target for the accuracy of the pruned one. The goal, therefore, is to discover pruned variants that match or even surpass this established accuracy. However, exhaustive search over pruning configurations is computationally expensive and lacks early performance guarantees. To address this challenge, we propose a novel Lyapunov Spectrum (LS)-based distance metric that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Big Data and Digital Economy
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Pruning
