FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network
Aditya Kusupati, Manish Singh, Kush Bhatia, Ashish Kumar, Prateek Jain, and Manik Varma

TL;DR
FastGRNN is a compact, efficient gated RNN that achieves high accuracy with significantly reduced model size, enabling deployment on resource-constrained devices and real-time applications.
Contribution
The paper introduces FastRNN and FastGRNN algorithms that improve training stability and prediction efficiency while drastically reducing model size, especially through low-rank, sparse, and quantized matrices.
Findings
FastGRNN models can be up to 35x smaller than existing gated RNNs.
FastGRNN accurately recognizes wakewords with only 1 KB model size.
Models are deployable on microcontrollers with severe resource constraints.
Abstract
This paper develops the FastRNN and FastGRNN algorithms to address the twin RNN limitations of inaccurate training and inefficient prediction. Previous approaches have improved accuracy at the expense of prediction costs making them infeasible for resource-constrained and real-time applications. Unitary RNNs have increased accuracy somewhat by restricting the range of the state transition matrix's singular values but have also increased the model size as they require a larger number of hidden units to make up for the loss in expressive power. Gated RNNs have obtained state-of-the-art accuracies by adding extra parameters thereby resulting in even larger models. FastRNN addresses these limitations by adding a residual connection that does not constrain the range of the singular values explicitly and has only two extra scalar parameters. FastGRNN then extends the residual connection to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM
MethodsResidual Connection
