Delta Networks for Optimized Recurrent Network Computation
Daniel Neil, Jun Haeng Lee, Tobi Delbruck, Shih-Chii Liu

TL;DR
This paper introduces delta networks, a novel RNN architecture that transmits neuron values only when significant changes occur, leading to substantial reductions in computation and memory access with minimal accuracy loss.
Contribution
The paper proposes delta networks for RNNs, optimizing their execution by transmitting only significant activation changes, and demonstrates substantial speedups on multiple benchmarks.
Findings
9X reduction in cost for TIDIGITS with negligible accuracy loss
5.7X speedup on Wall Street Journal speech recognition
100X reduction in RNN cost for driving angle prediction
Abstract
Many neural networks exhibit stability in their activation patterns over time in response to inputs from sensors operating under real-world conditions. By capitalizing on this property of natural signals, we propose a Recurrent Neural Network (RNN) architecture called a delta network in which each neuron transmits its value only when the change in its activation exceeds a threshold. The execution of RNNs as delta networks is attractive because their states must be stored and fetched at every timestep, unlike in convolutional neural networks (CNNs). We show that a naive run-time delta network implementation offers modest improvements on the number of memory accesses and computes, but optimized training techniques confer higher accuracy at higher speedup. With these optimizations, we demonstrate a 9X reduction in cost with negligible loss of accuracy for the TIDIGITS audio digit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Music and Audio Processing
