LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
Zeyu Liu, Gourav Datta, Anni Li, Peter Anthony Beerel

TL;DR
LMUFormer is a low-complexity, high-performance spiking model that combines Legendre Memory Units with convolutional modules, achieving near state-of-the-art accuracy with significantly reduced parameters and computational cost for sequence learning tasks.
Contribution
The paper introduces LMUFormer, a novel architecture that enhances recurrent models with convolutional modules and a spiking version, balancing performance and efficiency for streaming applications.
Findings
Achieves comparable performance to SOTA transformers on SCv2 dataset.
Reduces parameters by 53 times and FLOPs by 65 times compared to transformer models.
Enables real-time data processing with a 32.03% reduction in sequence length.
Abstract
Transformer models have demonstrated high accuracy in numerous applications but have high complexity and lack sequential processing capability making them ill-suited for many streaming applications at the edge where devices are heavily resource-constrained. Thus motivated, many researchers have proposed reformulating the transformer models as RNN modules which modify the self-attention computation with explicit states. However, these approaches often incur significant performance degradation. The ultimate goal is to develop a model that has the following properties: parallel training, streaming and low-cost inference, and SOTA performance. In this paper, we propose a new direction to achieve this goal. We show how architectural modifications to a recurrent model can help push its performance toward Transformer models while retaining its sequential processing capability. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Parallel Computing and Optimization Techniques
MethodsAttention Is All You Need · Residual Connection · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Softmax · Absolute Position Encodings · Legendre Memory Unit
