LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre   Memory Units

Zeyu Liu; Gourav Datta; Anni Li; Peter Anthony Beerel

arXiv:2402.04882·cs.NE·February 8, 2024·2 cites

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units

Zeyu Liu, Gourav Datta, Anni Li, Peter Anthony Beerel

PDF

Open Access 1 Repo 1 Video

TL;DR

LMUFormer is a low-complexity, high-performance spiking model that combines Legendre Memory Units with convolutional modules, achieving near state-of-the-art accuracy with significantly reduced parameters and computational cost for sequence learning tasks.

Contribution

The paper introduces LMUFormer, a novel architecture that enhances recurrent models with convolutional modules and a spiking version, balancing performance and efficiency for streaming applications.

Findings

01

Achieves comparable performance to SOTA transformers on SCv2 dataset.

02

Reduces parameters by 53 times and FLOPs by 65 times compared to transformer models.

03

Enables real-time data processing with a 32.03% reduction in sequence length.

Abstract

Transformer models have demonstrated high accuracy in numerous applications but have high complexity and lack sequential processing capability making them ill-suited for many streaming applications at the edge where devices are heavily resource-constrained. Thus motivated, many researchers have proposed reformulating the transformer models as RNN modules which modify the self-attention computation with explicit states. However, these approaches often incur significant performance degradation. The ultimate goal is to develop a model that has the following properties: parallel training, streaming and low-cost inference, and SOTA performance. In this paper, we propose a new direction to achieve this goal. We show how architectural modifications to a recurrent model can help push its performance toward Transformer models while retaining its sequential processing capability. Specifically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zeyuliu1037/lmuformer
pytorchOfficial

Videos

LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units· slideslive

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Parallel Computing and Optimization Techniques

MethodsAttention Is All You Need · Residual Connection · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Softmax · Absolute Position Encodings · Legendre Memory Unit