On-Chip Learning via Transformer In-Context Learning

Jan Finkbeiner; Emre Neftci

arXiv:2410.08711·cs.NE·October 14, 2024

On-Chip Learning via Transformer In-Context Learning

Jan Finkbeiner, Emre Neftci

PDF

Open Access

TL;DR

This paper introduces a neuromorphic transformer model with on-chip plasticity that enables in-context learning and adaptation, reducing memory transfer bottlenecks and highlighting the potential for hardware-efficient, local learning rules.

Contribution

It presents a novel neuromorphic transformer architecture utilizing on-chip plasticity for in-context learning, demonstrating hardware-efficient, local learning rules on Loihi 2.

Findings

01

Successful demonstration of in-context learning on Loihi 2

02

Reduced memory transfer through on-chip plasticity

03

Highlighting the importance of pretrained models for local learning

Abstract

Autoregressive decoder-only transformers have become key components for scalable sequence processing and generation models. However, the transformer's self-attention mechanism requires transferring prior token projections from the main memory at each time step (token), thus severely limiting their performance on conventional processors. Self-attention can be viewed as a dynamic feed-forward layer, whose matrix is input sequence-dependent similarly to the result of local synaptic plasticity. Using this insight, we present a neuromorphic decoder-only transformer model that utilizes an on-chip plasticity processor to compute self-attention. Interestingly, the training of transformers enables them to ``learn'' the input context during inference. We demonstrate this in-context learning ability of transformers on the Loihi 2 processor by solving a few-shot classification problem. With this we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnalog and Mixed-Signal Circuit Design · Embedded Systems Design Techniques · Neural Networks and Applications