NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN   Accelerator

Mohit Upadhyay; Rohan Juneja; Weng-Fai Wong; Li-Shiuan Peh

arXiv:2405.04206·cs.AR·May 8, 2024

NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator

Mohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh

PDF

Open Access

TL;DR

NOVA is a NoC-based vector unit designed to efficiently map attention layers onto neural network accelerators, significantly improving power efficiency for edge AI applications.

Contribution

This work introduces NOVA, a novel NoC-based vector unit that enables efficient mapping of attention layers on existing neural accelerators, addressing non-linear operation challenges.

Findings

01

NOVA achieves up to 37.8x power efficiency improvement.

02

NOVA effectively maps attention layers onto existing accelerators.

03

The architecture enhances utilization of vector units for non-linear operations.

Abstract

Attention mechanisms are becoming increasingly popular, being used in neural network models in multiple domains such as natural language processing (NLP) and vision applications, especially at the edge. However, attention layers are difficult to map onto existing neuro accelerators since they have a much higher density of non-linear operations, which lead to inefficient utilization of today's vector units. This work introduces NOVA, a NoC-based Vector Unit that can perform non-linear operations within the NoC of the accelerators, and can be overlaid onto existing neuro accelerators to map attention layers at the edge. Our results show that the NOVA architecture is up to 37.8x more power-efficient than state-of-the-art hardware approximators when running existing attention-based neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Image Processing Techniques and Applications · Advanced Neural Network Applications