PEANO-ViT: Power-Efficient Approximations of Non-Linearities in Vision   Transformers

Mohammad Erfan Sadeghi; Arash Fayyazi; Seyedarmin Azizi; Massoud; Pedram

arXiv:2406.14854·cs.CV·August 19, 2024

PEANO-ViT: Power-Efficient Approximations of Non-Linearities in Vision Transformers

Mohammad Erfan Sadeghi, Arash Fayyazi, Seyedarmin Azizi, Massoud, Pedram

PDF

Open Access

TL;DR

PEANO-ViT introduces power-efficient approximations for non-linear functions in Vision Transformers, enabling FPGA deployment with minimal accuracy loss and significant reductions in computational resource usage.

Contribution

It proposes novel division-free and approximation techniques for layer normalization, softmax, and GELU functions tailored for FPGA implementation.

Findings

01

Achieves up to 8.01x power efficiency improvement in non-linear functions.

02

Maintains less than 0.5% accuracy degradation on DeiT-B.

03

Reduces DSP, LUT, and register usage significantly.

Abstract

The deployment of Vision Transformers (ViTs) on hardware platforms, specially Field-Programmable Gate Arrays (FPGAs), presents many challenges, which are mainly due to the substantial computational and power requirements of their non-linear functions, notably layer normalization, softmax, and Gaussian Error Linear Unit (GELU). These critical functions pose significant obstacles to efficient hardware implementation due to their complex mathematical operations and the inherent resource count and architectural limitations of FPGAs. PEANO-ViT offers a novel approach to streamlining the implementation of the layer normalization layer by introducing a division-free technique that simultaneously approximates the division and square root function. Additionally, PEANO-ViT provides a multi-scale division strategy to eliminate division operations in the softmax layer, aided by a Pade-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Image Processing Techniques and Applications · Advanced Memory and Neural Computing

MethodsSoftmax · Layer Normalization