PEANO-ViT: Power-Efficient Approximations of Non-Linearities in Vision Transformers
Mohammad Erfan Sadeghi, Arash Fayyazi, Seyedarmin Azizi, Massoud, Pedram

TL;DR
PEANO-ViT introduces power-efficient approximations for non-linear functions in Vision Transformers, enabling FPGA deployment with minimal accuracy loss and significant reductions in computational resource usage.
Contribution
It proposes novel division-free and approximation techniques for layer normalization, softmax, and GELU functions tailored for FPGA implementation.
Findings
Achieves up to 8.01x power efficiency improvement in non-linear functions.
Maintains less than 0.5% accuracy degradation on DeiT-B.
Reduces DSP, LUT, and register usage significantly.
Abstract
The deployment of Vision Transformers (ViTs) on hardware platforms, specially Field-Programmable Gate Arrays (FPGAs), presents many challenges, which are mainly due to the substantial computational and power requirements of their non-linear functions, notably layer normalization, softmax, and Gaussian Error Linear Unit (GELU). These critical functions pose significant obstacles to efficient hardware implementation due to their complex mathematical operations and the inherent resource count and architectural limitations of FPGAs. PEANO-ViT offers a novel approach to streamlining the implementation of the layer normalization layer by introducing a division-free technique that simultaneously approximates the division and square root function. Additionally, PEANO-ViT provides a multi-scale division strategy to eliminate division operations in the softmax layer, aided by a Pade-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Image Processing Techniques and Applications · Advanced Memory and Neural Computing
MethodsSoftmax · Layer Normalization
