A Soft SIMD Based Energy Efficient Computing Microarchitecture
Pengbo Yu, Alexandre Levisse, Mohit Gupta, Evenblij Timon, Giovanni, Ansaloni, Francky Catthoor, David Atienza

TL;DR
This paper introduces a flexible, energy-efficient microarchitecture with a two-stage SIMD pipeline that dynamically supports operand quantization and zero-skipping, significantly reducing area and energy consumption for machine learning computations.
Contribution
It presents a novel microarchitecture enabling dynamic sub-word bitwidth adjustment and efficient SIMD operations, improving energy and area efficiency over traditional hardware SIMD solutions.
Findings
Up to 53.1% smaller area than traditional SIMD hardware
Performs multiplication up to 88.8% more efficiently
Supports fine-grained operand quantization dynamically
Abstract
The ever-increasing size and computational complexity of today's machine-learning algorithms pose an increasing strain on the underlying hardware. In this light, novel and dedicated architectural solutions are required to optimize energy efficiency by leveraging opportunities (such as intrinsic parallelism and robustness to quantization errors) exposed by algorithms. We herein address this challenge by introducing a flexible two-stages computing pipeline. The pipeline can support fine-grained operand quantization through software-supported Single Instruction Multiple Data (SIMD) operations. Moreover, it can efficiently execute sequential multiplications over SIMD sub-words thanks to zero-skipping and Canonical Signed Digit (CSD) coding. Finally, a lightweight repacking unit allows changing the bitwidth of sub-words at run-time dynamically. These features are implemented within a tight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Error Correcting Code Techniques · Network Packet Processing and Optimization
