TL;DR
This paper evaluates the accuracy and efficiency of posit arithmetic compared to IEEE 754 FP32, demonstrating that 16-bit posits can match FP32 accuracy with significant speedups in neural network tasks.
Contribution
The paper introduces POSAR, a flexible posit arithmetic unit, and provides a comprehensive analysis of posit performance across benchmarks and applications.
Findings
16-bit posit matches FP32 accuracy on Cifar-10 CNN with 18% speedup
Posit arithmetic can outperform IEEE 754 FP32 in certain benchmarks
8-bit posits are unsuitable for ML due to low accuracy
Abstract
Motivated by the increasing interest in the posit numeric format, in this paper we evaluate the accuracy and efficiency of posit arithmetic in contrast to the traditional IEEE 754 32-bit floating-point (FP32) arithmetic. We first design and implement a Posit Arithmetic Unit (PAU), called POSAR, with flexible bit-sized arithmetic suitable for applications that can trade accuracy for savings in chip area. Next, we analyze the accuracy and efficiency of POSAR with a series of benchmarks including mathematical computations, ML kernels, NAS Parallel Benchmarks (NPB), and Cifar-10 CNN. This analysis is done on our implementation of POSAR integrated into a RISC-V Rocket Chip core in comparison with the IEEE 754-based Floting Point Unit (FPU) of Rocket Chip. Our analysis shows that POSAR can outperform the FPU, but the results are not spectacular. For NPB, 32-bit posit achieves better accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
