PositNN: Training Deep Neural Networks with Mixed Low-Precision Posit

Gon\c{c}alo Raposo; Pedro Tom\'as; Nuno Roma

arXiv:2105.00053·cs.LG·May 17, 2021

PositNN: Training Deep Neural Networks with Mixed Low-Precision Posit

Gon\c{c}alo Raposo, Pedro Tom\'as, Nuno Roma

PDF

1 Repo

TL;DR

This paper explores the use of low-precision posit formats for training deep neural networks, demonstrating that 8-bit posits can replace 32-bit floats without loss of accuracy.

Contribution

It introduces a software framework for training neural networks with mixed low-precision posits, showing their viability as an alternative to traditional floating-point formats.

Findings

01

8-bit posits can replace 32-bit floats in training without accuracy loss

02

Mixed precision training with posits is feasible and effective

03

Posit format reduces memory and hardware resource requirements

Abstract

Low-precision formats have proven to be an efficient way to reduce not only the memory footprint but also the hardware resources and power consumption of deep learning computations. Under this premise, the posit numerical format appears to be a highly viable substitute for the IEEE floating-point, but its application to neural networks training still requires further research. Some preliminary results have shown that 8-bit (and even smaller) posits may be used for inference and 16-bit for training, while maintaining the model accuracy. The presented research aims to evaluate the feasibility to train deep convolutional neural networks using posits. For such purpose, a software framework was developed to use simulated posits and quires in end-to-end training and inference. This implementation allows using any bit size, configuration, and even mixed precision, suitable for different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hpc-ulisboa/posit-neuralnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.