TL;DR
NITI introduces an integer-only neural network training framework that stores and computes entirely with integers, enabling efficient training with minimal accuracy loss on standard datasets.
Contribution
It presents a novel integer-only training method with a pseudo stochastic rounding scheme and integer backpropagation, reducing reliance on floating point arithmetic.
Findings
Achieves comparable accuracy to floating point training on MNIST and CIFAR10.
Uses 8-bit integers for storage and computation, with 16-bit for weight accumulation on ImageNet.
Demonstrates end-to-end training with native GPU integer operations.
Abstract
While integer arithmetic has been widely adopted for improved performance in deep quantized neural network inference, training remains a task primarily executed using floating point arithmetic. This is because both high dynamic range and numerical accuracy are central to the success of most modern training algorithms. However, due to its potential for computational, storage and energy advantages in hardware accelerators, neural network training methods that can be implemented with low precision integer-only arithmetic remains an active research challenge. In this paper, we present NITI, an efficient deep neural network training framework that stores all parameters and intermediate values as integers, and computes exclusively with integer arithmetic. A pseudo stochastic rounding scheme that eliminates the need for external random number generation is proposed to facilitate conversion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
