Mixed Precision Training of Convolutional Neural Networks using Integer   Operations

Dipankar Das; Naveen Mellempudi; Dheevatsa Mudigere; Dhiraj Kalamkar,; Sasikanth Avancha; Kunal Banerjee; Srinivas Sridharan; Karthik Vaidyanathan,; Bharat Kaul; Evangelos Georganas; Alexander Heinecke; Pradeep Dubey; Jesus; Corbal; Nikita Shustrov; Roma Dubtsov; Evarist Fomenko; Vadim Pirogov

arXiv:1802.00930·cs.NE·February 26, 2018·115 cites

Mixed Precision Training of Convolutional Neural Networks using Integer Operations

Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj Kalamkar,, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan,, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesus, Corbal, Nikita Shustrov, Roma Dubtsov, Evarist Fomenko

PDF

Open Access

TL;DR

This paper demonstrates that training state-of-the-art CNNs on ImageNet-1K using 16-bit integer operations on general purpose hardware can match or surpass FP32 accuracy with significantly improved training throughput.

Contribution

It introduces a novel integer training scheme with shared exponent representation and efficient convolution kernels, enabling SOTA CNN training with 16-bit integers on GP hardware.

Findings

01

Achieved or exceeded SOTA accuracy with INT16 training on ImageNet-1K.

02

Realized a 1.8X increase in training throughput over FP32.

03

First to report INT16 training results for SOTA CNNs on GP hardware.

Abstract

The state-of-the-art (SOTA) for mixed precision training is dominated by variants of low precision floating point operations, and in particular, FP16 accumulating into FP32 Micikevicius et al. (2017). On the other hand, while a lot of research has also happened in the domain of low and mixed-precision Integer training, these works either present results for non-SOTA networks (for instance only AlexNet for ImageNet-1K), or relatively small datasets (like CIFAR-10). In this work, we train state-of-the-art visual understanding neural networks on the ImageNet-1K dataset, with Integer operations on General Purpose (GP) hardware. In particular, we focus on Integer Fused-Multiply-and-Accumulate (FMA) operations which take two pairs of INT16 operands and accumulate results into an INT32 output.We propose a shared exponent representation of tensors and develop a Dynamic Fixed Point (DFP) scheme…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Tensor decomposition and applications · Multimodal Machine Learning Applications

Methods1x1 Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax · How do I speak to a person at Expedia?-/+/ · Convolution