Training Deep Neural Networks Using Posit Number System

Jinming Lu; Siyuan Lu; Zhisheng Wang; Chao Fang; Jun Lin; Zhongfeng; Wang; and Li Du

arXiv:1909.03831·cs.LG·September 10, 2019·6 cites

Training Deep Neural Networks Using Posit Number System

Jinming Lu, Siyuan Lu, Zhisheng Wang, Chao Fang, Jun Lin, Zhongfeng, Wang, and Li Du

PDF

Open Access

TL;DR

This paper introduces a novel training methodology for deep neural networks using the posit number system, achieving efficient 16-bit training on ImageNet without accuracy loss and proposing hardware for energy-efficient posit operations.

Contribution

It presents the first successful DNN training on ImageNet with 16-bit posit and develops a hardware architecture for efficient posit multiply-accumulate operations.

Findings

01

Successful 16-bit posit training on ImageNet without accuracy loss

02

Significant energy efficiency improvements in hardware implementation

03

Effective stabilization strategies for posit-based DNN training

Abstract

With the increasing size of Deep Neural Network (DNN) models, the high memory space requirements and computational complexity have become an obstacle for efficient DNN implementations. To ease this problem, using reduced-precision representations for DNN training and inference has attracted many interests from researchers. This paper first proposes a methodology for training DNNs with the posit arithmetic, a type- 3 universal number (Unum) format that is similar to the floating point(FP) but has reduced precision. A warm-up training strategy and layer-wise scaling factors are adopted to stabilize training and fit the dynamic range of DNN parameters. With the proposed training methodology, we demonstrate the first successful training of DNN models on ImageNet image classification task in 16 bits posit with no accuracy loss. Then, an efficient hardware architecture for the posit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical Methods and Algorithms · Advanced Neural Network Applications · Model Reduction and Neural Networks