Resiliency of Deep Neural Networks under Quantization

Wonyong Sung; Sungho Shin; Kyuyeon Hwang

arXiv:1511.06488·cs.LG·January 8, 2016·146 cites

Resiliency of Deep Neural Networks under Quantization

Wonyong Sung, Sungho Shin, Kyuyeon Hwang

PDF

Open Access

TL;DR

This paper investigates how retraining affects the resilience of deep neural networks to quantization, showing that highly complex networks can effectively absorb quantization effects, while simpler, connection-limited networks are less resilient.

Contribution

It analyzes the impact of network complexity on quantization resilience and introduces an effective compression ratio for hardware-resource-limited scenarios.

Findings

01

Highly complex DNNs can absorb quantization effects through retraining.

02

Performance gap between floating-point and quantized networks diminishes in complex networks.

03

Connection-limited networks are less resilient to quantization.

Abstract

The complexity of deep neural network algorithms for hardware implementation can be much lowered by optimizing the word-length of weights and signals. Direct quantization of floating-point weights, however, does not show good performance when the number of bits assigned is small. Retraining of quantized networks has been developed to relieve this problem. In this work, the effects of retraining are analyzed for a feedforward deep neural network (FFDNN) and a convolutional neural network (CNN). The network complexity is controlled to know their effects on the resiliency of quantized networks by retraining. The complexity of the FFDNN is controlled by varying the unit size in each hidden layer and the number of layers, while that of the CNN is done by modifying the feature map configuration. We find that the performance gap between the floating-point and the retrain-based ternary (+1, 0,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Adversarial Robustness in Machine Learning