Resiliency of Deep Neural Networks under Quantization
Wonyong Sung, Sungho Shin, Kyuyeon Hwang

TL;DR
This paper investigates how retraining affects the resilience of deep neural networks to quantization, showing that highly complex networks can effectively absorb quantization effects, while simpler, connection-limited networks are less resilient.
Contribution
It analyzes the impact of network complexity on quantization resilience and introduces an effective compression ratio for hardware-resource-limited scenarios.
Findings
Highly complex DNNs can absorb quantization effects through retraining.
Performance gap between floating-point and quantized networks diminishes in complex networks.
Connection-limited networks are less resilient to quantization.
Abstract
The complexity of deep neural network algorithms for hardware implementation can be much lowered by optimizing the word-length of weights and signals. Direct quantization of floating-point weights, however, does not show good performance when the number of bits assigned is small. Retraining of quantized networks has been developed to relieve this problem. In this work, the effects of retraining are analyzed for a feedforward deep neural network (FFDNN) and a convolutional neural network (CNN). The network complexity is controlled to know their effects on the resiliency of quantized networks by retraining. The complexity of the FFDNN is controlled by varying the unit size in each hidden layer and the number of layers, while that of the CNN is done by modifying the feature map configuration. We find that the performance gap between the floating-point and the retrain-based ternary (+1, 0,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Adversarial Robustness in Machine Learning
