Quantized neural network design under weight capacity constraint

Sungho Shin; Kyuyeon Hwang; and Wonyong Sung

arXiv:1611.06342·cs.LG·November 22, 2016·1 cites

Quantized neural network design under weight capacity constraint

Sungho Shin, Kyuyeon Hwang, and Wonyong Sung

PDF

Open Access

TL;DR

This paper evaluates the trade-offs between network size scaling and weight quantization in neural networks for hardware efficiency, introducing the effective compression ratio to optimize resource use.

Contribution

It provides an analysis of neural network performance under different complexity and weight precision constraints, proposing the effective compression ratio as a new metric.

Findings

01

Quantization impacts neural network performance significantly.

02

Network size scaling can be more effective than weight quantization under certain conditions.

03

The effective compression ratio guides hardware-efficient neural network design.

Abstract

The complexity of deep neural network algorithms for hardware implementation can be lowered either by scaling the number of units or reducing the word-length of weights. Both approaches, however, can accompany the performance degradation although many types of research are conducted to relieve this problem. Thus, it is an important question which one, between the network size scaling and the weight quantization, is more effective for hardware optimization. For this study, the performances of fully-connected deep neural networks (FCDNNs) and convolutional neural networks (CNNs) are evaluated while changing the network complexity and the word-length of weights. Based on these experiments, we present the effective compression ratio (ECR) to guide the trade-off between the network size and the precision of weights when the hardware resource is limited.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and ELM