A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth   Trade-Off

Yaniv Blumenfeld; Dar Gilboa; Daniel Soudry

arXiv:1906.00771·stat.ML·November 1, 2019·5 cites

A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off

Yaniv Blumenfeld, Dar Gilboa, Daniel Soudry

PDF

Open Access 1 Repo

TL;DR

This paper uses mean-field theory to analyze how quantization affects deep neural networks, deriving formulas for maximum trainable depth based on quantization levels, which informs efficient low-precision model design.

Contribution

It introduces a mean-field framework for quantized networks, deriving a closed-form equation for maximum trainable depth as a function of quantization levels.

Findings

01

Maximum trainable depth scales as N^{1.82} with quantization levels N.

02

Proposed initialization schemes improve signal propagation in quantized networks.

03

Theoretical insights guide the design of resource-efficient deep models.

Abstract

Reducing the precision of weights and activation functions in neural network training, with minimal impact on performance, is essential for the deployment of these models in resource-constrained environments. We apply mean-field techniques to networks with quantized activations in order to evaluate the degree to which quantization degrades signal propagation at initialization. We derive initialization schemes which maximize signal propagation in such networks and suggest why this is helpful for generalization. Building on these results, we obtain a closed form implicit equation for $L_{m a x}$ , the maximal trainable depth (and hence model capacity), given $N$ , the number of quantization levels in the activation function. Solving this equation numerically, we obtain asymptotically: $L_{m a x} \propto N^{1.82}$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanivbl6/quantized_meanfield
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Sparse and Compressive Sensing Techniques