Quantization in Layer's Input is Matter

Daning Cheng; WenGuang Chen

arXiv:2202.05137 · cs.LG · February 11, 2022

TL;DR

This paper argues that quantizing a layer's input affects the loss function more than quantizing its parameters, and proposes an input-based quantization algorithm that it reports outperforms Hessian-based mixed-precision methods.

Contribution

The paper introduces an algorithm based on each layer's input quantization error and reports that it outperforms Hessian-based mixed-precision layout algorithms for neural network quantization.

Findings

01

Quantizing a layer's input affects the loss function more than quantizing its parameters does.

02

The proposed algorithm outperforms Hessian-based mixed-precision layout algorithms.

03

Basing the algorithm on the layer's input quantization error yields better results than the Hessian-based approach.

Abstract

In this paper, we show that quantization of a layer's input matters more to the loss function than quantization of its parameters, and that an algorithm based on the layer's input quantization error performs better than a Hessian-based mixed-precision layout algorithm.
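
To make the comparison in the abstract concrete, here is a minimal sketch of how one might measure the effect of quantizing only the input versus only the parameters of a single toy linear unit. This is not the authors' algorithm. The uniform symmetric quantizer, the helper names `quantize`, `dot`, and `output_error`, the random data, and the use of output perturbation as a stand-in for the loss function are all assumptions made for illustration.

```python
import random


def quantize(values, bits):
    """Uniform symmetric quantization of a list of floats (assumes bits >= 2)."""
    levels = 2 ** (bits - 1) - 1            # e.g. 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return list(values)
    scale = max_abs / levels                # width of one quantization step
    return [round(v / scale) * scale for v in values]


def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(p * q for p, q in zip(a, b))


def output_error(weights, inputs, bits, target):
    """Mean squared change in y = w . x when only `target` is quantized.

    `target` is "input" or "weight" (hypothetical labels for this sketch).
    """
    if target not in ("input", "weight"):
        raise ValueError("target must be 'input' or 'weight'")
    w_q = quantize(weights, bits) if target == "weight" else weights
    total = 0.0
    for x in inputs:
        x_q = quantize(x, bits) if target == "input" else x
        total += (dot(w_q, x_q) - dot(weights, x)) ** 2
    return total / len(inputs)


# Toy data: one linear unit with 32 inputs, evaluated on 64 random samples.
rng = random.Random(0)
weights = [rng.gauss(0.0, 1.0) for _ in range(32)]
inputs = [[rng.gauss(0.0, 1.0) for _ in range(32)] for _ in range(64)]

for bits in (4, 8):
    err_in = output_error(weights, inputs, bits, "input")
    err_w = output_error(weights, inputs, bits, "weight")
    print(f"{bits}-bit  input-only error: {err_in:.6f}  weight-only error: {err_w:.6f}")
```

On random data like this, the two errors say nothing about which side dominates in a trained network. The sketch only shows how the two perturbations can be isolated and compared layer by layer.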

Taxonomy

Topics: Advanced Algorithms and Applications · Advanced Sensor and Control Systems · Advanced Computational Techniques and Applications