Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Byeongwook Kim, Dongsoo Lee, Yeonju Ro, Yongkweon Jeon, Se Jung Kwon,, Baeseong Park, Daehwan Oh

TL;DR
This paper introduces Q-Rater, a non-convex optimization-based post-training quantization method that improves model accuracy in low-bit scenarios by optimizing hyper-parameters through task loss monitoring.
Contribution
It proposes a novel non-convex optimization approach for post-training quantization, leveraging hyper-parameter exploration and incremental optimization for better low-bit model accuracy.
Findings
Achieves higher accuracy in low-bit quantization.
Effective hyper-parameter tuning via task loss monitoring.
Demonstrates superiority over convex methods in experiments.
Abstract
Various post-training uniform quantization methods have usually been studied based on convex optimization. As a result, most previous ones rely on the quantization error minimization and/or quadratic approximations. Such approaches are computationally efficient and reasonable when a large number of quantization bits are employed. When the number of quantization bits is relatively low, however, non-convex optimization is unavoidable to improve model accuracy. In this paper, we propose a new post-training uniform quantization technique considering non-convexity. We empirically show that hyper-parameters for clipping and rounding of weights and activations can be explored by monitoring task loss. Then, an optimally searched set of hyper-parameters is frozen to proceed to the next layer such that an incremental non-convex optimization is enabled for post-training quantization. Throughout…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · CCD and CMOS Imaging Sensors · Image Enhancement Techniques
