Jumping through Local Minima: Quantization in the Loss Landscape of Vision Transformers
Natalia Frumkin, Dibakar Gope, and Diana Marculescu

TL;DR
This paper introduces Evol-Q, an evolutionary search method combined with infoNCE loss to effectively optimize quantization in vision transformers, overcoming non-smooth loss landscapes where gradient methods fail, leading to significant accuracy improvements.
Contribution
Evol-Q is a novel approach that uses evolutionary search and infoNCE loss to optimize quantization scales in non-smooth loss landscapes, outperforming gradient-based methods.
Findings
Evol-Q boosts top-1 accuracy of ViT-Base by 10.30% at 3-bit quantization.
It maintains robustness across CNN and ViT architectures.
It effectively navigates non-smooth loss landscapes where gradient methods struggle.
Abstract
Quantization scale and bit-width are the most important parameters when considering how to quantize a neural network. Prior work focuses on optimizing quantization scales in a global manner through gradient methods (gradient descent \& Hessian analysis). Yet, when applying perturbations to quantization scales, we observe a very jagged, highly non-smooth test loss landscape. In fact, small perturbations in quantization scale can greatly affect accuracy, yielding a accuracy boost in 4-bit quantized vision transformers (ViTs). In this regime, gradient methods break down, since they cannot reliably reach local minima. In our work, dubbed Evol-Q, we use evolutionary search to effectively traverse the non-smooth landscape. Additionally, we propose using an infoNCE loss, which not only helps combat overfitting on the small calibration dataset ( images) but also makes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Image and Video Retrieval Techniques
MethodsInfoNCE
