Mixed Non-linear Quantization for Vision Transformers

Gihwan Kim; Jemin Lee; Sihyeong Park; Yongin Kwon; Hyungshin Kim

arXiv:2407.18437·cs.CV·June 5, 2025

Mixed Non-linear Quantization for Vision Transformers

Gihwan Kim, Jemin Lee, Sihyeong Park, Yongin Kwon, Hyungshin Kim

PDF

Open Access 1 Repo

TL;DR

This paper introduces a mixed non-linear quantization approach for Vision Transformers that assigns different quantization methods to each non-linear operation based on layer-wise sensitivity, improving model compression without significant accuracy loss.

Contribution

It proposes a novel mixed non-linear quantization method that optimally assigns quantization techniques to each non-linear layer based on sensitivity, outperforming existing methods.

Findings

01

Outperforms I-BERT, FQ-ViT, and I-ViT in 8-bit and 6-bit settings.

02

Achieves an average of 0.6%p and 19.6%p improvement.

03

Maintains accuracy with limited training time.

Abstract

The majority of quantization methods have been proposed to reduce the model size of Vision Transformers, yet most of them have overlooked the quantization of non-linear operations. Only a few works have addressed quantization for non-linear operations, but they applied a single quantization method across all non-linear operations. We believe that this can be further improved by employing a different quantization method for each non-linear operation. Therefore, to assign the most error-minimizing quantization method from the known methods to each non-linear layer, we propose a mixed non-linear quantization that considers layer-wise quantization sensitivity measured by SQNR difference metric. The results show that our method outperforms I-BERT, FQ-ViT, and I-ViT in both 8-bit and 6-bit settings for ViT, DeiT, and Swin models by an average of 0.6%p and 19.6%p, respectively. Our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/ones-ai/mixed-non-linear-quantization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies · Neural Networks and Applications

MethodsAttention Is All You Need · Dense Connections · Feedforward Network · Linear Layer · Softmax · Attention Dropout · I-BERT · Multi-Head Attention · Dropout · Data-efficient Image Transformer