AIQViT: Architecture-Informed Post-Training Quantization for Vision   Transformers

Runqing Jiang; Ye Zhang; Longguang Wang; Pengpeng Yu; Yulan Guo

arXiv:2502.04628·cs.CV·February 10, 2025

AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers

Runqing Jiang, Ye Zhang, Longguang Wang, Pengpeng Yu, Yulan Guo

PDF

Open Access 1 Video

TL;DR

AIQViT introduces a novel post-training quantization method for vision transformers that reduces information loss and improves performance by using architecture-informed compensation and dynamic focusing quantization.

Contribution

The paper proposes AIQViT, a new PTQ approach that incorporates low-rank compensation and dynamic quantization for better ViT performance.

Findings

01

Outperforms state-of-the-art PTQ methods on five vision tasks.

02

Effectively reduces performance degradation in low-bit quantization.

03

Enhances quantization of post-Softmax activations with dynamic focus.

Abstract

Post-training quantization (PTQ) has emerged as a promising solution for reducing the storage and computational cost of vision transformers (ViTs). Recent advances primarily target at crafting quantizers to deal with peculiar activations characterized by ViTs. However, most existing methods underestimate the information loss incurred by weight quantization, resulting in significant performance deterioration, particularly in low-bit cases. Furthermore, a common practice in quantizing post-Softmax activations of ViTs is to employ logarithmic transformations, which unfortunately prioritize less informative values around zero. This approach introduces additional redundancies, ultimately leading to suboptimal quantization efficacy. To handle these, this paper proposes an innovative PTQ method tailored for ViTs, termed AIQViT (Architecture-Informed Post-training Quantization for ViTs). First,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers· underline

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Advanced Neural Network Applications