AdaLog: Post-Training Quantization for Vision Transformers with Adaptive   Logarithm Quantizer

Zhuguanyu Wu; Jiaxin Chen; Hanwen Zhong; Di Huang; Yunhong Wang

arXiv:2407.12951·cs.CV·July 19, 2024

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong, Di Huang, Yunhong Wang

PDF

Open Access 1 Repo

TL;DR

AdaLog introduces an adaptive logarithmic quantizer for Vision Transformers that optimizes activation quantization, significantly improving efficiency and accuracy across multiple vision tasks with hardware-friendly implementation.

Contribution

The paper proposes AdaLog, a novel adaptive logarithm quantizer with a fast search strategy, tailored for post-training quantization of ViT activations, addressing distribution challenges and hardware constraints.

Findings

01

Effective quantization of post-Softmax and post-GELU activations.

02

Improved accuracy and efficiency on various ViT architectures.

03

Versatile performance across classification, detection, and segmentation tasks.

Abstract

Vision Transformer (ViT) has become one of the most prevailing fundamental backbone networks in the computer vision community. Despite the high accuracy, deploying it in real applications raises critical challenges including the high computational cost and inference latency. Recently, the post-training quantization (PTQ) technique has emerged as a promising way to enhance ViT's efficiency. Nevertheless, existing PTQ approaches for ViT suffer from the inflexible quantization on the post-Softmax and post-GELU activations that obey the power-law-like distributions. To address these issues, we propose a novel non-uniform quantizer, dubbed the Adaptive Logarithm AdaLog (AdaLog) quantizer. It optimizes the logarithmic base to accommodate the power-law-like distribution of activations, while simultaneously allowing for hardware-friendly quantization and de-quantization. By employing the bias…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GoatWu/AdaLog
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies

MethodsResidual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Adam · Balanced Selection · Dropout · Multi-Head Attention · Dense Connections