IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers

Gihwan Kim; Jemin Lee; Hyungshin Kim

arXiv:2511.15369·cs.CV·November 20, 2025

IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers

Gihwan Kim, Jemin Lee, Hyungshin Kim

PDF

Open Access

TL;DR

IPTQ-ViT introduces a fully integer-only post-training quantization framework for vision transformers, utilizing novel approximation functions and a unified metric to improve accuracy without retraining, suitable for resource-limited environments.

Contribution

The paper proposes a new PTQ method for vision transformers that achieves fully integer-only inference without retraining, using optimized approximation functions and a unified selection metric.

Findings

01

Achieves up to 6.44% accuracy improvement over previous PTQ methods.

02

Outperforms partial floating-point PTQ methods under W8A8 and W4A8.

03

Comparable accuracy and latency to integer-only QAT methods.

Abstract

Previous Quantization-Aware Training (QAT) methods for vision transformers rely on expensive retraining to recover accuracy loss in non-linear layer quantization, limiting their use in resource-constrained environments. In contrast, existing Post-Training Quantization (PTQ) methods either partially quantize non-linear functions or adjust activation distributions to maintain accuracy but fail to achieve fully integer-only inference. In this paper, we introduce IPTQ-ViT, a novel PTQ framework for fully integer-only vision transformers without retraining. We present approximation functions: a polynomial-based GELU optimized for vision data and a bit-shifting-based Softmax designed to improve approximation accuracy in PTQ. In addition, we propose a unified metric integrating quantization sensitivity, perturbation, and computational cost to select the optimal approximation function per…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies