Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers
Rui Ding, Liang Yong, Sihuan Zhao, Jing Nie, Lihui Chen, Haijun Liu,, Xichuan Zhou

TL;DR
This paper introduces a progressive fine-to-coarse reconstruction method for post-training quantization of vision transformers, significantly improving low-bit quantization accuracy by leveraging a multi-level reconstruction strategy.
Contribution
It proposes a novel progressive reconstruction framework that iteratively combines fine-grained units into coarser blocks, enhancing low-bit PTQ performance for vision transformers.
Findings
Achieves 75.61% Top-1 accuracy for 3-bit ViT-B on ImageNet.
Outperforms state-of-the-art PTQ methods in low-bit settings.
Demonstrates effectiveness on object detection and segmentation tasks.
Abstract
Due to its efficiency, Post-Training Quantization (PTQ) has been widely adopted for compressing Vision Transformers (ViTs). However, when quantized into low-bit representations, there is often a significant performance drop compared to their full-precision counterparts. To address this issue, reconstruction methods have been incorporated into the PTQ framework to improve performance in low-bit quantization settings. Nevertheless, existing related methods predefine the reconstruction granularity and seldom explore the progressive relationships between different reconstruction granularities, which leads to sub-optimal quantization results in ViTs. To this end, in this paper, we propose a Progressive Fine-to-Coarse Reconstruction (PFCR) method for accurate PTQ, which significantly improves the performance of low-bit quantized vision transformers. Specifically, we define multi-head…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing · Image Processing Techniques and Applications
