PTQ4RIS: Post-Training Quantization for Referring Image Segmentation

Xiaoyan Jiang; Hang Yang; Kaiying Zhu; Xihe Qiu; Shibo Zhao; Sifan; Zhou

arXiv:2409.17020·cs.CV·February 19, 2025

PTQ4RIS: Post-Training Quantization for Referring Image Segmentation

Xiaoyan Jiang, Hang Yang, Kaiying Zhu, Xihe Qiu, Shibo Zhao, Sifan, Zhou

PDF

Open Access 1 Repo

TL;DR

PTQ4RIS introduces a novel post-training quantization framework tailored for referring image segmentation, enabling efficient on-device inference without significant performance loss.

Contribution

This work is the first to develop a PTQ method specifically for RIS, addressing quantization challenges in visual and linguistic encoders with novel techniques.

Findings

01

Achieves superior performance across multiple benchmarks

02

Supports quantization from 8 to 4 bits with minimal accuracy loss

03

Demonstrates feasibility of PTQ for RIS applications

Abstract

Referring Image Segmentation (RIS), aims to segment the object referred by a given sentence in an image by understanding both visual and linguistic information. However, existing RIS methods tend to explore top-performance models, disregarding considerations for practical applications on resources-limited edge devices. This oversight poses a significant challenge for on-device RIS inference. To this end, we propose an effective and efficient post-training quantization framework termed PTQ4RIS. Specifically, we first conduct an in-depth analysis of the root causes of performance degradation in RIS model quantization and propose dual-region quantization (DRQ) and reorder-based outlier-retained quantization (RORQ) to address the quantization difficulties in visual and text encoders. Extensive experiments on three benchmarks with different bits settings (from 8 to 4 bits) demonstrates its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gugu511yy/ptq4ris
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection