TL;DR
This study evaluates how different quantization methods affect the robustness of YOLO object detection models to various real-world input degradations, providing insights into deployment challenges.
Contribution
It offers a comprehensive empirical analysis of quantization impacts on robustness and introduces a degradation-aware calibration strategy for INT8 PTQ.
Findings
Static INT8 TensorRT models are faster with moderate accuracy loss on clean data.
Degradation-aware calibration did not consistently improve robustness across models and degradations.
Larger models showed some robustness gains under specific noise conditions.
Abstract
Post-training quantization (PTQ) is crucial for deploying efficient object detection models, like YOLO, on resource-constrained devices. However, the impact of reduced precision on model robustness to real-world input degradations such as noise, blur, and compression artifacts is a significant concern. This paper presents a comprehensive empirical study evaluating the robustness of YOLO models (nano to extra-large scales) across multiple precision formats: FP32, FP16 (TensorRT), Dynamic UINT8 (ONNX), and Static INT8 (TensorRT). We introduce and evaluate a degradation-aware calibration strategy for Static INT8 PTQ, where the TensorRT calibration process is exposed to a mix of clean and synthetically degraded images. Models were benchmarked on the COCO dataset under seven distinct degradation conditions (including various types and levels of noise, blur, low contrast, and JPEG…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
