Enhancing Quantization-Aware Training on Edge Devices via Relative Entropy Coreset Selection and Cascaded Layer Correction
Yujia Tong, Jingling Yuan, Chuang Hu

TL;DR
This paper introduces QuaRC, a novel quantization-aware training framework for edge devices that uses relative entropy coreset selection and cascaded layer correction to improve model accuracy with minimal data.
Contribution
The paper proposes QuaRC, combining a new coreset selection method and layer correction strategy to enhance quantization-aware training on edge devices with limited data.
Findings
Achieves 5.72% higher Top-1 accuracy on ImageNet-1K with 2-bit ResNet-18 using only 1% data.
Effectively reduces quantization errors in intermediate layers.
Outperforms existing methods in low-data quantization scenarios.
Abstract
With the development of mobile and edge computing, the demand for low-bit quantized models on edge devices is increasing to achieve efficient deployment. To enhance the performance, it is often necessary to retrain the quantized models using edge data. However, due to privacy concerns, certain sensitive data can only be processed on edge devices. Therefore, employing Quantization-Aware Training (QAT) on edge devices has become an effective solution. Nevertheless, traditional QAT relies on the complete dataset for training, which incurs a huge computational cost. Coreset selection techniques can mitigate this issue by training on the most representative subsets. However, existing methods struggle to eliminate quantization errors in the model when using small-scale datasets (e.g., only 10% of the data), leading to significant performance degradation. To address these issues, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · CCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing
