QNNRepair: Quantized Neural Network Repair
Xidan Song, Youcheng Sun, Mustafa A. Mustafa, and Lucas C. Cordeiro

TL;DR
QNNRepair is a novel method that repairs quantized neural networks by localizing faulty neurons and adjusting their weights through linear programming, significantly improving accuracy especially on high-resolution image datasets.
Contribution
It introduces the first approach for repairing quantized neural networks using fault localization and linear programming, enhancing post-quantization accuracy.
Findings
QNNRepair improves accuracy by 24% over SQuant on ImageNet.
Effective across architectures like MobileNetV2, ResNet, VGGNet.
Outperforms state-of-the-art data-free quantization methods.
Abstract
We present QNNRepair, the first method in the literature for repairing quantized neural networks (QNNs). QNNRepair aims to improve the accuracy of a neural network model after quantization. It accepts the full-precision and weight-quantized neural networks and a repair dataset of passing and failing tests. At first, QNNRepair applies a software fault localization method to identify the neurons that cause performance degradation during neural network quantization. Then, it formulates the repair problem into a linear programming problem of solving neuron weights parameters, which corrects the QNN's performance on failing tests while not compromising its performance on passing tests. We evaluate QNNRepair with widely used neural network architectures such as MobileNetV2, ResNet, and VGGNet on popular datasets, including high-resolution images. We also compare QNNRepair with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Repair · Pointwise Convolution · Depthwise Convolution · Batch Normalization · Average Pooling · Depthwise Separable Convolution · Residual Block · 1x1 Convolution · Inverted Residual Block
