DivQAT: Enhancing Robustness of Quantized Convolutional Neural Networks against Model Extraction Attacks

Kacem Khaled; Felipe Gohring de Magalh\~aes; Gabriela Nicolescu

arXiv:2512.23948·cs.LG·January 1, 2026

DivQAT: Enhancing Robustness of Quantized Convolutional Neural Networks against Model Extraction Attacks

Kacem Khaled, Felipe Gohring de Magalh\~aes, Gabriela Nicolescu

PDF

Open Access

TL;DR

DivQAT introduces a novel quantization-aware training method that enhances the robustness of quantized CNNs against model extraction attacks, integrating defense mechanisms directly into the training process for improved security without sacrificing accuracy.

Contribution

This paper presents the first quantization process modification to incorporate model extraction defenses during training, improving robustness of quantized CNNs.

Findings

01

Effective defense against extraction attacks demonstrated on benchmark datasets.

02

Maintains model accuracy while enhancing robustness.

03

Combining DivQAT with other defenses yields superior protection.

Abstract

Convolutional Neural Networks (CNNs) and their quantized counterparts are vulnerable to extraction attacks, posing a significant threat of IP theft. Yet, the robustness of quantized models against these attacks is little studied compared to large models. Previous defenses propose to inject calculated noise into the prediction probabilities. However, these defenses are limited since they are not incorporated during the model design and are only added as an afterthought after training. Additionally, most defense techniques are computationally expensive and often have unrealistic assumptions about the victim model that are not feasible in edge device implementations and do not apply to quantized models. In this paper, we propose DivQAT, a novel algorithm to train quantized CNNs based on Quantization Aware Training (QAT) aiming to enhance their robustness against extraction attacks. To the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security