DE-CROP: Data-efficient Certified Robustness for Pretrained Classifiers
Gaurav Kumar Nayak, Ruchit Rawal, Anirban Chakraborty

TL;DR
DE-CROP introduces a data-efficient method to certify the robustness of pretrained classifiers against adversarial attacks using limited training samples, by generating diverse class-boundary and interpolated samples for effective denoiser training.
Contribution
The paper proposes a novel approach to certify robustness of pretrained models with few training samples, utilizing class-boundary and interpolated sample generation and distribution matching techniques.
Findings
Significant robustness certification improvements on benchmark datasets.
Effective in both white box and black box attack scenarios.
Outperforms existing methods with limited training data.
Abstract
Certified defense using randomized smoothing is a popular technique to provide robustness guarantees for deep neural networks against l2 adversarial attacks. Existing works use this technique to provably secure a pretrained non-robust model by training a custom denoiser network on entire training data. However, access to the training set may be restricted to a handful of data samples due to constraints such as high transmission cost and the proprietary nature of the data. Thus, we formulate a novel problem of "how to certify the robustness of pretrained models using only a few training samples". We observe that training the custom denoiser directly using the existing techniques on limited samples yields poor certification. To overcome this, our proposed approach (DE-CROP) generates class-boundary and interpolated samples corresponding to each training sample, ensuring high diversity in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
DE-CROP: Data-efficient Certified Robustness for Pretrained Classifiers· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsRandomized Smoothing
