Countering Adversarial Examples: Combining Input Transformation and Noisy Training
Cheng Zhang, Pan Gao

TL;DR
This paper proposes a novel defense method against adversarial examples in image classification by combining a modified JPEG compression technique with noisy training and model ensemble, improving robustness without sacrificing accuracy.
Contribution
It introduces a NN-favored JPEG quantization table and an ensemble approach with multiple compressed models for effective adversarial defense.
Findings
Enhanced robustness against adversarial attacks.
Maintained high classification accuracy on benign images.
Effective ensemble strategy improves defense performance.
Abstract
Recent studies have shown that neural network (NN) based image classifiers are highly vulnerable to adversarial examples, which poses a threat to security-sensitive image recognition task. Prior work has shown that JPEG compression can combat the drop in classification accuracy on adversarial examples to some extent. But, as the compression ratio increases, traditional JPEG compression is insufficient to defend those attacks but can cause an abrupt accuracy decline to the benign images. In this paper, with the aim of fully filtering the adversarial perturbations, we firstly make modifications to traditional JPEG compression algorithm which becomes more favorable for NN. Specifically, based on an analysis of the frequency coefficient, we design a NN-favored quantization table for compression. Considering compression as a data augmentation strategy, we then combine our model-agnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research
