Guiding the retraining of convolutional neural networks against   adversarial inputs

Francisco Dur\'an L\'opez; Silverio Mart\'inez-Fern\'andez; Michael; Felderer; Xavier Franch

arXiv:2207.03689·cs.SE·January 12, 2024

Guiding the retraining of convolutional neural networks against adversarial inputs

Francisco Dur\'an L\'opez, Silverio Mart\'inez-Fern\'andez, Michael, Felderer, Xavier Franch

PDF

Open Access

TL;DR

This study evaluates guidance metrics and configurations for retraining CNNs to improve robustness against adversarial inputs, aiming for better accuracy and resource efficiency in image classification tasks.

Contribution

It introduces an empirical comparison of four guidance metrics and three retraining configurations for CNNs against adversarial inputs, providing practical recommendations.

Findings

01

Surprise adequacy metrics with original weights yield the best results.

02

Retraining with adversarial inputs and surprise adequacy improves model robustness.

03

Using surprise adequacy metrics reduces the number of inputs needed for effective retraining.

Abstract

Background: When using deep learning models, there are many possible vulnerabilities and some of the most worrying are the adversarial inputs, which can cause wrong decisions with minor perturbations. Therefore, it becomes necessary to retrain these models against adversarial inputs, as part of the software testing process addressing the vulnerability to these inputs. Furthermore, for an energy efficient testing and retraining, data scientists need support on which are the best guidance metrics and optimal dataset configurations. Aims: We examined four guidance metrics for retraining convolutional neural networks and three retraining configurations. Our goal is to improve the models against adversarial inputs regarding accuracy, resource utilization and time from the point of view of a data scientist in the context of image classification. Method: We conducted an empirical study in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning