Guiding the retraining of convolutional neural networks against adversarial inputs
Francisco Dur\'an L\'opez, Silverio Mart\'inez-Fern\'andez, Michael, Felderer, Xavier Franch

TL;DR
This study evaluates guidance metrics and configurations for retraining CNNs to improve robustness against adversarial inputs, aiming for better accuracy and resource efficiency in image classification tasks.
Contribution
It introduces an empirical comparison of four guidance metrics and three retraining configurations for CNNs against adversarial inputs, providing practical recommendations.
Findings
Surprise adequacy metrics with original weights yield the best results.
Retraining with adversarial inputs and surprise adequacy improves model robustness.
Using surprise adequacy metrics reduces the number of inputs needed for effective retraining.
Abstract
Background: When using deep learning models, there are many possible vulnerabilities and some of the most worrying are the adversarial inputs, which can cause wrong decisions with minor perturbations. Therefore, it becomes necessary to retrain these models against adversarial inputs, as part of the software testing process addressing the vulnerability to these inputs. Furthermore, for an energy efficient testing and retraining, data scientists need support on which are the best guidance metrics and optimal dataset configurations. Aims: We examined four guidance metrics for retraining convolutional neural networks and three retraining configurations. Our goal is to improve the models against adversarial inputs regarding accuracy, resource utilization and time from the point of view of a data scientist in the context of image classification. Method: We conducted an empirical study in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
