Robustness and invariance properties of image classifiers
Apostolos Modas

TL;DR
This paper investigates the robustness and invariance properties of deep image classifiers, analyzing adversarial perturbations, feature invariance, and proposing methods to improve robustness against data corruptions.
Contribution
It introduces a geometric framework linking data features to decision boundaries and proposes a novel data augmentation scheme for enhanced robustness.
Findings
Fast computation of sparse adversarial perturbations
Deep classifiers are biased towards invariance to non-discriminative features
Proposed data augmentation achieves state-of-the-art robustness to corruptions
Abstract
Deep neural networks have achieved impressive results in many image classification tasks. However, since their performance is usually measured in controlled settings, it is important to ensure that their decisions remain correct when deployed in noisy environments. In fact, deep networks are not robust to a large variety of semantic-preserving image modifications, even to imperceptible image changes known as adversarial perturbations. The poor robustness of image classifiers to small data distribution shifts raises serious concerns regarding their trustworthiness. To build reliable machine learning models, we must design principled methods to analyze and understand the mechanisms that shape robustness and invariance. This is exactly the focus of this thesis. First, we study the problem of computing sparse adversarial perturbations. We exploit the geometry of the decision boundaries of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
