Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features
Krishna Kanth Nakka, Mathieu Salzmann

TL;DR
This paper proposes an attention-based regularization method to enhance the robustness of fine-grained recognition models against adversarial attacks by maximizing the separation of class-specific features.
Contribution
It introduces a novel regularization mechanism that improves adversarial robustness in fine-grained recognition without needing adversarial training data.
Findings
Significantly improves robustness to adversarial attacks.
Matches or surpasses adversarial training effectiveness.
Reduces the proximity of class representations in latent space.
Abstract
Adversarial attacks have been widely studied for general classification tasks, but remain unexplored in the context of fine-grained recognition, where the inter-class similarities facilitate the attacker's task. In this paper, we identify the proximity of the latent representations of different classes in fine-grained recognition networks as a key factor to the success of adversarial attacks. We therefore introduce an attention-based regularization mechanism that maximally separates the discriminative latent features of different classes while minimizing the contribution of the non-discriminative regions to the final class prediction. As evidenced by our experiments, this allows us to significantly improve robustness to adversarial attacks, to the point of matching or even surpassing that of adversarial training, but without requiring access to adversarial samples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
