ConFoc: Content-Focus Protection Against Trojan Attacks on Neural   Networks

Miguel Villarreal-Vasquez; Bharat Bhargava

arXiv:2007.00711·cs.CV·July 3, 2020·23 cites

ConFoc: Content-Focus Protection Against Trojan Attacks on Neural Networks

Miguel Villarreal-Vasquez, Bharat Bhargava

PDF

Open Access 1 Repo

TL;DR

This paper introduces ConFoc, a defense method that teaches neural networks to ignore style features and focus on content, effectively reducing Trojan attack success rates in vision models.

Contribution

The paper presents a novel style-disregarding training approach that enhances neural network robustness against Trojan attacks in image classification.

Findings

01

Reduces attack success rate to below 1% across tested scenarios.

02

Maintains or improves accuracy on benign data.

03

Effective in traffic sign and face recognition applications.

Abstract

Deep Neural Networks (DNNs) have been applied successfully in computer vision. However, their wide adoption in image-related applications is threatened by their vulnerability to trojan attacks. These attacks insert some misbehavior at training using samples with a mark or trigger, which is exploited at inference or testing time. In this work, we analyze the composition of the features learned by DNNs at training. We identify that they, including those related to the inserted triggers, contain both content (semantic information) and style (texture information), which are recognized as a whole by DNNs at testing time. We then propose a novel defensive technique against trojan attacks, in which DNNs are taught to disregard the styles of inputs and focus on their content only to mitigate the effect of triggers during the classification. The generic applicability of the approach is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mvillarreal14/confoc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications