Dormant Neural Trojans

Feisi Fu; Panagiota Kiourti; Wenchao Li

arXiv:2211.01808·cs.CR·November 4, 2022

Dormant Neural Trojans

Feisi Fu, Panagiota Kiourti, Wenchao Li

PDF

Open Access

TL;DR

This paper introduces a new type of neural network backdoor, called dormant Trojans, which remain inactive until triggered by a specific weight perturbation, making them harder to detect.

Contribution

The paper proposes a novel dormant Trojan methodology that activates via weight perturbation, evading current detection techniques.

Findings

01

Dormant Trojans can be effectively activated with specific weight perturbations.

02

Dormant Trojans evade detection by state-of-the-art methods.

03

The approach demonstrates high attack success rates in experiments.

Abstract

We present a novel methodology for neural network backdoor attacks. Unlike existing training-time attacks where the Trojaned network would respond to the Trojan trigger after training, our approach inserts a Trojan that will remain dormant until it is activated. The activation is realized through a specific perturbation to the network's weight parameters only known to the attacker. Our analysis and the experimental results demonstrate that dormant Trojaned networks can effectively evade detection by state-of-the-art backdoor detection methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Neural Network Applications