Neural Trojans

Yuntao Liu; Yang Xie; and Ankur Srivastava

arXiv:1710.00942·cs.CR·August 19, 2025·1 cites

Neural Trojans

Yuntao Liu, Yang Xie, and Ankur Srivastava

PDF

Open Access

TL;DR

This paper investigates neural Trojans embedded in pre-trained neural network IPs, demonstrating their effectiveness and proposing three mitigation techniques—input anomaly detection, re-training, and input preprocessing—with high success rates.

Contribution

It introduces the concept of neural Trojans in neural network IPs and evaluates three mitigation strategies, providing practical solutions for security threats.

Findings

01

Input anomaly detection detects 99.8% of Trojan triggers

02

Re-training prevents 94.1% of Trojan triggers

03

Input preprocessing renders 90.2% of Trojan triggers ineffective

Abstract

While neural networks demonstrate stronger capabilities in pattern recognition nowadays, they are also becoming larger and deeper. As a result, the effort needed to train a network also increases dramatically. In many cases, it is more practical to use a neural network intellectual property (IP) that an IP vendor has already trained. As we do not know about the training process, there can be security threats in the neural IP: the IP vendor (attacker) may embed hidden malicious functionality, i.e. neural Trojans, into the neural IP. We show that this is an effective attack and provide three mitigation techniques: input anomaly detection, re-training, and input preprocessing. All the techniques are proven effective. The input anomaly detection approach is able to detect 99.8% of Trojan triggers although with 12.2% false positive. The re-training approach is able to prevent 94.1% of Trojan…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutism Spectrum Disorder Research · Law, AI, and Intellectual Property