Scientific Calculator for Designing Trojan Detectors in Neural Networks
Peter Bajcsy, Nicholas J. Schaub, Michael Majurski

TL;DR
This paper introduces a web-based neural network calculator and a novel inefficiency measurement using modified KL divergence to detect trojans embedded in neural network models, enhancing interpretability and security.
Contribution
It presents a new interactive NN calculator and a KL divergence-based inefficiency metric for trojan detection in neural networks, with experimental validation.
Findings
KL divergence effectively detects trojans in NN models.
The calculator provides analytical and visualization tools for NN analysis.
Experimental results show robustness of the proposed method.
Abstract
This work presents a web-based interactive neural network (NN) calculator and a NN inefficiency measurement that has been investigated for the purpose of detecting trojans embedded in NN models. This NN Calculator is designed on top of TensorFlow Playground with in-memory storage of data and NN graphs plus coefficients. It is "like a scientific calculator" with analytical, visualization, and output operations performed on training datasets and NN architectures. The prototype is aaccessible at https://pages.nist.gov/nn-calculator. The analytical capabilities include a novel measurement of NN inefficiency using modified Kullback-Liebler (KL) divergence applied to histograms of NN model states, as well as a quantification of the sensitivity to variables related to data and NNs. Both NN Calculator and KL divergence are used to devise a trojan detector approach for a variety of trojan…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhysical Unclonable Functions (PUFs) and Hardware Security · Adversarial Robustness in Machine Learning · Neural Networks and Applications
