An AI Architecture with the Capability to Classify and Explain Hardware Trojans

Paul Whitten; Francis Wolff; Chris Papachristou

arXiv:2407.04551 · cs.CR · October 1, 2024

Open Access

TL;DR

This paper presents an explainable AI architecture for hardware Trojan detection that not only classifies suspected circuits but also provides explanations for its decisions, enhancing transparency in digital hardware security.

Contribution

It introduces a novel explainable methodology and architecture for hardware Trojan detection based on existing detection features, with results demonstrating its effectiveness on trust-hub benchmarks.

Findings

01. Effective explanation of hardware Trojans in netlists

02. Improved transparency in Trojan detection decisions

03. Validated on trust-hub benchmark circuits

Abstract

Hardware Trojan detection methods based on machine learning (ML) techniques mainly identify suspected circuits but cannot explain how a decision was reached. This paper introduces an explainable methodology and architecture built on existing hardware Trojan detection features. Results are provided for explaining digital hardware Trojans within a netlist using trust-hub Trojan benchmarks.
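
To make the idea of a classifier that also explains its decision concrete, the following is a minimal sketch and not the authors' architecture. It assumes a simple nearest-example scheme: a net is labeled by majority vote among its closest labeled examples, and the explanation is the set of examples that drove the vote plus the per-feature gap to the closest supporting one. The feature names (fan_in, flip_flop_distance, primary_input_distance), the example values, and the function classify_and_explain are all hypothetical and chosen only for illustration; the abstract does not list the features or the explanation mechanism.

```python
from math import dist

# Hypothetical per-net feature names; the source does not enumerate its features.
FEATURES = ("fan_in", "flip_flop_distance", "primary_input_distance")

# Hypothetical labeled examples (feature vector, label), invented for illustration.
TRAINING = [
    ((2.0, 1.0, 3.0), "normal"),
    ((3.0, 2.0, 2.0), "normal"),
    ((2.0, 2.0, 4.0), "normal"),
    ((9.0, 6.0, 8.0), "trojan"),
    ((8.0, 7.0, 9.0), "trojan"),
]


def classify_and_explain(sample, training=TRAINING, k=3):
    """Label a net by majority vote of its k nearest examples and report why."""
    # Rank labeled examples by Euclidean distance to the sample; keep the k closest.
    neighbors = sorted(training, key=lambda ex: dist(sample, ex[0]))[:k]

    # Count the labels among the nearest examples.
    votes = {}
    for _, label in neighbors:
        votes[label] = votes.get(label, 0) + 1
    label = max(votes, key=votes.get)

    # Explanation: per-feature absolute gap to the closest example that
    # carries the winning label.
    closest = next(ex for ex in neighbors if ex[1] == label)
    gaps = {name: abs(s - c) for name, s, c in zip(FEATURES, sample, closest[0])}

    return {"label": label, "votes": votes, "neighbors": neighbors, "feature_gaps": gaps}


if __name__ == "__main__":
    result = classify_and_explain((8.5, 6.5, 8.5))
    print(result["label"], result["votes"])
    print(result["feature_gaps"])
```

Returning the supporting examples alongside the label lets a reviewer inspect which known circuits the decision rests on, which is one simple way to expose the reasoning behind a classification. Other explanation schemes would fit the same interface.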

Taxonomy

Topics: Physical Unclonable Functions (PUFs) and Hardware Security · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques