Explainability Methods for Hardware Trojan Detection: A Systematic Comparison
Paul Whitten, Francis Wolff, Chris Papachristou

TL;DR
This paper systematically compares explainability methods for hardware Trojan detection, evaluating domain-aware, case-based, and feature attribution techniques on a benchmark dataset.
Contribution
It provides a comprehensive comparison of three explainability approaches tailored for hardware Trojan detection, highlighting their strengths and limitations.
Findings
Domain-aware analysis offers circuit-specific insights.
Case-based reasoning provides precedent-based explanations.
Feature attribution methods give importance scores without circuit context.
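As a rough illustration of the contrast between the last two findings, the sketch below pairs a case-based explanation with a feature attribution on the same toy classifier. Everything in it is an assumption made for illustration and is not taken from the paper: the two features (fan_in, switching_probability), the six hand-made training cases, the k-nearest-neighbor classifier, and the occlusion-style attribution are stand-ins, not the authors' methods or data.

```python
import math

# Hypothetical toy data (not from the paper): each case is
# ((fan_in, switching_probability), label), where label 1 = Trojan-like.
TRAIN = [
    ((2.0, 0.50), 0),
    ((3.0, 0.45), 0),
    ((2.0, 0.40), 0),
    ((8.0, 0.02), 1),
    ((9.0, 0.01), 1),
    ((7.0, 0.03), 1),
]
FEATURES = ("fan_in", "switching_probability")


def nearest_cases(query, k=3):
    """Case-based explanation: the k labeled cases closest to the query."""
    ranked = sorted(TRAIN, key=lambda case: math.dist(case[0], query))
    return ranked[:k]


def predict(query, k=3):
    """Classify by majority vote over the k nearest cases."""
    votes = [label for _, label in nearest_cases(query, k)]
    return int(sum(votes) * 2 > len(votes))


def attribution(query, baseline, k=3):
    """Occlusion-style attribution: 1 if swapping a single feature for its
    baseline value flips the prediction, else 0."""
    original = predict(query, k)
    scores = {}
    for i, name in enumerate(FEATURES):
        occluded = list(query)
        occluded[i] = baseline[i]
        scores[name] = abs(original - predict(tuple(occluded), k))
    return scores


query = (8.5, 0.02)     # hypothetical suspicious net
baseline = (2.0, 0.45)  # hypothetical "typical benign" feature values

cases = nearest_cases(query)            # precedents an engineer can inspect
scores = attribution(query, baseline)   # per-feature scores, no circuit context

print("prediction:", predict(query))
for features, label in cases:
    print("precedent:", dict(zip(FEATURES, features)), "label:", label)
print("attribution:", scores)
```

The precedents point to concrete labeled cases that can be inspected, while the attribution only reports which feature moved the prediction. Because the features are left unscaled here, fan_in dominates the distance, so the scores reflect the toy setup as much as any property of the circuit.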
Abstract
Hardware trojans are malicious circuits that compromise the functionality and security of an integrated circuit (IC). These circuits are manufactured directly into the silicon and, unlike software, cannot be fixed by security patches. A fix would require a costly product recall to replace the IC; hence, early detection in the design process is essential. Hardware detection at best provides statistically based solutions with many false positives and false negatives. These detection methods require more thorough explainable analysis to filter out false indicators. Existing explainability methods developed for general domains like image classification may not provide the actionable insights that hardware engineers need. A question remains: How do domain-aware property analysis, model-agnostic case-based reasoning, and model-agnostic feature attribution techniques compare for…
Taxonomy
Topics: Physical Unclonable Functions (PUFs) and Hardware Security · Security and Verification in Computing · Advanced Malware Detection Techniques
