ATTRITION: Attacking Static Hardware Trojan Detection Techniques Using Reinforcement Learning
Vasudev Gohil, Hao Guo, Satwik Patnaik, Jeyavijayan (JV) Rajendran

TL;DR
This paper introduces ATTRITION, an RL-based attack framework that systematically evades multiple hardware Trojan detection techniques, exposing their vulnerabilities and highlighting the need for more robust detection methods.
Contribution
We develop a scalable reinforcement learning attack framework that effectively evades existing hardware Trojan detection techniques, challenging their assumed efficacy and promoting improved defenses.
Findings
ATTRITION evades 8 detection techniques with high success rates.
It outperforms random insertion by 47 to 211 times in success rate.
Demonstrates effectiveness on various hardware designs and case studies.
Abstract
Stealthy hardware Trojans (HTs) inserted during the fabrication of integrated circuits can bypass the security of critical infrastructures. Although researchers have proposed many techniques to detect HTs, several limitations exist, including: (i) a low success rate, (ii) high algorithmic complexity, and (iii) a large number of test patterns. Furthermore, the most pertinent drawback of prior detection techniques stems from an incorrect evaluation methodology, i.e., they assume that an adversary inserts HTs randomly. Such inappropriate adversarial assumptions enable detection techniques to claim high HT detection accuracy, leading to a "false sense of security." Unfortunately, to the best of our knowledge, despite more than a decade of research on detecting HTs inserted during fabrication, there have been no concerted efforts to perform a systematic evaluation of HT detection techniques.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhysical Unclonable Functions (PUFs) and Hardware Security · Integrated Circuits and Semiconductor Failure Analysis · Adversarial Robustness in Machine Learning
MethodsTest · Greedy Policy Search
