Architectural Backdoors in Deep Learning: A Survey of Vulnerabilities, Detection, and Defense

Victoria Childress; Josh Collyer; Jodie Knapp

arXiv:2507.12919·cs.CR·July 18, 2025

Architectural Backdoors in Deep Learning: A Survey of Vulnerabilities, Detection, and Defense

Victoria Childress, Josh Collyer, Jodie Knapp

PDF

Open Access

TL;DR

This survey reviews the vulnerabilities of architectural backdoors in deep neural networks, discusses current detection and defense methods, and highlights open challenges for future research in securing AI models.

Contribution

It consolidates existing research on architectural backdoors, evaluates detection strategies, and identifies gaps and future directions for enhancing model security.

Findings

01

Architectural backdoors evade standard mitigation techniques.

02

Current defenses like static inspection and fuzzing have limitations.

03

Scalable, practical defenses against backdoors are still lacking.

Abstract

Architectural backdoors pose an under-examined but critical threat to deep neural networks, embedding malicious logic directly into a model's computational graph. Unlike traditional data poisoning or parameter manipulation, architectural backdoors evade standard mitigation techniques and persist even after clean retraining. This survey systematically consolidates research on architectural backdoors, spanning compiler-level manipulations, tainted AutoML pipelines, and supply-chain vulnerabilities. We assess emerging detection and defense strategies, including static graph inspection, dynamic fuzzing, and partial formal verification, and highlight their limitations against distributed or stealth triggers. Despite recent progress, scalable and practical defenses remain elusive. We conclude by outlining open challenges and proposing directions for strengthening supply-chain security,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning