On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing   Problem

Georg Pichler; Marco Romanelli; Divya Prakash Manivannan; Prashanth; Krishnamurthy; Farshad Khorrami; Siddharth Garg

arXiv:2402.16926·cs.CR·February 28, 2024·1 cites

On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

Georg Pichler, Marco Romanelli, Divya Prakash Manivannan, Prashanth, Krishnamurthy, Farshad Khorrami, Siddharth Garg

PDF

Open Access 1 Repo

TL;DR

This paper formalizes backdoor detection as a statistical hypothesis testing problem, proving fundamental limitations and conditions for feasibility, and highlights the necessity of adversary-awareness in detection methods.

Contribution

It introduces a formal statistical framework for backdoor detection, proving an impossibility result for universal detection and linking it to PAC learnability.

Findings

01

Universal backdoor detection is impossible for large alphabets.

02

Backdoor detection requires adversary-awareness to be effective.

03

Specific scenarios can still enable successful detection.

Abstract

We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition. The main contributions of this work are an impossibility result and an achievability result for backdoor detection. We show a no-free-lunch theorem, proving that universal (adversary-unaware) backdoor detection is impossible, except for very small alphabet sizes. Thus, we argue, that backdoor detection methods need to be either explicitly, or implicitly adversary-aware. However, our work does not imply that backdoor detection cannot work in specific scenarios, as evidenced by successful backdoor detection methods in the scientific literature. Furthermore, we connect our definition to the probably approximately correct (PAC) learnability of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

g-pichler/in_feasibility_of_ml_backdoor_detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmbedded Systems Design Techniques · VLSI and Analog Circuit Testing