Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

Ganghua Wang; Xun Xian; Jayanth Srinivasa; Ashish Kundu; Xuan Bi,; Mingyi Hong; Jie Ding

arXiv:2310.10780·cs.CR·October 19, 2023·1 cites

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

Ganghua Wang, Xun Xian, Jayanth Srinivasa, Ashish Kundu, Xuan Bi,, Mingyi Hong, Jie Ding

PDF

Open Access 3 Reviews

TL;DR

This paper provides a statistical framework to understand and evaluate the effectiveness of backdoor poisoning attacks in machine learning, offering theoretical insights and experimental validation.

Contribution

It establishes tight bounds and fundamental principles for backdoor attack success, addressing key questions about attack factors and trigger detectability.

Findings

01

Derived bounds for model performance under attack

02

Identified key factors influencing attack success

03

Validated theory with experiments on benchmark datasets

Abstract

The growing dependence on machine learning in real-world applications emphasizes the importance of understanding and ensuring its safety. Backdoor attacks pose a significant security risk due to their stealthy nature and potentially serious consequences. Such attacks involve embedding triggers within a learning model with the intention of causing malicious behavior when an active trigger is present while maintaining regular functionality without it. This paper evaluates the effectiveness of any backdoor attack incorporating a constant trigger, by establishing tight lower and upper boundaries for the performance of the compromised model on both clean and backdoor test data. The developed theory answers a series of fundamental but previously underexplored problems, including (1) what are the determining factors for a backdoor attack's success, (2) what is the direction of the most…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 3· reject, not good enoughConfidence 5

Strengths

1. Theoretical understanding of backdoor attacks is an important topics. 2. The authors demonstrate their skills in using statistical tools.

Weaknesses

While I appreciate the skills demonstrated by the authors, none of the obtained insights is interesting in a sense that they are either trivial or not true without assuming the model to be Bayesian optimal with respect to the poisoned training distribution. To be specific, insight 1&2 listed in the above Summary section are trivial (even though it may generalize to other backdoor/poison attacks); Insight 3&4&5 are trivial only when assuming the model to be Bayesian optimal but may not generali

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

1. The paper provides a theoretical analysis on backdoor attacks, an important topic of machine learning security. 2. A few factors that contribute to the success of a backdoor attack are studied in the paper. The choice of a trigger is particularly interesting. The insights shown in the paper can provide a theoretical guideline for further work. 3. The empirical results on synthetic data validate the theoretical analysis and also provide an explanation for generative models.

Weaknesses

1. Some claims are not well validated empirically. The paper states "a large backdoor data ratio ρ will damage the performance on clean data." But there is no empirical evidence to support this claim. Also, according to the literature, a high poisoning rate usually does not significantly affect clean accuracy. It is recommended to empirically validate this claim and assess its consistency with the theories. 2. The experiment conducted in Table 2 is not clear. What does the magnitude of backdoor

Reviewer 03Rating 8· accept, good paperConfidence 4

Strengths

Their theoretical conclusion for the efficiency of backdoor attacks matches with the empirical results. For instance, the influence of the poisoning ratio and the magnitude of the trigger signal. Moreover, they also claimed that when fixing the poisoning ratio and the magnitude of the trigger, it is more efficient to choose the trigger along the direction the density of clean data drops quickly.

Weaknesses

One thing I want to mention is about the reference, as far as I know, there exist some references on the backdoor efficiency. The authors should cite them. [1] W. Guo, B. Tondi and M. Barni, "A Temporal Chrominance Trigger for Clean-Label Backdoor Attack Against Anti-Spoof Rebroadcast Detection," in IEEE Transactions on Dependable and Secure Computing, doi: 10.1109/TDSC.2022.3233519. [2] Yinghua Gao, Yiming Li, Linghui Zhu, Dongxian Wu, Yong Jiang, and Shu-Tao Xia. Not all samples are born equal

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques