Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization
Yuhao He, Jinyu Tian, Xianwei Zheng, Li Dong, Yuanman Li, Jiantao Zhou

TL;DR
This paper introduces Deferred Poisoning, a stealthy attack that makes models vulnerable by creating a large curvature in the loss landscape while maintaining normal performance during training and validation.
Contribution
The paper proposes a novel poisoning attack method that induces Hessian singularization, increasing model vulnerability without detection during training or validation phases.
Findings
The attack causes significant performance degradation with small perturbations.
It maintains normal training and validation accuracy, ensuring stealthiness.
The method is validated on image classification tasks with both theoretical and empirical evidence.
Abstract
Recent studies have shown that deep learning models are very vulnerable to poisoning attacks. Many defense methods have been proposed to address this issue. However, traditional poisoning attacks are not as threatening as commonly believed. This is because they often cause differences in how the model performs on the training set compared to the validation set. Such inconsistency can alert defenders that their data has been poisoned, allowing them to take the necessary defensive actions. In this paper, we introduce a more threatening type of poisoning attack called the Deferred Poisoning Attack. This new attack allows the model to function normally during the training and validation phases but makes it very sensitive to evasion attacks or even natural noise. We achieve this by ensuring the poisoned model's loss function has a similar value as a normally trained model at each input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical and Computational Modeling · History and advancements in chemistry
MethodsSparse Evolutionary Training
