Poison in the Well: Feature Embedding Disruption in Backdoor Attacks
Zhou Feng, Jiahao Chen, Chunyi Zhou, Yuwen Pu, Qingming Li, Shouling Ji

TL;DR
This paper introduces ShadowPrint, a novel backdoor attack method that manipulates feature embeddings in neural networks to achieve high attack success rates, stealth, and stability with minimal data poisoning.
Contribution
ShadowPrint is a versatile backdoor attack that reduces data reliance and operates effectively at extremely low poison rates using clustering-based optimization.
Findings
Achieves up to 100% attack success rate.
Maintains low detection and decay rates.
Effective with poison rates as low as 0.01%.
Abstract
Backdoor attacks embed malicious triggers into training data, enabling attackers to manipulate neural network behavior during inference while maintaining high accuracy on benign inputs. However, existing backdoor attacks face limitations manifesting in excessive reliance on training data, poor stealth, and instability, which hinder their effectiveness in real-world applications. Therefore, this paper introduces ShadowPrint, a versatile backdoor attack that targets feature embeddings within neural networks to achieve high ASRs and stealthiness. Unlike traditional approaches, ShadowPrint reduces reliance on training data access and operates effectively with exceedingly low poison rates (as low as 0.01%). It leverages a clustering-based optimization strategy to align feature embeddings, ensuring robust performance across diverse scenarios while maintaining stability and stealth. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques
MethodsALIGN
