Clean-Label Physical Backdoor Attacks with Data Distillation
Thinh Dao, Khoa D Doan, Kok-Seng Wong

TL;DR
This paper introduces a novel clean-label physical backdoor attack method that injects imperceptible perturbations into training data, enabling physical-world backdoor activation without label manipulation, outperforming existing dirty-label approaches.
Contribution
The authors propose CLPBA, a new physical backdoor attack framework using dataset distillation techniques that do not require label changes or trigger injection during training.
Findings
CLPBA effectively creates physical backdoors without label manipulation.
It surpasses dirty-label attack baselines in physical scenarios.
The method is validated on facial recognition and animal classification datasets.
Abstract
Deep Neural Networks (DNNs) are shown to be vulnerable to backdoor poisoning attacks, with most research focusing on digital triggers -- artificial patterns added to test-time inputs to induce targeted misclassification. Physical triggers, which are natural objects embedded in real-world scenes, offer a promising alternative for attackers, as they can activate backdoors in real-time without digital manipulation. However, existing physical backdoor attacks are dirty-label, meaning that attackers must change the labels of poisoned inputs to the target label. The inconsistency between image content and label exposes the attack to human inspection, reducing its stealthiness in real-world settings. To address this limitation, we introduce Clean-Label Physical Backdoor Attack (CLPBA), a new paradigm of physical backdoor attack that does not require label manipulation and trigger injection at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Malware Detection Techniques · Security and Verification in Computing · Electrostatic Discharge in Electronics
