Augmentation Backdoors
Joseph Rance, Yiren Zhao, Ilia Shumailov, Robert Mullins

TL;DR
This paper reveals how data augmentation techniques in machine learning can be exploited to covertly insert backdoors into models, highlighting a new security vulnerability in the training pipeline.
Contribution
It introduces three novel backdoor attack methods leveraging different augmentation transforms, demonstrating their effectiveness and stealthiness across benchmarks.
Findings
Backdoors can be inserted via augmentation transforms without initial dataset tampering.
Attacks are effective across various computer vision benchmarks.
Proposed methods are difficult to detect and support arbitrary backdoor functionalities.
Abstract
Data augmentation is used extensively to improve model generalisation. However, reliance on external libraries to implement augmentation methods introduces a vulnerability into the machine learning pipeline. It is well known that backdoors can be inserted into machine learning models through serving a modified dataset to train on. Augmentation therefore presents a perfect opportunity to perform this modification without requiring an initially backdoored dataset. In this paper we present three backdoor attacks that can be covertly inserted into data augmentation. Our attacks each insert a backdoor using a different type of computer vision augmentation transform, covering simple image transforms, GAN-based augmentation, and composition-based augmentation. By inserting the backdoor using these augmentation transforms, we make our backdoors difficult to detect, while still supporting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
