Augmentation Backdoors

Joseph Rance; Yiren Zhao; Ilia Shumailov; Robert Mullins

arXiv:2209.15139·cs.LG·October 3, 2022

Augmentation Backdoors

Joseph Rance, Yiren Zhao, Ilia Shumailov, Robert Mullins

PDF

Open Access 1 Repo

TL;DR

This paper reveals how data augmentation techniques in machine learning can be exploited to covertly insert backdoors into models, highlighting a new security vulnerability in the training pipeline.

Contribution

It introduces three novel backdoor attack methods leveraging different augmentation transforms, demonstrating their effectiveness and stealthiness across benchmarks.

Findings

01

Backdoors can be inserted via augmentation transforms without initial dataset tampering.

02

Attacks are effective across various computer vision benchmarks.

03

Proposed methods are difficult to detect and support arbitrary backdoor functionalities.

Abstract

Data augmentation is used extensively to improve model generalisation. However, reliance on external libraries to implement augmentation methods introduces a vulnerability into the machine learning pipeline. It is well known that backdoors can be inserted into machine learning models through serving a modified dataset to train on. Augmentation therefore presents a perfect opportunity to perform this modification without requiring an initially backdoored dataset. In this paper we present three backdoor attacks that can be covertly inserted into data augmentation. Our attacks each insert a backdoor using a different type of computer vision augmentation transform, covering simple image transforms, GAN-based augmentation, and composition-based augmentation. By inserting the backdoor using these augmentation transforms, we make our backdoors difficult to detect, while still supporting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slkdfjslkjfd/augmentation_backdoors
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection