Defending Against Repetitive Backdoor Attacks on Semi-supervised   Learning through Lens of Rate-Distortion-Perception Trade-off

Cheng-Yi Lee; Ching-Chia Kao; Cheng-Han Yeh; Chun-Shien Lu; Chia-Mu Yu; and Chu-Song Chen

arXiv:2407.10180·cs.CV·December 5, 2024

Defending Against Repetitive Backdoor Attacks on Semi-supervised Learning through Lens of Rate-Distortion-Perception Trade-off

Cheng-Yi Lee, Ching-Chia Kao, Cheng-Han Yeh, Chun-Shien Lu, Chia-Mu Yu, and Chu-Song Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces UPure, a novel frequency domain perturbation method leveraging the RDP trade-off to effectively defend semi-supervised learning models against backdoor data poisoning attacks without requiring clean labeled data.

Contribution

The study proposes UPure, a new data purification technique that disrupts backdoor triggers in unlabeled data using frequency domain perturbations based on RDP trade-off analysis.

Findings

01

Reduces attack success rate from 99.78% to 0%.

02

Maintains high model accuracy on benchmark datasets.

03

Effective across multiple SSL algorithms.

Abstract

Semi-supervised learning (SSL) has achieved remarkable performance with a small fraction of labeled data by leveraging vast amounts of unlabeled data from the Internet. However, this large pool of untrusted data is extremely vulnerable to data poisoning, leading to potential backdoor attacks. Current backdoor defenses are not yet effective against such a vulnerability in SSL. In this study, we propose a novel method, Unlabeled Data Purification (UPure), to disrupt the association between trigger patterns and target classes by introducing perturbations in the frequency domain. By leveraging the Rate-Distortion-Perception (RDP) trade-off, we further identify the frequency band, where the perturbations are added, and justify this selection. Notably, UPure purifies poisoned unlabeled data without the need of extra clean labeled data. Extensive experiments on four benchmark datasets and five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chengyi-chris/upure
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Malware Detection Techniques