PBP: Post-training Backdoor Purification for Malware Classifiers
Dung Thuy Nguyen, Ngoc N. Tran, Taylor T. Johnson, Kevin Leach

TL;DR
PBP is a post-training method that effectively removes backdoors from malware classifiers by adjusting batch normalization statistics, significantly reducing attack success rates with minimal training data.
Contribution
Introduces PBP, a novel post-training backdoor purification technique that does not rely on specific backdoor mechanisms and requires only 1% of training data.
Findings
Reduces attack success rate from 100% to nearly 0%
Requires only 1% of training data for effective purification
Outperforms several state-of-the-art backdoor defense methods
Abstract
In recent years, the rise of machine learning (ML) in cybersecurity has brought new challenges, including the increasing threat of backdoor poisoning attacks on ML malware classifiers. For instance, adversaries could inject malicious samples into public malware repositories, contaminating the training data and potentially misclassifying malware by the ML model. Current countermeasures predominantly focus on detecting poisoned samples by leveraging disagreements within the outputs of a diverse set of ensemble models on training data points. However, these methods are not suitable for scenarios where Machine Learning-as-a-Service (MLaaS) is used or when users aim to remove backdoors from a model after it has been trained. Addressing this scenario, we introduce PBP, a post-training defense for malware classifiers that mitigates various types of backdoor embeddings without assuming any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications
MethodsSparse Evolutionary Training · Batch Normalization · Focus
