PBP: Post-training Backdoor Purification for Malware Classifiers

Dung Thuy Nguyen; Ngoc N. Tran; Taylor T. Johnson; Kevin Leach

arXiv:2412.03441·cs.LG·February 13, 2026

PBP: Post-training Backdoor Purification for Malware Classifiers

Dung Thuy Nguyen, Ngoc N. Tran, Taylor T. Johnson, Kevin Leach

PDF

Open Access 1 Repo

TL;DR

PBP is a post-training method that effectively removes backdoors from malware classifiers by adjusting batch normalization statistics, significantly reducing attack success rates with minimal training data.

Contribution

Introduces PBP, a novel post-training backdoor purification technique that does not rely on specific backdoor mechanisms and requires only 1% of training data.

Findings

01

Reduces attack success rate from 100% to nearly 0%

02

Requires only 1% of training data for effective purification

03

Outperforms several state-of-the-art backdoor defense methods

Abstract

In recent years, the rise of machine learning (ML) in cybersecurity has brought new challenges, including the increasing threat of backdoor poisoning attacks on ML malware classifiers. For instance, adversaries could inject malicious samples into public malware repositories, contaminating the training data and potentially misclassifying malware by the ML model. Current countermeasures predominantly focus on detecting poisoned samples by leveraging disagreements within the outputs of a diverse set of ensemble models on training data points. However, these methods are not suitable for scenarios where Machine Learning-as-a-Service (MLaaS) is used or when users aim to remove backdoors from a model after it has been trained. Addressing this scenario, we introduce PBP, a post-training defense for malware classifiers that mitigates various types of backdoor embeddings without assuming any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

judydnguyen/pbp-backdoor-purification-official
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications

MethodsSparse Evolutionary Training · Batch Normalization · Focus