ByteShield: Adversarially Robust End-to-End Malware Detection through Byte Masking
Daniel Gibert, Felip Many\`a

TL;DR
ByteShield introduces a byte-level masking defense for malware detection that systematically occludes adversarial payloads, improving robustness against manipulations while maintaining high accuracy on clean data.
Contribution
This paper presents a deterministic byte masking strategy that enhances malware detector robustness by systematically occluding adversarial payloads, outperforming existing smoothing defenses.
Findings
Outperforms randomized smoothing defenses against adversarial payloads
Maintains high accuracy on clean malware samples
Effective on EMBER and BODMAS datasets
Abstract
Research has proven that end-to-end malware detectors are vulnerable to adversarial attacks. In response, the research community has proposed defenses based on randomized and (de)randomized smoothing. However, these techniques remain susceptible to attacks that insert large adversarial payloads. To address these limitations, we propose a novel defense mechanism designed to harden end-to-end malware detectors by leveraging masking at the byte level. This mechanism operates by generating multiple masked versions of the input file, independently classifying each version, and then applying a threshold-based voting mechanism to produce the final classification. Key to this defense is a deterministic masking strategy that systematically strides a mask across the entire input file. Unlike randomized smoothing defenses, which randomly mask or delete bytes, this structured approach ensures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing
