Evading classifiers in discrete domains with provable optimality guarantees
Bogdan Kulynych, Jamie Hayes, Nikita Samarin, Carmela Troncoso

TL;DR
This paper presents a graphical framework for generating provably minimal adversarial examples in discrete domains, addressing limitations of norm-based attacks and enhancing security guarantees in applications like bot detection and privacy protection.
Contribution
The authors introduce a novel graphical framework that generalizes adversarial attacks in discrete domains, accommodating complex cost functions and providing provable minimal adversarial examples.
Findings
Successfully evaded a Twitter-bot classifier with minimal changes
Crafted adversarial examples for website fingerprinting defenses
Framework handles complex, domain-specific cost functions
Abstract
Machine-learning models for security-critical applications such as bot, malware, or spam detection, operate in constrained discrete domains. These applications would benefit from having provable guarantees against adversarial examples. The existing literature on provable adversarial robustness of models, however, exclusively focuses on robustness to gradient-based attacks in domains such as images. These attacks model the adversarial cost, e.g., amount of distortion applied to an image, as a -norm. We argue that this approach is not well-suited to model adversarial costs in constrained domains where not all examples are feasible. We introduce a graphical framework that (1) generalizes existing attacks in discrete domains, (2) can accommodate complex cost functions beyond -norms, including financial cost incurred when attacking a classifier, and (3) efficiently produces valid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
