Evading classifiers in discrete domains with provable optimality   guarantees

Bogdan Kulynych; Jamie Hayes; Nikita Samarin; Carmela Troncoso

arXiv:1810.10939·cs.LG·July 2, 2019·20 cites

Evading classifiers in discrete domains with provable optimality guarantees

Bogdan Kulynych, Jamie Hayes, Nikita Samarin, Carmela Troncoso

PDF

Open Access 2 Repos

TL;DR

This paper presents a graphical framework for generating provably minimal adversarial examples in discrete domains, addressing limitations of norm-based attacks and enhancing security guarantees in applications like bot detection and privacy protection.

Contribution

The authors introduce a novel graphical framework that generalizes adversarial attacks in discrete domains, accommodating complex cost functions and providing provable minimal adversarial examples.

Findings

01

Successfully evaded a Twitter-bot classifier with minimal changes

02

Crafted adversarial examples for website fingerprinting defenses

03

Framework handles complex, domain-specific cost functions

Abstract

Machine-learning models for security-critical applications such as bot, malware, or spam detection, operate in constrained discrete domains. These applications would benefit from having provable guarantees against adversarial examples. The existing literature on provable adversarial robustness of models, however, exclusively focuses on robustness to gradient-based attacks in domains such as images. These attacks model the adversarial cost, e.g., amount of distortion applied to an image, as a $p$ -norm. We argue that this approach is not well-suited to model adversarial costs in constrained domains where not all examples are feasible. We introduce a graphical framework that (1) generalizes existing attacks in discrete domains, (2) can accommodate complex cost functions beyond $p$ -norms, including financial cost incurred when attacking a classifier, and (3) efficiently produces valid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection