Technical Report: When Does Machine Learning FAIL? Generalized   Transferability for Evasion and Poisoning Attacks

Octavian Suciu; Radu M\u{a}rginean; Yi\u{g}itcan Kaya; Hal Daum\'e; III; Tudor Dumitra\c{s}

arXiv:1803.06975·cs.CR·March 11, 2019·1 cites

Technical Report: When Does Machine Learning FAIL? Generalized Transferability for Evasion and Poisoning Attacks

Octavian Suciu, Radu M\u{a}rginean, Yi\u{g}itcan Kaya, Hal Daum\'e, III, Tudor Dumitra\c{s}

PDF

Open Access

TL;DR

This paper introduces the FAIL attacker model to define adversary capabilities in machine learning security, enabling the design of practical poisoning attacks like StingRay that are effective under realistic constraints and can bypass existing defenses.

Contribution

The paper proposes the FAIL model for adversary capabilities, and develops StingRay, a practical targeted poisoning attack that considers realistic constraints and transferability.

Findings

01

StingRay successfully attacks 4 ML applications across 3 algorithms.

02

StingRay bypasses 2 existing defenses effectively.

03

Prior evasion attacks are less effective under the FAIL model.

Abstract

Recent results suggest that attacks against supervised machine learning systems are quite effective, while defenses are easily bypassed by new attacks. However, the specifications for machine learning systems currently lack precise adversary definitions, and the existing attacks make diverse, potentially unrealistic assumptions about the strength of the adversary who launches them. We propose the FAIL attacker model, which describes the adversary's knowledge and control along four dimensions. Our model allows us to consider a wide range of weaker adversaries who have limited control and incomplete knowledge of the features, learning algorithms and training instances utilized. To evaluate the utility of the FAIL model, we consider the problem of conducting targeted poisoning attacks in a realistic setting: the crafted poison samples must have clean labels, must be individually and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications