Evasion and Hardening of Tree Ensemble Classifiers
Alex Kantchelian, J. D. Tygar, Anthony D. Joseph

TL;DR
This paper introduces two algorithms for finding adversarial examples in tree ensemble classifiers and proposes a method called adversarial boosting to harden models against evasion attacks.
Contribution
It presents novel algorithms for systematically computing evasions in tree ensembles and introduces adversarial boosting to improve model robustness.
Findings
Tree ensembles are highly susceptible to evasion attacks.
The proposed algorithms efficiently find adversarial instances.
Adversarial boosting enhances model robustness without losing accuracy.
Abstract
Classifier evasion consists in finding for a given instance the nearest instance such that the classifier predictions of and are different. We present two novel algorithms for systematically computing evasions for tree ensembles such as boosted trees and random forests. Our first algorithm uses a Mixed Integer Linear Program solver and finds the optimal evading instance under an expressive set of constraints. Our second algorithm trades off optimality for speed by using symbolic prediction, a novel algorithm for fast finite differences on tree ensembles. On a digit recognition task, we demonstrate that both gradient boosted trees and random forests are extremely susceptible to evasions. Finally, we harden a boosted tree model without loss of predictive accuracy by augmenting the training set of each boosting round with evading instances, a technique we call adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
