Scalable Whitebox Attacks on Tree-based Models
Giuseppe Castiglione, Gavin Ding, Masoud Hashemi, Christopher, Srinivasa, Ga Wu

TL;DR
This paper introduces a scalable whitebox attack method for tree ensemble models by smoothing them with sigmoid functions, enabling gradient-based attacks and revealing vulnerabilities efficiently.
Contribution
It presents a novel approach that adapts gradient-based adversarial testing to non-differentiable tree models, bridging a gap in robustness evaluation for industry-relevant models.
Findings
Successfully reveals vulnerabilities in tree ensembles
Scales to large testing tasks with manageable computational cost
Balances attack effectiveness and efficiency
Abstract
Adversarial robustness is one of the essential safety criteria for guaranteeing the reliability of machine learning models. While various adversarial robustness testing approaches were introduced in the last decade, we note that most of them are incompatible with non-differentiable models such as tree ensembles. Since tree ensembles are widely used in industry, this reveals a crucial gap between adversarial robustness research and practical applications. This paper proposes a novel whitebox adversarial robustness testing approach for tree ensemble models. Concretely, the proposed approach smooths the tree ensembles through temperature controlled sigmoid functions, which enables gradient descent-based adversarial attacks. By leveraging sampling and the log-derivative trick, the proposed approach can scale up to testing tasks that were previously unmanageable. We compare the approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)
