Feature Partitioning for Robust Tree Ensembles and their Certification in Adversarial Scenarios
Stefano Calzavara, Claudio Lucchese, Federico Marcuzzi, Salvatore, Orlando

TL;DR
This paper introduces a feature partitioning approach to enhance the robustness of tree ensemble models against adversarial evasion attacks, providing certification methods and demonstrating superior performance over existing techniques.
Contribution
The paper proposes a novel feature partitioning strategy for training robust ensemble models and an efficient certification method to evaluate their minimal accuracy under attack.
Findings
Outperforms state-of-the-art adversarial algorithms in experiments
Guarantees most models in the ensemble are unaffected by attacks
Provides an efficient certification method for ensemble robustness
Abstract
Machine learning algorithms, however effective, are known to be vulnerable in adversarial scenarios where a malicious user may inject manipulated instances. In this work we focus on evasion attacks, where a model is trained in a safe environment and exposed to attacks at test time. The attacker aims at finding a minimal perturbation of a test instance that changes the model outcome. We propose a model-agnostic strategy that builds a robust ensemble by training its basic models on feature-based partitions of the given dataset. Our algorithm guarantees that the majority of the models in the ensemble cannot be affected by the attacker. We experimented the proposed strategy on decision tree ensembles, and we also propose an approximate certification method for tree ensembles that efficiently assess the minimal accuracy of a forest on a given dataset avoiding the costly computation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)
