Glitches in Decision Tree Ensemble Models
Satyankar Chandra, Ashutosh Gupta, Kaushik Mallik, Krishna Shankaranarayanan, Namrita Varshney

TL;DR
This paper introduces the concept of glitches in decision tree ensemble models, formalizes their definition, demonstrates their widespread existence, and develops an MILP-based algorithm to detect them, highlighting potential model unreliability.
Contribution
It formally defines glitches in decision tree models, proves the NP-completeness of detecting them, and presents an effective MILP-based detection algorithm for GBDT models.
Findings
Glitches are widespread in decision tree models.
Detecting glitches is NP-complete for tree ensembles of depth 4.
The proposed MILP algorithm effectively finds glitches in benchmarks.
Abstract
Many critical decision-making tasks are now delegated to machine-learned models, and it is imperative that their decisions are trustworthy and reliable, and their outputs are consistent across similar inputs. We identify a new source of unreliable behaviors-called glitches-which may significantly impair the reliability of AI models having steep decision boundaries. Roughly speaking, glitches are small neighborhoods in the input space where the model's output abruptly oscillates with respect to small changes in the input. We provide a formal definition of glitches, and use well-known models and datasets from the literature to demonstrate that they have widespread existence and argue they usually indicate potential model inconsistencies in the neighborhood of where they are found. We proceed to the algorithmic search of glitches for widely used gradient-boosted decision tree (GBDT)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
