Trading Complexity for Sparsity in Random Forest Explanations
Gilles Audemard, Steve Bellart, Louenas Bounia, Fr\'ed\'eric, Koriche, Jean-Marie Lagniez, Pierre Marquis

TL;DR
This paper explores methods to explain random forest classifications by balancing explanation complexity and interpretability, introducing majoritary reasons as a computationally efficient alternative to traditional sufficient reasons.
Contribution
It introduces majoritary reasons as a new explanation type for random forests, offering a trade-off between explanation size and computational complexity.
Findings
Majoritary reasons are smaller and easier to compute than sufficient reasons.
A linear-time greedy algorithm can generate majoritary reasons efficiently.
Minimal majoritary reasons can be approximated with an anytime partial Max SAT algorithm.
Abstract
Random forests have long been considered as powerful model ensembles in machine learning. By training multiple decision trees, whose diversity is fostered through data and feature subsampling, the resulting random forest can lead to more stable and reliable predictions than a single decision tree. This however comes at the cost of decreased interpretability: while decision trees are often easily interpretable, the predictions made by random forests are much more difficult to understand, as they involve a majority vote over hundreds of decision trees. In this paper, we examine different types of reasons that explain "why" an input instance is classified as positive or negative by a Boolean random forest. Notably, as an alternative to sufficient reasons taking the form of prime implicants of the random forest, we introduce majoritary reasons which are prime implicants of a strict majority…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning
