Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap
Mattia Billa, Giovanni Orlandi, Veronica Guidetti, Federica Mandreoli

TL;DR
This paper systematically evaluates interpretable machine learning models on real-world tabular data, revealing performance hierarchies in regression but not in classification, and highlighting trade-offs between interpretability and training efficiency.
Contribution
It provides a comprehensive empirical comparison of sixteen interpretable models across diverse datasets, analyzing how dataset features influence model performance and interpretability trade-offs.
Findings
Regression models like EBMs and SR perform predictably based on dataset features.
Classification performance varies widely with no clear hierarchy.
Models with structural sparsity have longer training times.
Abstract
As machine learning models are increasingly deployed in high-stakes domains, the need for interpretability has grown to meet strict regulatory and accountability constraints. Despite this interest, systematic evaluations of inherently interpretable models for tabular data remain scarce and often focus solely on aggregated performance. To address this gap, we evaluate sixteen interpretable methods, including Explainable Boosting Machines (EBMs), Symbolic Regression (SR), and Generalized Optimal Sparse Decision Trees, across 216 real-world tabular datasets. We assess predictive accuracy, computational efficiency, and generalization under distributional shifts. Moving beyond aggregate performance rankings, we further analyze how model behavior varies with dataset meta-features and operationalize these descriptors to study algorithm selection. Our analyses reveal a clear dichotomy: in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Machine Learning in Healthcare
