Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap

Mattia Billa; Giovanni Orlandi; Veronica Guidetti; Federica Mandreoli

arXiv:2601.00428·cs.LG·March 27, 2026

Interpretable ML Under the Microscope: Performance, Meta-Features, and the Regression-Classification Predictability Gap

Mattia Billa, Giovanni Orlandi, Veronica Guidetti, Federica Mandreoli

PDF

Open Access

TL;DR

This paper systematically evaluates interpretable machine learning models on real-world tabular data, revealing performance hierarchies in regression but not in classification, and highlighting trade-offs between interpretability and training efficiency.

Contribution

It provides a comprehensive empirical comparison of sixteen interpretable models across diverse datasets, analyzing how dataset features influence model performance and interpretability trade-offs.

Findings

01

Regression models like EBMs and SR perform predictably based on dataset features.

02

Classification performance varies widely with no clear hierarchy.

03

Models with structural sparsity have longer training times.

Abstract

As machine learning models are increasingly deployed in high-stakes domains, the need for interpretability has grown to meet strict regulatory and accountability constraints. Despite this interest, systematic evaluations of inherently interpretable models for tabular data remain scarce and often focus solely on aggregated performance. To address this gap, we evaluate sixteen interpretable methods, including Explainable Boosting Machines (EBMs), Symbolic Regression (SR), and Generalized Optimal Sparse Decision Trees, across 216 real-world tabular datasets. We assess predictive accuracy, computational efficiency, and generalization under distributional shifts. Moving beyond aggregate performance rankings, we further analyze how model behavior varies with dataset meta-features and operationalize these descriptors to study algorithm selection. Our analyses reveal a clear dichotomy: in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Machine Learning in Healthcare