Learning Hybrid Interpretable Models: Theory, Taxonomy, and Methods
Julien Ferry (LAAS-ROC), Gabriel Laberge (EPM), Ulrich A\"ivodji (ETS)

TL;DR
This paper provides a comprehensive analysis of hybrid interpretable models, exploring their theoretical foundations, taxonomy, and new methods, demonstrating their potential to outperform black boxes while maintaining transparency.
Contribution
It introduces a unified framework for understanding, classifying, and implementing hybrid models, including the novel HybridCORELS method that offers optimality guarantees and transparency control.
Findings
Hybrid models can outperform standalone black boxes in certain settings.
HybridCORELS achieves competitive performance with enhanced interpretability.
Theoretical PAC guarantees identify an optimal transparency trade-off.
Abstract
A hybrid model involves the cooperation of an interpretable model and a complex black box. At inference, any input of the hybrid model is assigned to either its interpretable or complex component based on a gating mechanism. The advantages of such models over classical ones are two-fold: 1) They grant users precise control over the level of transparency of the system and 2) They can potentially perform better than a standalone black box since redirecting some of the inputs to an interpretable model implicitly acts as regularization. Still, despite their high potential, hybrid models remain under-studied in the interpretability/explainability literature. In this paper, we remedy this fact by presenting a thorough investigation of such models from three perspectives: Theory, Taxonomy, and Methods. First, we explore the theory behind the generalization of hybrid models from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
