Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects
Vasilis Gkolemis, Loukas Kavouras, Dimitrios Kyriakopoulos, Konstantinos Tsopelas, Dimitrios Rontogiannis, Giuseppe Casalicchio, Theodore Dalamagas, Christos Diou

TL;DR
CALMs introduce a novel model class that balances interpretability and accuracy by allowing feature effects to vary across regions, capturing interactions while maintaining interpretability.
Contribution
The paper proposes CALMs, a new model class that enables region-specific univariate effects, combining the interpretability of GAMs with the accuracy of GA²Ms.
Findings
CALMs outperform GAMs in accuracy across diverse tasks.
CALMs achieve comparable accuracy to GA²Ms while maintaining interpretability.
The training pipeline effectively identifies homogeneous regions for modeling.
Abstract
Generalized additive models (GAMs) offer interpretability through independent univariate feature effects but underfit when interactions are present in data. GAMs add selected pairwise interactions which improves accuracy, but sacrifices interpretability and limits model auditing. We propose \emph{Conditionally Additive Local Models} (CALMs), a new model class, that balances the interpretability of GAMs with the accuracy of GAMs. CALMs allow multiple univariate shape functions per feature, each active in different regions of the input space. These regions are defined independently for each feature as simple logical conditions (thresholds) on the features it interacts with. As a result, effects remain locally additive while varying across subregions to capture interactions. We further propose a principled distillation-based training pipeline that identifies homogeneous regions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
