Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects

Vasilis Gkolemis; Loukas Kavouras; Dimitrios Kyriakopoulos; Konstantinos Tsopelas; Dimitrios Rontogiannis; Giuseppe Casalicchio; Theodore Dalamagas; Christos Diou

arXiv:2602.16503·cs.LG·February 19, 2026

TL;DR

CALMs are a new model class that balances interpretability and accuracy: each feature's effect is allowed to vary across regions of the input space, which captures interactions while every effect stays univariate.

Contribution

The paper proposes CALMs, a new model class that enables region-specific univariate effects, combining the interpretability of GAMs with the accuracy of GA²Ms.
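One way to read the contrast between the three model classes is through their functional forms. The sketch below assumes an identity link, and the notation (intercept $c$, interaction set $\mathcal{S}$, regions $\mathcal{R}_{j,k}$, region counts $K_j$) is an illustrative assumption rather than the paper's own:

```latex
\begin{align*}
\text{GAM:}\quad
  & f(x) = c + \sum_{j=1}^{d} f_j(x_j) \\
\text{GA}^2\text{M:}\quad
  & f(x) = c + \sum_{j=1}^{d} f_j(x_j)
         + \sum_{(i,j)\in\mathcal{S}} f_{ij}(x_i, x_j) \\
\text{CALM:}\quad
  & f(x) = c + \sum_{j=1}^{d} \sum_{k=1}^{K_j}
         \mathbb{1}\!\left[x_{-j} \in \mathcal{R}_{j,k}\right] f_{j,k}(x_j)
\end{align*}
```

Under this reading, a CALM with $K_j = 1$ for every feature reduces to a GAM, and every term remains a one-dimensional curve that can be plotted and inspected for its region.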

Findings

1. CALMs outperform GAMs in accuracy across diverse tasks.

2. CALMs achieve comparable accuracy to GA²Ms while maintaining interpretability.

3. The training pipeline effectively identifies homogeneous regions for modeling.

Abstract

Generalized additive models (GAMs) offer interpretability through independent univariate feature effects but underfit when interactions are present in the data. GA²Ms add selected pairwise interactions, which improves accuracy but sacrifices interpretability and limits model auditing. We propose Conditionally Additive Local Models (CALMs), a new model class that balances the interpretability of GAMs with the accuracy of GA²Ms. CALMs allow multiple univariate shape functions per feature, each active in a different region of the input space. These regions are defined independently for each feature as simple logical conditions (thresholds) on the features it interacts with. As a result, effects remain locally additive while varying across subregions to capture interactions. We further propose a principled distillation-based training pipeline that identifies homogeneous regions…
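To make the abstract's description concrete, here is a minimal sketch of how a prediction from such a model could be computed. It is not the authors' implementation: the function `calm_predict`, the `features` data structure, and the toy shape functions and thresholds are all hypothetical, chosen only to illustrate "several univariate shape functions per feature, each active in a region defined by thresholds on other features".

```python
import numpy as np

def calm_predict(X, intercept, features):
    """Sum region-specific univariate effects (hypothetical sketch).

    features maps a feature index j to a list of (condition, shape_fn) pairs.
    condition(X) returns a boolean mask over rows (the region), and
    shape_fn is the univariate effect of feature j inside that region.
    """
    X = np.asarray(X, dtype=float)
    out = np.full(X.shape[0], intercept, dtype=float)
    for j, regions in features.items():
        for condition, shape_fn in regions:
            mask = condition(X)
            # Only rows inside the region receive this shape function's effect.
            out[mask] += shape_fn(X[mask, j])
    return out

# Toy model (assumed values): the effect of feature 0 depends on a
# threshold on feature 1, while feature 1 has a single global effect.
features = {
    0: [
        (lambda X: X[:, 1] <= 0.5, lambda x: 2.0 * x),
        (lambda X: X[:, 1] > 0.5, lambda x: -1.0 * x),
    ],
    1: [
        (lambda X: np.ones(X.shape[0], dtype=bool), lambda x: x ** 2),
    ],
}

X = np.array([[1.0, 0.0], [1.0, 1.0]])
preds = calm_predict(X, 0.5, features)
```

Both rows share the same value of feature 0, yet its contribution differs (+2.0 versus -1.0) because the rows fall on opposite sides of the threshold on feature 1. Within each region the model is still a plain sum of one-dimensional effects.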

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning