NIMO: a Nonlinear Interpretable MOdel
Shijian Xu, Marcello Massimo Negri, Volker Roth

TL;DR
NIMO is a novel framework that combines the interpretability of linear models with the expressive power of neural networks, providing faithful and intelligible feature effects without sacrificing predictive accuracy.
Contribution
NIMO introduces an optimization method based on parameter elimination for neural networks that ensures inherent interpretability and flexibility.
Findings
Provides faithful and intelligible feature effects
Maintains competitive predictive performance
Incorporates sparsity through adaptive ridge regression
Abstract
Deep learning has achieved remarkable success across many domains, but it has also created a growing demand for interpretability in model predictions. Although many explainable machine learning methods have been proposed, post-hoc explanations lack guaranteed fidelity and are sensitive to hyperparameter choices, highlighting the appeal of inherently interpretable models. For example, linear regression provides clear feature effects through its coefficients. However, such models are often outperformed by more complex neural networks (NNs) that usually lack inherent interpretability. To address this dilemma, we introduce NIMO, a framework that combines inherent interpretability with the expressive power of neural networks. Building on the simple linear regression, NIMO is able to provide flexible and intelligible feature effects. Relevantly, we develop an optimization method based on…
Peer Reviews
Decision·ICLR 2026 Poster
- The authors take the definition of MEM seriously and introduce a novel and clever method that enhances MEM interpretability. - The Introduction, as well as Sections 3.1 and 3.2, are particularly clear and easy to follow. - NIMO can model non-linear relationships while providing linear interpretations at any sample x. - The toy model in Section 3.4 is excellent for illustrative purposes. - NIMO is competitive with other interpretable models in terms of performance, while offering intriguing pro
* The limitations of this model are not analyzed in great detail. I appreciate the analysis of what this Nimo can do equally as much as the discussion of the limitations. However the limitations are only discussed in a single sentence. Addressing these problems convincingly e.g. by experiments that clearly display key limitations, is crucial imo. * The authors themselves acknowledge the importance of both local and global explanations, yet they do not provide any local explanations, but only glo
The optimization algorithm consists of expressing the linear coefficients as the closed-form solution of the loss minimization, and then back-propagate through this closed-form solution to get the gradient of the neural network parameters. This is a novel theoretical and practical contribution. The sections that describe the theory and algorithm behind the approach (sections 3.3 and Appendix A, B, C) are very clear and well written. The method is compared on a variety of tabular datasets : sma
## The MEM The main motivation behind the proposed architecture is that the Marginal Effect at the Mean (MEM)\ $$ \frac{\partial f(x)}{\partial x\_i}\bigg\vert\_{x=\bar{x}}$$ is a useful measure for interpreting the model. Indeed, the main motivation behind architecture is that the learned coefficients $\beta$ will coincide with the MEM. Hence, the merits of the MEM should be discussed in more depth. For instance, why should we use this measure of feature importance instead of the mean derivati
- S1: The authors tackle an important problem - S2: The authors approach is novel - S3: The authors' optimization approach seems generally reasonable
- W1: The main weakness seems to be that the model does not provide significant global interpretability, as the model coefficients are a nonlinear function of the input. Thus, the model's provides interpretability similar to instance-level posthoc methods such as LIME or SHAP. The authors should provide a deeper discussion of this and a quantitative comparison showing a scenario where NIMO can provide more interpretability than post-hoc baselines (rather than just a qualitative comparison). - W2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
