Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity
Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett

TL;DR
This paper introduces Sparse Neural Additive Models (SNAM), which combine interpretability and feature selection in deep learning through group sparsity, with theoretical guarantees and empirical validation.
Contribution
SNAM extends neural additive models with group sparsity regularization, providing theoretical analysis and practical algorithms for feature selection and interpretability.
Findings
SNAM provably converges to zero training loss.
SNAM achieves asymptotic vanishing estimation error.
SNAM can recover true feature support with proper regularization.
Abstract
Interpretable machine learning has demonstrated impressive performance while preserving explainability. In particular, neural additive models (NAM) offer the interpretability to the black-box deep learning and achieve state-of-the-art accuracy among the large family of generalized additive models. In order to empower NAM with feature selection and improve the generalization, we propose the sparse neural additive models (SNAM) that employ the group sparsity regularization (e.g. Group LASSO), where each feature is learned by a sub-network whose trainable parameters are clustered as a group. We study the theoretical properties for SNAM with novel techniques to tackle the non-parametric truth, thus extending from classical sparse linear models such as the LASSO, which only works on the parametric truth. Specifically, we show that SNAM with subgradient and proximal gradient descents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsFeature Selection · Neural Additive Model
