Sparse Neural Additive Model: Interpretable Deep Learning with Feature   Selection via Group Sparsity

Shiyun Xu; Zhiqi Bu; Pratik Chaudhari; Ian J. Barnett

arXiv:2202.12482·stat.ML·February 28, 2022

Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity

Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett

PDF

Open Access

TL;DR

This paper introduces Sparse Neural Additive Models (SNAM), which combine interpretability and feature selection in deep learning through group sparsity, with theoretical guarantees and empirical validation.

Contribution

SNAM extends neural additive models with group sparsity regularization, providing theoretical analysis and practical algorithms for feature selection and interpretability.

Findings

01

SNAM provably converges to zero training loss.

02

SNAM achieves asymptotic vanishing estimation error.

03

SNAM can recover true feature support with proper regularization.

Abstract

Interpretable machine learning has demonstrated impressive performance while preserving explainability. In particular, neural additive models (NAM) offer the interpretability to the black-box deep learning and achieve state-of-the-art accuracy among the large family of generalized additive models. In order to empower NAM with feature selection and improve the generalization, we propose the sparse neural additive models (SNAM) that employ the group sparsity regularization (e.g. Group LASSO), where each feature is learned by a sub-network whose trainable parameters are clustered as a group. We study the theoretical properties for SNAM with novel techniques to tackle the non-parametric truth, thus extending from classical sparse linear models such as the LASSO, which only works on the parametric truth. Specifically, we show that SNAM with subgradient and proximal gradient descents…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsFeature Selection · Neural Additive Model